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Preface 


Algebra is used by virtually all mathematicians, be they analysts, combinatorists, com¬ 
puter scientists, geometers, logicians, number theorists, or topologists. Nowadays, ev¬ 
eryone agrees that some knowledge of linear algebra, groups, and commutative rings is 
necessary, and these topics are introduced in undergraduate courses. We continue their 
study. 

This book can be used as a text for the first year of graduate algebra, but it is much more 
than that. It can also serve more advanced graduate students wishing to learn topics on 
their own; while not reaching the frontiers, the book does provide a sense of the successes 
and methods arising in an area. Finally, this is a reference containing many of the standard 
theorems and definitions that users of algebra need to know. Thus, the book is not only an 
appetizer, but a hearty meal as well. 

When I was a student, Birkhoff and Mac Lane’s A Survey of Modern Algebra was the 
text for my first algebra course, and van der Waerden’s Modern Algebra was the text for 
my second course. Both are excellent books (I have called this book Advanced Modern 
Algebra in homage to them), but times have changed since their first appearance; Birkhoff 
and Mac Lane’s book first appeared in 1941, and van der Waerden’s book first appeared 
in 1930. There are today major directions that either did not exist over 60 years ago, or 
that were not then recognized to be so important. These new directions involve algebraic 
geometry, computers, homology, and representations (A Survey of Modern Algebra has 
been rewritten as Mac Lane-Birkhoff, Algebra, Macmillan, New York, 1967, and this 
version introduces categorical methods; category theory emerged from algebraic topology, 
but was then used by Grothendieck to revolutionize algebraic geometry). 

Let me now address readers and instructors who use the book as a text for a beginning 
graduate course. If I could assume that everyone had already read my book, A First Course 
in Abstract Algebra, then the prerequisites for this book would be plain. But this is not a 
realistic assumption; different undergraduate courses introducing abstract algebra abound, 
as do texts for these courses. For many, linear algebra concentrates on matrices and vector 
spaces over the real numbers, with an emphasis on computing solutions of linear systems 
of equations; other courses may treat vector spaces over arbitrary fields, as well as Jordan 
and rational canonical forms. Some courses discuss the Sylow theorems; some do not; 
some courses classify finite fields; some do not. 

To accommodate readers having different backgrounds, the first three chapters contain 
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many familiar results, with many proofs merely sketched. The first chapter contains the 
fundamental theorem of arithmetic, congruences, De Moivre’s theorem, roots of unity, 
cyclotomic polynomials, and some standard notions of set theory, such as equivalence 
relations and verification of the group axioms for symmetric groups. The next two chap¬ 
ters contain both familiar and unfamiliar material. “New” results, that is, results rarely 
taught in a first course, have complete proofs, while proofs of “old” results are usually 
sketched. In more detail, Chapter 2 is an introduction to group theory, reviewing permuta¬ 
tions, Lagrange’s theorem, quotient groups, the isomorphism theorems, and groups acting 
on sets. Chapter 3 is an introduction to commutative rings, reviewing domains, fraction 
fields, polynomial rings in one variable, quotient rings, isomorphism theorems, irreducible 
polynomials, finite fields, and some linear algebra over arbitrary fields. Readers may use 
“older” portions of these chapters to refresh their memory of this material (and also to 
see my notational choices); on the other hand, these chapters can also serve as a guide for 
learning what may have been omitted from an earlier course (complete proofs can be found 
in A First Course in Abstract Algebra). This format gives more freedom to an instructor, 
for there is a variety of choices for the starting point of a course of lectures, depending 
on what best fits the backgrounds of the students in a class. I expect that most instruc¬ 
tors would begin a course somewhere in the middle of Chapter 2 and, afterwards, would 
continue from some point in the middle of Chapter 3. Finally, this format is convenient 
for the author, because it allows me to refer back to these earlier results in the midst of a 
discussion or a proof. Proofs in subsequent chapters are complete and are not sketched. 

I have tried to write clear and complete proofs, omitting only those parts that are truly 
routine; thus, it is not necessary for an instructor to expound every detail in lectures, for 
students should be able to read the text. 

Here is a more detailed account of the later chapters of this book. 

Chapter 4 discusses fields, beginning with an introduction to Galois theory, the inter¬ 
relationship between rings and groups. We prove the insolvability of the general polyno¬ 
mial of degree 5, the fundamental theorem of Galois theory, and applications, such as a 
proof of the fundamental theorem of algebra, and Galois’s theorem that a polynomial over 
a field of characteristic 0 is solvable by radicals if and only if its Galois group is a solvable 
group. 

Chapter 5 covers finite abelian groups (basis theorem and fundamental theorem), the 
Sylow theorems, Jordan-Holder theorem, solvable groups, simplicity of the linear groups 
PSL(2, k), free groups, presentations, and the Nielsen-Schreier theorem (subgroups of free 
groups are free). 

Chapter 6 introduces prime and maximal ideals in commutative rings; Gauss’s theorem 
that R [x] is a UFD when R is a UFD; Hilbert’s basis theorem, applications of Zorn’s lemma 
to commutative algebra (a proof of the equivalence of Zorn’s lemma and the axiom of 
choice is in the appendix), inseparability, transcendence bases, Liiroth’s theorem, affine va¬ 
rieties, including a proof of the Nullstellensatz for uncountable algebraically closed fields 
(the full Nullstellensatz, for varieties over arbitrary algebraically closed fields, is proved 
in Chapter 11); primary decomposition; Grobner bases. Chapters 5 and 6 overlap two 
chapters of A First Course in Abstract Algebra, but these chapters are not covered in most 
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undergraduate courses. 

Chapter 7 introduces modules over commutative rings (essentially proving that all 
A’-modules and R-maps form an abelian category); categories and functors, including 
products and coproducts, pullbacks and pushouts, Grothendieck groups, inverse and direct 
limits, natural transformations; adjoint functors; free modules, projectives, and injectives. 

Chapter 8 introduces noncommutative rings, proving Wedderburn’s theorem that finite 
division rings are commutative, as well as the Wedderburn-Artin theorem classifying semi- 
sirnple rings. Modules over noncommutative rings are discussed, along with tensor prod¬ 
ucts, flat modules, and bilinear forms. We also introduce character theory, using it to prove 
Burnside’s theorem that finite groups of order p m q" are solvable. We then introduce multi¬ 
ply transitive groups and Frobenius groups, and we prove that Frobenius kernels are normal 
subgroups of Frobenius groups. 

Chapter 9 considers finitely generated modules over PIDs (generalizing earlier theorems 
about finite abelian groups), and then goes on to apply these results to rational, Jordan, and 
Smith canonical forms for matrices over a field (the Smith normal form enables one to 
compute elementary divisors of a matrix). We also classify projective, injective, and flat 
modules over PIDs. A discussion of graded k-algebras, for k a commutative ring, leads to 
tensor algebras, central simple algebras and the Brauer group, exterior algebra (including 
Grassmann algebras and the binomial theorem), determinants, differential forms, and an 
introduction to Lie algebras. 

Chapter 10 introduces homological methods, beginning with semidirect products and 
the extension problem for groups. We then present Schreier’s solution of the extension 
problem using factor sets, culminating in the Schur-Zassenhaus lemma. This is followed 
by axioms characterizing Tor and Ext (existence of these functors is proved with derived 
functors), some cohomology of groups, a bit of crossed product algebras, and an introduc¬ 
tion to spectral sequences. 

Chapter 11 returns to commutative rings, discussing localization, integral extensions, 
the general Nullstellensatz (using Jacobson rings), Dedekind rings, homological dimen¬ 
sions, the theorem of Serre characterizing regular local rings as those noetherian local 
rings of finite global dimension, the theorem of Auslander and Buchsbaum that regular 
local rings are UFDs. 

Each generation should survey algebra to make it serve the present time. 

It is a pleasure to thank the following mathematicians whose suggestions have greatly 
improved my original manuscript: Ross Abraham, Michael Barr, Daniel Bump, Heng Huat 
Chan, Ulrich Daepp, Boris A. Datskovsky, Keith Dennis, Vlastimil Dlab, Sankar Dutta, 
David Eisenbud, E. Graham Evans, Jr., Daniel Flath, Jeremy J. Gray, Daniel Grayson, 
Phillip Griffith, William Haboush, Robin Hartshorne, Craig Huneke, Gerald J. Janusz, 
David Joyner, Carl Jockusch, David Leep, Marcin Mazur, Leon McCulloh, Emma Previato, 
Eric Sommers, Stephen V. Ullom, Paul Vojta, William C. Waterhouse, and Richard Weiss. 


Joseph Rotman 



Etymology 


The heading etymology in the index points the reader to derivations of certain mathematical 
terms. For the origins of other mathematical terms, we refer the reader to my books Journey 
into Mathematics and A First Course in Abstract Algebra, which contain etymologies of 
the following terms. 

Journey into Mathematics: 

n, algebra, algorithm, arithmetic, completing the square, cosine, geometry, irrational 
number, isoperimetric, mathematics, perimeter, polar decomposition, root, scalar, secant, 
sine, tangent, trigonometry. 

A First Course in Abstract Algebra: 

affine, binomial, coefficient, coordinates, corollary, degree, factor, factorial, group, 
induction, Latin square, lemma, matrix, modulo, orthogonal, polynomial, quasicyclic, 
September, stochastic, theorem, translation. 
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Things Past 


This chapter reviews some familiar material of number theory, complex roots of unity, and 
basic set theory, and so most proofs are merely sketched. 


1.1 Some Number Theory 


Let us begin by discussing mathematical induction. Recall that the set of natural numbers 
N is defined by 

N = {integers «:«>()}; 

that is, N is the set of all nonnegative integers. Mathematical induction is a technique of 
proof based on the following property of N: 

Least Integer Axiom . 1 There is a smallest integer in every nonempty subset C of N. 

Assuming the axiom, let us see that if m is any fixed integer, possibly negative, then 
there is a smallest integer in every nonempty collection C of integers greater than or equal 

to m. If m > 0, this is the least integer axiom. If m < 0, then C C {m,m + 1.—1} UN 

and 

c= (cn{/«,m + i,...,-i})u(cnN). 

If the finite set C fl {in, m + 1,..., — 1} ^ 0, then it contains a smallest integer that is, 
obviously, the smallest integer in C; if C fl {m, m + 1,..., — 1} = 0, then C is contained 
in N, and the least integer axiom provides a smallest integer in C. 

Definition. A natural number p is prime if p > 2 and there is no factorization p = ab, 
where a < p and b < p are natural numbers. 


1 This property is usually called the well-ordering principle. 
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Proposition 1.1. Every integer n > 2 is either a prime or a product of primes. 

Proof. Let C be the subset of N consisting of all those n > 2 for which the proposition 
is false; we must prove that C — 0. If, on the contrary, C is nonempty, then it contains a 
smallest integer, say, m. Since m e C, it is not a prime, and so there are natural numbers 
a and b with m — ab , a < m, and b < m. Neither a nor b lies in C, for each of them is 
smaller than m, which is the smallest integer in C, and so each of them is either prime or a 
product of primes. Therefore, m — ah is a product of (at least two) primes, contradicting 
the proposition being false for m. • 

There are two versions of induction. 

Theorem 1.2 (Mathematical Induction). Let S(n) be a family of statements, one for 
each integer n > m, where m is some fixed integer. If 

(i) S(m) is true, and 

(ii) S(n) is true implies S(n + 1) is true, 
then S(n) is true for all integers n > m. 

Proof. Let C be the set of all integers n > m for which Sin) is false. If C is empty, we 
are done. Otherwise, there is a smallest integer k in C. By (i), we have k > m, and so there 
is a statement S(k — 1). But k — 1 < k implies k — 1 f C, for k is the smallest integer in 
C. Thus, S(k — 1) is true. But now (ii) says that S(k) — S([k — 1] + 1) is true, and this 
contradicts k e C [which says that S{k) is false]. • 

Theorem 1.3 (Second Form of Induction). Let S(n ) be a family of statements, one for 
each integer n > m, where m is some fixed integer. If 

(i) S(m) is true, and 

(ii) ifS(k) is true for all k with m < k < n, then S{n) is itself true, 
then S(n) is true for all integers n > m. 

Sketch of Proof. The proof is similar to the proof of the first form. • 

We now recall some elementary number theory. 

Theorem 1.4 (Division Algorithm). Given integers a and b with a f 0, there exist 
unique integers q and r with 

b — qa + r and 0 < r < |a|. 

Sketch of Proof. Consider all nonnegative integers of the form b — na, where n e Z. 
Define r to be the smallest nonnegative integer of the form b — na, and define q to be the 
integer n occurring in the expression r — b — na. 

If qa + r — q'a + r', where 0 < r' < \a\, then |(</ — q')a\ = \r' — r |. Now 0 < 

| r' — r\ < \a\ and, if | q — q'\ f 0, then |(</ — q')a\ > \a\. We conclude that both sides 
are 0; that is, q = cf and r = rk • 
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Definition. If a and b are integers with a f 0, then the integers q and r occurring in the 
division algorithm are called the quotient and the remainder after dividing b by a. 

Warning! The division algorithm makes sense, in particular, when b is negative. A 
careless person may assume that b and —b leave the same remainder after dividing by a, 
and this is usually false. For example, let us divide 60 and —60 by 7. 

60 = 7 • 8 + 4 and - 60 = 7 • (-9) + 3 

Thus, the remainders after dividing 60 and —60 by 7 are different. 

Corollary 1.5. There are infinitely many primes. 

Proof. (Euclid) Suppose, on the contrary, that there are only finitely many primes. If 
pi, p 2 , .. ■, pk is the complete list of all the primes, define M = (pi ■ ■ ■ pi ) +1. By 
Proposition 1.1, M is either a prime or a product of primes. But M is neither a prime 
(M > pi for every i) nor does it have any prime divisor /?,, for dividing M by p t gives 
remainder 1 and not 0. For example, dividing M by pi gives M — pi(p 2 • • • Pk) + 1, so 
that the quotient and remainder are q = p 2 - ■ ■ pk and r = 1; dividing M by p 2 gives M — 
Pl(p\ Pi ■ ■ ■ Pk) + 1, so that q — p\ p 3 • ■ ■ pk and r — 1; and so forth. This contradiction 
proves that there cannot be only finitely many primes, and so there must be an infinite 
number of them. • 


Definition. If a and b are integers, then a is a divisor of b if there is an integer d with 
b — ad. We also say that a divides b or that b is a multiple of a, and we denote this by 

a | b. 

There is going to be a shift in viewpoint. When we first learned long division, we 
emphasized the quotient q\ the remainder r was merely the fragment left over. Here, we 
are interested in whether or not a given number b is a multiple of a number a, but we are 
less interested in which multiple it may be. Hence, from now on, we will emphasize the 
remainder. Thus, a \ b if and only if b has remainder r = 0 after dividing by a. 


Definition. A common divisor of integers a and b is an integer c with e | a and c | b. 
The greatest common divisorov gcd of a and b, denoted by (a, b), is defined by 

, [ 0 if a = 0 = b 

(a, b) = \ 

the largest common divisor of a and b otherwise. 


Proposition 1.6. If p is a prime and b is any integer, then 


(p, b) 


P if p | b 
1 otherwise. 


Sketch of Proof. A positive common divisor is, in particular, a divisor of the prime p, and 
hence it is p or 1. • 
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Theorem 1.7. If a and b are integers, then ( a,b ) = d is a linear combination of a and 
b; that is, there are integers s and t with d — sa + tb. 

Sketch of Proof. Let 

I — {sa + tb : s, t e Z} 

(the set of all integers, positive and negative, is denoted by Z). If I {0}, let d be the 
smallest positive integer in /; as any element of /, we have d = sa + tb for some integers 
s and t. We claim that I — {cl), the set of all multiples of d. Clearly, (cl) C /. For the 
reverse inclusion, take cel. By the division algorithm, c = qd + r, where 0 < r < d. 
Now r = c — qd e I, so that the minimality of d is contradicted if r f 0. Hence, d | c, 
c e (d), and I — (d). It follows that d is a common divisor of a and b, and it is the largest 
such. • 

Proposition 1.8. Let a and b be integers. A nonnegative common divisor d is their gcd if 
and only if c \ d for every common divisor c. 

Sketch of Proof. If d is the gcd, then d = sa + tb. Hence, if c | a and c | b, then c divides 
sa + tb — d. Conversely, if d is a common divisor with c | d for every common divisor c, 
then c < d for all c, and so d is the largest. • 

Corollary 1.9. Let I be a subset ofL such that 

(i) 0 e I; 

(ii) if a, b e I, then a — b e I; 

(iii) if a e I and q eh, then qa e I. 

Then there is a natural number del with I consisting precisely of all the multiples of d. 

Sketch of Proof. These are the only properties of the subset I in Theorem 1.7 that were 
used in the proof. • 

Theorem 1.10 (Euclid’s Lemma). If p is a prime and p \ ab, then p \ a or p \ b. More 
generally, if a prime p divides a product a\a 2 • ■ • a n , then it must divide at least one of the 
factors ai. 

Sketch of Proof. If p \ a, then ( p, a) — 1 and 1 = sp + ta. Hence, b — spb + tab is a 
multiple of p. The second statement is proved by induction on n >2. • 

Definition. Call integers a and b relatively prime if their gcd (a, b) = 1. 

Corollary 1.11. Let a, b, and c be integers. Ifc and a are relatively prime and ifc \ ab, 
then c | b. 

Sketch of Proof. Since 1 = sc + ta, we have b — scb + tab. • 



Sec. 1.1 Some Number Theory 


5 


Proposition 1.12. If p is a prime, then p | j for 0 < j < p. 

Sketch of Proof. By definition, the binomial coefficient ( p j = p\/j\(p — j)\, so that 

By Euclid’s lemma, p \ j\(p — j)\ implies p \ ( p j. • 

If integers a and b are not both 0, Theorem 1.7 identifies (a, b) as the smallest positive 
linear combination of a and b. Usually, this is not helpful in actually finding the gcd, but 
the next elementary result is an exception. 

Proposition 1.13. 

(i) If a and b are integers, then a and b are relatively prime if and only if there are 
integers s and t with I = sa + tb. 

(ii) If d — (a, b), where a and b are not both 0, then ( a/d , b/d) = 1. 

Proof, (i) Necessity is Theorem 1.7. For sufficiency, note that 1 being the smallest posi¬ 
tive integer gives, in this case, 1 being the smallest positive linear combination of a and b, 
and hence (a, b) — 1. Alternatively, if c is a common divisor of a and b, then c \ sa + tb\ 
hence, c | 1, and so c = ±1. 

(ii) Note that d ^ 0 and a/d and b/d are integers, for d is a common divisor. The equation 
d — sa + tb now gives 1 = s(a/d) + tfb/d). By part (i), (a/d, b/d) = 1. • 

The next result offers a practical method for finding the gcd of two integers as well as 
for expressing it as a linear combination. 

Theorem 1.14 (Euclidean Algorithm). Let a and b be positive integers. There is an 
algorithm that finds the gcd, d — (a, b ), and there is an algorithm that finds a pair of 
integers s and t with d — sa + tb. 

Remark. More details can be found in Theorem 3.40, where this result is proved for 
polynomials. 

To see how the Greeks discovered this result, see the discussion of antanairesis in 
Rotman, A First Course in Abstract Algebra, page 49. ◄ 

Sketch of Proof. This algorithm iterates the division algorithm, as follows. Begin with 
b — qa + r, where 0 < r < a. The second step is a — q’r + r’, where 0 < f < r; the next 
step is r = q"r’ + r", where 0 < r" < r 1 , and so forth. This iteration stops eventually, and 
the last remainder is the gcd. Working upward from the last equation, we can write the gcd 
as a linear combination of a and b. • 
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Proposition 1.15. Ifh > 2 is an integer, then every positive integer m has an expression 
in base b: There are integers d, with 0 < d, < b such that 


m — d^b k + dk-[b k 1 

moreover, this expression is unique if dk f 0. 


■ do\ 


Sketch of Proof. By the least integer axiom, there is an integer k > 0 with if < m < 
b k+l , and the division algorithm gives m = dkb k + r, where 0 < r < b k . The existence of 
b- adic digits follows by induction on m > 1. Uniqueness can also be proved by induction 
on m, but one must take care to treat all possible cases that may arise. • 

The numbers dk, dk-i,..., do are called the b-adic digits of m. 

Theorem 1.16 (Fundamental Theorem of Arithmetic). Assume that an integer a >2 
has factorizations 

a — pi ■ ■ ■ p m and a = q\ ■ ■ • q„, 

where the p ’s and q’s are primes. Then n = m and the q ’s may be reindexed so that 
qi — pi for all i. Hence, there are unique distinct primes p\ and unique integers e; > 0 
with 

e \ e n 

a = Pi ■■■Pn ■ 

Proof. We prove the theorem by induction on l, the larger of m and n. 

If l = 1, then the given equation is a = p\ — q\, and the result is obvious. For the 
inductive step, note that the equation gives p m \ qi ■ ■ - q n - By Euclid’s lemma, there is 
some i with p m \ qj. But q,, being a prime, has no positive divisors other than 1 and 
itself, so that qt — p m . Reindexing, we may assume that q n — p m . Canceling, we have 
pi ■ ■ ■ Pm -1 = q i • • • q„- 1 - By the inductive hypothesis, n — 1 = m — 1 and the q 's may 
be reindexed so that qt = pi for all i. • 


Definition. A common multiple of integers a and b is an integer c with a \ c and b \ c. 
The least common multiple or 1cm of a and b, denoted by [a, b], is defined by 


[a, b] = 


| 0 if a — 0 = b 

I the smallest positive common multiple of a and b otherwise. 


Proposition 1.17. Let a — p\ l ■ ■ ■ //" and let b — p[' ■ ■ ■ /r", where ej > 0 and f > 0 
for all i ; define 

mi = min {<?;, /;} and M-, = max {<?,-, f}. 

Then the gcd and the 1cm of a and b are given by 


Ml M n 


(a,b) = p l ■ ■ ■ p n n and [a, b] = p l ■ ■ ■ p n 

Sketch of Proof. Use the fact that p € y ■ ■ ■ p e f \ pf 1 ■ ■ ■ pjf if and only if e,- < f for 
all i. • 
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Definition. Let m > 0 be fixed. Then integers a and b are congruent modulo m , denoted 
by 

a = b mod m, 

if m | (a — b ). 


Proposition 1.18. If m > 0 is a fixed integer, then for oil integers a, b, c, 

(i) a=a mod m ; 

(ii) if a = b mod m, then b = a mod m; 

(iii) if a = b mod m and b = c mod m, then a = c mod m. 

Remark, (i) says that congruence is reflexive, (ii) says it is symmetric, and (iii) says it is 
transitive. ■* 

Sketch of Proof All the items follow easily from the definition of congruence. • 

Proposition 1.19. Let m > 0 be a fixed integer. 

(i) If o — qm + r, then a = r mod m. 

(ii) IfO < r' < r < m, then r ^ r' mod m\ that is, r and r' are not congruent mod m. 

(iii) a = b mod m if and only if a and b leave the same remainder after dividing by m. 

(iv) If m > 2, each integer a is congruent mod m to exactly one of 0, 1, ..., m — 1. 

Sketch of Proof. Items (i) and (iii) are routine; item (ii) follows after noting that 
0 < r — r' < in, and item (iv) follows from (i) and (ii). • 

The next result shows that congruence is compatible with addition and multiplication. 

Proposition 1.20. Let m > 0 be a fixed integer. 

(i) If a = a' mod m and b = b' mod m, then 

a + b = a' + b' mod m. 

(ii) If a = a' mod m and b = b' mod m, then 

ab = a'b' mod m. 

(iii) If a = b mod m, then a” = b n mod m for all n > 1. 


Sketch of Proof. All the items are routine. • 
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Earlier we divided 60 and —60 by 7, getting remainders 4 in the first case and 3 in the 
second. It is no accident that 4 + 3 = 7. If a is an integer and m > 0, let a = r mod m and 
—a = r mod m . It follows from the proposition that 

0 = —a + a = r + r mod m. 

The next example shows how one can use congruences. In each case, the key idea is to 
solve a problem by replacing numbers by their remainders. 

Example 1.21. 

(i) Prove that if a is in Z, then a 2 = 0, 1, or 4 mod 8. 

If a is an integer, then a = r mod 8, where 0 < r < 7; moreover, by Proposi¬ 
tion 1.20(iii), a 2 = r 2 mod 8, and so it suffices to look at the squares of the remainders. 


r 

0 

1 

2 

3 

4 

5 

6 

7 

r 2 

0 

1 

4 

9 

16 

25 

36 

49 

r 2 mod 8 

0 

1 

4 

1 

0 

1 

4 

1 


Table 1.1. Squares mod 8 

We see in Table 1.1 that only 0, 1, or 4 can be a remainder after dividing a perfect square 
by 8. 

(ii) Prove that n — 1003456789 is not a perfect square. 

Since 1000 = 8 • 125, we have 1000 = 0 mod 8, and so 

n = 1003456789 = 1003456 • 1000 + 789 = 789 mod 8. 

Dividing 789 by 8 leaves remainder 5; that is, n = 5 mod 8. Were n a perfect square, then 
n = 0, 1, or 4 mod 8. 

(iii) If m and n are positive integers, are there any perfect squares of the form 3'" + 3" + 1 ? 
Again, let us look at remainders mod 8. Now 3 2 = 9 = 1 mod 8, and so we can evaluate 

3'” mod 8 as follows: If m — 2k, then 3"' = 3 2k — 9 k = 1 mod 8; if m = 2& + 1, then 

yn = 3 2 k+l = gk . 3 _ 3 mod g Thus> 

m _ I 1 mod 8 if m is even: 

I 3 mod 8 if m is odd. 

Replacing numbers by their remainders after dividing by 8, we have the following possi¬ 
bilities for the remainder of 3'" + 3" + 1, depending on the parities of m and n: 

3 + 1 + 1 = 5 mod 8 
3 + 3 + 1 = 7 mod 8 
1 + 1 + 1 = 3 mod 8 
1 + 3 + 1 = 5 mod 8. 
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In no case is the remainder 0, 1, or 4, and so no number of the form 3'” + 3" + 1 can be a 
perfect square, by part (i). ■* 


Proposition 1.22. A positive integer a is divisible by 3 (or by 9) if and only if the sum of 
its ( decimal) digits is divisible by 3 (or by 9). 

Sketch of Proof. Observe that 10” = 1 mod 3 (and also that 10” = 1 mod 9). • 

Proposition 1.23. If p is a prime and a and b are integers, then 

( a + b) p = a p +b p mod p. 

Sketch of Proof. Use the binomial theorem and Proposition 1.12. • 

Theorem 1.24 (Fermat). If p is a prime, then 

a p = a mod p 

for every a in Z. More generally, for every integer k > 1, 

a 1 = a mod p. 

Sketch of Proof If a > 0, use induction on a; the inductive step uses Proposition 1.23. 
The second statement follows by induction on k > 1. • 

Corollary 1.25. Let p be a prime and let n be a positive integer. If m > 0 and if £ is the 
sum of the p-adic digits of m, then 

n m = mod p. 

Sketch of Proof. Write m in base p. and use Fermat’s theorem. • 

We compute the remainder after dividing 10 100 by 7. First, 10 100 = 3 100 mod 7. 
Second, since 100 = 2 • 7 2 + 2, the corollary gives 3 100 = 3 4 = 81 mod 7. Since 
81 = 11 x 7 + 4, we conclude that the remainder is 4. 

Theorem 1.26. If (a, m ) = 1, then, for every integer b, the congruence 

ax = b mod m 

can be solved for x; in fact, x = sb, where sa = 1 mod m is one solution. Moreover, any 
two solutions are congruent mod m. 

Sketch of Proof. If 1 = sa + tm , then b = sab + tmb. Hence, b = a(sb) mod m. If, 
also, b = ax mod m, then 0 = a(x — sb) mod m, so that m \ a(x — sb). Since (m, a) — 1, 
we have m \ (x — sb ); hence, x = sb mod m, by Corollary 1.11. • 
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Corollary 1.27. If p is a prime and a is not divisible by p, then the congruence 

ax = b mod p 


is always solvable. 

Sketch of Proof. If a is not divisible by p. then (a, p) — 1. • 

Theorem 1.28 (Chinese Remainder Theorem). If m and m' are relatively prime, then 
the two congruences 


x = b mod m 
x = b' mod m' 

have a common solution , and any two solutions are congruent mod mm'. 

Sketch of Proof. By Theorem 1.26, any solution x to the first congruence has the form 
x — sb + km for some k e Z (where 1 — sa + tin). Substitute this into the second 
congruence and solve for k. Alternatively, there are integers .v and s' with 1 = sm + s'm!, 
and a common solution is 

x — b'ms + bin's'. 

To prove uniqueness, assume that y = b mod m and y = b' mod in'. Then x — y = 
0 mod m and x — y = 0 mod m'\ that is, both m and in' divide x — y. The result now 
follows from Exercise 1.19 on page 13. • 


Exercises 


1.1 Prove that l 2 + 2 2 + • • • + rr = g«(n + 1)(2 n + 1) = j« 3 + \n 2 + g/r. 

1.2 Prove that l 3 + 2 3 + • • • + n 3 = ^n 4 + + \n 2 . 

1.3 Prove that l 4 + 2 4 + • • • + if = ^ n 4 + j« 3 — j^n. 


Remark. There is a general formula that expresses ffj— / f° r k > 1, as a polynomial in n: 

n— 1 k 


(k+l)J2 i k = n k+1 + J2 

i= 1 /—I 


k+ 1 

j 


Bjn k+1 ~ j ; 


the coefficients involve rational numbers B j, for j > 1, called Bernoulli numbers, defined by 

*- = i + T Bi 
r- 


~ — = i + y — x j\ 
-i f-; /! 

j> i 


see Borevich-Shafarevich, Number Theory, page 382. ◄ 
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1.4 Derive the formula for \ * by computing the area (n + l) 2 of a square with sides of length 
n + 1 using Figure 1.1. 

Hint. The triangular areas on either side of the diagonal have equal area. 


5 

1 

1 

1 

1 

1 


4 

1 

1 

1 

1 



3 

1 

1 

1 




2 

1 

1 





1 

1 














Figure 1.1 


1 

1 

1 

1 

1 


1 

1 

1 

1 



1 

1 

1 




1 

1 





1 







Figure 1.2 


1.5 (i) Derive the formula for ! by computing the area n(n + 1) of a rectangle with base 

n + 1 and height n, as pictured in Figure 1.2. 

(ii) ( Alhazen , ca. 965-1039) For fixed k > 1, use Figure 1.3 to prove 


(n + pyv 

1 = 1 


XE +1 

i=l 


+ 


n i 

HZ' 1 )' 


i=l t=l 


Hint. As indicated in Figure 1.3, a rectangle with height n + 1 and base X^=i ^ can 
be subdivided so that the shaded staircase has area | » , whereas the area above 

it is 


1* + (1* + 2 k ) + (1* + 2 k + 3 k ) + ■ ■ ■ + (l k + 2 k + • • • + n k ). 



Figure 1.3 
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(iii) Given the formula = b n ( n + 1), use part (ii) to derive the formula for 5Z/=i 

Hint. In Alhazen's formula, write J]" =1 = \ E/Li * 2 + j J]"_i *> an d 

then solve for 5Z;=i* 2 i n terms of the rest. 

1.6 ( Leibniz ) A function /: R —>■ R is called a C°° -function if it has an nth derivative f^ n ' ) for 
every natural number n (/^ 0) is defined to be /). If / and g are C°°-functions, prove that 

( /*) w = E(")/ w -« 6, - r) . 

r=0 W 

1.7 (Double Induction) Let S(m,n) be a doubly indexed family of statements, one for each 
m > 1 andn > 1. Suppose that 

(i) 5(1, 1) is true; 

(ii) if S(m, 1) is true, then S(m + 1, 1) is true; 

(iii) if S{m, ri) is true for all m, then 5(w, n + 1) is true for all m. 

Prove that S(m,n) is true for all m > 1 and n > 1. 

1.8 Use double induction to prove that 

(m + 1)" > mn 


for all m, n > 1. 

1.9 Prove that \/2 is irrational. 

Hint. If \fl is rational, then = alb , and we can assume that (a, b) = 1 (actually, it 
is enough to assume that at least one of a and b is odd). Squaring this equation leads to a 
contradiction. 

1.10 Prove the converse of Euclid’s lemma: An integer p > 2, which, whenever it divides a product 
necessarily divides one of the factors, must be a prime. 

1.11 Let pi, pi, P 3 , ... be the list of the primes in ascending order: p\ = 2, P 2 = 3, P 3 = 5, ... 

Define fa = p\pj ■ ■ ■ Pk + 1 f° r k>\. Find the smallest k for which fa is not a prime. 

Hint. 19 | fa, but 7 is not the smallest k. 

1.12 If d and d' are nonzero integers, each of which divides the other, prove that d' = ±d. 

1.13 Show that every positive integer m can be written as a sum of distinct powers of 2; show, 
moreover, that there is only one way in which m can so be written. 

Hint. Write m in base 2. 

1.14 If (r, a) = 1 = (/, a), prove that (rr', a) = 1. 

1.15 (i) Prove that if a positive integer n is squarefree (i.e., n is not divisible by the square of 

any prime), then Jii is irrational. 

(ii) Prove that an integer m > 2 is a perfect square if and only if each of its prime factors 
occurs an even number of times. 

1.16 Prove that s/l is irrational. 

Hint. Assume that i/l can be written as a fraction in lowest terms. 

1.17 Find the gcd d = (12327, 2409), find integers .s and t with d = 12327s + 2409L and put the 
fraction 2409/12327 in lowest terms. 



Sec. 1.1 Some Number Theory 


13 


1.18 Assume that d = sa + tb is a linear combination of integers a and b. Find infinitely many 
pairs of integers (i£, fy) with 

d = s^a + tfcb. 

Hint. If 2s + 3t = 1, then 2(s + 3) + 3(f - 2) = 1. 

1.19 If a and b are relatively prime and if each divides an integer n, then their product ab also 
divides n. 

1.20 If a > 0, prove that a(b, c) = (ab, ac ). [We must assume that a > 0 lest a(b, c) be negative.] 
Hint. Show that if k is a common divisor of ab and ac, then k \ a(b, c). 

Definition. A common divisor of integers a\, <Z 2 ,..., a n is an integer c with c \ a; for all i ; the 
largest of the common divisors, denoted by (aj, 02 , ■ ■ ■, a n ), is called the greatest common divisor. 

1.21 (i) Show that if d is the greatest common divisor of oj, aj ,.... a n , then d = H a i> where 

tj is in Z for 1 <i < n. 

(ii) Prove that if c is a common divisor of a\, aj, ■ ■ ■, a n , then c \ d. 

1.22 (i) Show that (a, b, c), the gcd of a, b, c, is equal to (a, ( b , c)). 

(ii) Compute (120, 168, 328). 

1.23 A Pythagorean triple is an ordered triple (a, b , c) of positive integers for which 

a 2 + b 2 = c 2 \ 

it is called primitive if gcd (a, b, c) = 1 . 

(i) If q > p are positive integers, prove that 

(q 2 - p 2 , 2 qp, q 2 + p 2 ) 

is a Pythagorean triple. [One can prove that every primitive Pythagorean triple (a, b, c) 
is of this type.] 

(ii) Show that the Pythagorean triple (9, 12, 15) (which is not primitive) is not of the type 
given in part (i). 

(iii) Using a calculator that can find square roots but that can display only 8 digits, prove that 

(19597501, 28397460, 34503301) 
is a Pythagorean triple by finding q and p. 

Definition. A common multiple of a \, 02 , ■ ■ ■, a n is an integer m with a, \ m for all i. The least 
common multiple, written 1 cm and denoted by \a\, « 2 , • • •, a n \, is the smallest positive common 
multiple if all aj 7 ^ 0 , and it is 0 otherwise. 

1.24 Prove that an integer M > 0 is the 1cm of a\, aj . a n if and only if it is a common multiple 

of a \, 02 ,..., a n that divides every other common multiple. 

1.25 Let a\/b\, ..., a n /b n e Q, where (aj, bj) = 1 for all i. If M = lcmjfq, ..., b n }. prove that 
the gcd of Ma\/b \, ..., Ma n /b n is 1. 

1.26 (i) Prove that [a, b](a, b) = ab, where [a, b\ is the least common multiple of a and b. 

Hint. If neither a nor b is 0, show that ab/(a, b) is a common multiple of a and b that 
divides every common multiple c of a and b. Alternatively, use Proposition 1.17. 
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1.27 (i) Find the gcd (210. 48) using factorizations into primes. 

(ii) Find (1234, 5678). 

1.28 If a and b are positive integers with (a, b) — 1, and if ab is a square, prove that both a and b 
are squares. 

Hint. The sets of prime divisors of a and b are disjoint. 

1.29 Let n = p'm, where p is a prime not dividing an integer m > 1. Prove that 


P\ 



Hint. Assume otherwise, cross multiply, and use Euclid’s lemma. 

1.30 Let m be a positive integer, and let m' be an integer obtained from m by rearranging its (dec¬ 
imal) digits (e.g., take in = 314159 and m' = 539114). Prove that m — m' is a multiple 
of 9. 

1.31 Prove that a positive integer n is divisible by 11 if and only if the alternating sum of its 
digits is divisible by 11 (if the digits of a are d £ .. .djdid^, then their alternating sunt is 
do — d[ + di — ■ ■ ■)■ 

Hint. 10 = —1 mod 11. 

1.32 (i) Prove that 10 q + r is divisible by 7 if and only if q — 2r is divisible by 7. 

(ii) Given an integer a with decimal digits • • • ^ 0 - define 

a — d^dk—i ■ • • d\ — 2dQ. 


Show that a is divisible by 7 if and only if some one of a', a", a"',... is divisible by 7. 
(For example, if a = 65464, then a’ = 6546 — 8 = 6538, a" = 653 — 16 = 637, and 
a’" = 63 — 14 = 49; we conclude that 65464 is divisible by 7.) 

1.33 (i) Show that 1000 = -1 mod 7. 

(ii) Show that if a = i'q + 1000ri + 1000 2 r2 + • • ■, then a is divisible by 7 if and only if 
r 0 ~ r l + r 2 ~ ''' is divisible by 7. 


Remark. Exercises 1.32 and 1.33 combine to give an efficient way to determine whether large 
numbers are divisible by 7. If a = 33456789123987, for example, then a = 0 mod 7 if and only if 
987 - 123 + 789 - 456 + 33 = 1230 = 0 mod 7. By Exercise 1.32, 1230 = 123 = 6 mod 7, and so 
a is not divisible by 7. ◄ 

1.34 Prove that there are no integers x, y, and z such that 

x 2 + y 2 + z 2 = 999. 

Hint. Use Example 1.21 (i). 

1.35 Prove that there is no perfect square a~ whose last two digits are 35. 

Hint. If the last digit of a 2 is 5. then a 2 = 5 mod 10; if the last two digits of a~ are 35. then 
a 2 = 35 mod 100. 

1.36 If x is an odd number not divisible by 3, prove that x 2 = 1 mod 4. 

1.37 Prove that if p is a prime and if cr = 1 mod p, then a = ±1 mod p. 

Hint. Use Euclid's lemma. 
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1.38 If (a, m ) = d. prove that ax = b mod m has a solution if and only if d \ b. 

1.39 Solve the congruence x~ = 1 mod 21. 

Hint. Use Euclid’s lemma with 21 | (a + 1)(« — 1). 

1.40 Solve the simultaneous congruences: 

(i) x = 2 mod 5 and 3x = 1 mod 8 ; 

(ii) 3x = 2 mod 5 and 2x = 1 mod 3. 

1.41 (i) Show that (a + b)" = a n + b n mod 2 for all a and b and for all n > 1. 

Hint. Consider the parity of a and of b. 

(ii) Show that (a + b ) 2 ^ a 2 + b 2 mod 3. 

1.42 On a desert island, five men and a monkey gather coconuts all day, then sleep. The first man 
awakens and decides to take his share. He divides the coconuts into five equal shares, with 
one coconut left over. He gives the extra one to the monkey, hides his share, and goes to sleep. 
Later, the second man awakens and takes his fifth front the remaining pile; he, too, finds one 
extra and gives it to the monkey. Each of the remaining three men does likewise in turn. Find 
the minimum number of coconuts originally present. 

Hint. Try —4 coconuts. 


1.2 Roots of Unity 

Let us now say a bit about the complex numbers C. We define a complex number z = a+ib 
to be the point (a , b) in the plane; a is called the real part of z and b is called its imaginary 
part. The modulus \z\ of z = a + ib — (a, b) is the distance from z to the origin: 



Proposition 1.29 (Polar Decomposition). Every complex number z. has a factorization 

Z = r(cos 9 + i sind), 
where r — |z| > 0 and 0 < 0 < 2n. 

Proof. If z — 0, then |z| = 0, and any choice of 9 works. If z = a+ib 0, then |z| 0, 

and z/\z\ — (a/|z|, b/\z\) has modulus 1 , because 

(a/\z\) 2 + (b/\z\) 2 = (a 2 + b 2 )/\z\ 2 = l. 

Therefore, there is an angle 9 (see Figure 1.4 on page 16) with z/\z\ = cos 9 + i sin0, and 
so z = |z|(cos 9 + i sin@) = r(cos9 + i sind). • 

It follows that every complex number z of modulus 1 is a point on the unit circle, and so 
it has coordinates (cos 9 , sin 9) (9 is the angle from the x-axis to the line joining the origin 
to (a, b), because cos 9 — a/ 1 and sinf? = b/\). 

If z = a + ib — r (cos 9 + i sin 9 ), then (r, 9) are the polar coordinates of z; this is the 
reason why Proposition 1.29 is called the polar decomposition of z. 

The trigonometric addition formulas for cos(0 + f) and sin(@ + i/O have a lovely trans¬ 
lation into the language of complex numbers. 
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Proposition 1.30 (Addition Theorem). If 

Z — cos 9 + i sin 9 and w = cos if + i sin if. 


then 

Proof. 


zw — cos (9 + if) + i sin(0 + if). 


zw = (cos 9 + i sin0)(cos if + i sin if) 

= (cos 9 cos if — sin 9 sin if) + i (sin 9 cos if + cos 9 sin if). 

The trigonometric addition formulas show that 

zw — cos (9 + if) + i sin(0 + if). • 


The addition theorem gives a geometric interpretation of complex multiplication. 


Corollary 1.31. Ifz and w are complex numbers with polar coordinates (r, 9) and ( s , if), 
respectively, then the polar coordinates ofzw are 2 

( rs, 9 + if). 


and so 


\zw | = \z\ \w\. 


Proof. If the polar decompositions of z and w are z = r(cos 9 + i sin 0) and w = 
s( cos if + i sin if), respectively, then 


zw — r^[cos(0 + if) + i sin(0 + if)]. • 


’This formula is correct if 8 + t/f < 2tt ; otherwise, the angle should be 9 + i// — 2 jt. 
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In particular, if |z| = 1 = \w\, then \zw\ = 1; that is, the product of two complex 
numbers on the unit circle also lies on the unit circle. 

In 1707, A. De Moivre (1667-1754) proved the following elegant result. 

Theorem 1.32 (De Moivre). For every real number x and every positive integer n, 
cos (nx) + i sin(nx) = (cos* + i sin x) n . 

Proof. We prove De Moivre’s theorem by induction on n > 1. The base step n — 1 is 
obviously true. For the inductive step, 

(cos x + i sinx)" +1 = (cosx + i sin x)" (cos x + i sinx) 

= (cos (nx) + i sin(Hx))(cosx + i sinx) 

(inductive hypothesis) 

= cos (nx + x) + i sin (nx + x) 

(addition formula) 

= cos([« + l]x) + i sin([n + l]x). • 


Corollary 1.33. 

(i) cos(2x) = cos 2 x — sin 2 x = 2 cos 2 x — 1 
sin(2x) = 2 sinx cos x. 

(ii) cos(3x) = cos 3 x — 3 cos x sin 2 x = 4 cos 3 x — 3 cos x 
sin(3x) = 3 cos 2 x sinx — sin 3 x = 3 sinx — 4 sin 3 x. 

9 

Proof, (i) cos(2x) + i sin(2x) = (cosx + i sinx)” 

= cos 2 x + 2 i sinx cosx + i 2 sin 2 x 
= cos 2 x — sin 2 x + i (2 sin x cos x). 

Equating real and imaginary parts gives both double angle formulas. 

(ii) De Moivre’s theorem gives 

cos(3x) + i sin(3x) = (cosx + i sinx) 3 

•3 9 9 9 -i 3 

= cos x + 3; cos” x sin x + 3 i cos x sin” x + i sin x 
= cos 3 x — 3 cos x sin 2 x + i (3 cos 2 x sin x — sin 3 x). 

Equality of the real parts gives cos(3x) = cos 3 x — 3 cos x sin 2 x; the second formula for 
cos(3x) follows by replacing sin 2 x by 1 — cos 2 x. Equality of the imaginary parts gives 
sin(3x) = 3 cos 2 x sinx—sin 3 x = 3 sinx—4 sin 3 x; the second formula arises by replacing 
cos 2 x by 1 — sin 2 x. • 
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Corollary 1.33 can be generalized. If fiix) = 2x~ — 1, then 


cos(2x) = 2cos“x — 1 = / 2 (cosx), 
and if fj, (x) = 4x 3 — 3x, then 

cos(3x) = 4cos 3 x — 3 cosx = foicosx). 

Proposition 1.34. For all n > 1, there is a polynomial f„(x) having all coefficients 
integers such that 

cos {nx) = /„( cosx). 

Proof. By De Moivre’s theorem, 

cos (nx) + i sin(nx) = (cosx + i sin x ) n 


The real part of the left side, cos (nx), must be equal to the real part of the right side. Now 
i r is real if and only if r is even, and so 


s (nx) = ^ 


If r = 2k , then i r = i 2k = (-If, and 


[ 5 ] 

5 (nx) = J^(-1) A ' 


cos" 2k xsin 2k x, 


where [|] is the largest integer But s\n 2k x — (sin 2 x) k — (1 — cos 2 x) k , which is a 
polynomial in cosx. This completes the proof. • 

It is not difficult to show, by induction on n > 2, that f n (x) begins with 2" _1 x". A sine 
version of Proposition 1.34 can be found in Exercise 1.49 on page 25. 


Euler’s Theorem. For all real numbers x, 

e lx = cosx + i sinx. 

The basic idea of the proof, aside from matters of convergence, is to examine the real 
and imaginary parts of the power series expansion of e' x . Using the fact that the powers of 
i repeat in cycles of length 4: 1, i, — 1, —i, 1 ..., we have 



= cosx + i sinx. 
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It is said that Euler was especially pleased with the equation 

e ni = -1; 

indeed, this formula is inscribed on his tombstone. 

As a consequence of Euler’s theorem, the polar decomposition can be rewritten in ex¬ 
ponential form: Every complex number z has a factorization 


where r > 0 and 0 < 9 < 2tc . The addition theorem and De Moivre’s theorem can be 
restated in complex exponential form. The first becomes 

e ix e iy = e Kx+y) ; 


the second becomes 

(e ix ) n = e inx . 

Definition. If n > 1 is an integer, then an nth root of unity is a complex number £ with 
= 1 . 

The geometric interpretation of complex multiplication is particularly interesting when 
Z and w lie on the unit circle, so that |z| = 1 = w\. Given a positive integer n, let 
9 — 2jz /n and let £ = e' e . The polar coordinates of £ are (1, 9). the polar coordinates of 
if 1 are (1,2 9). the polar coordinates of are (1, 39),. .., the polar coordinates of f" -1 are 
(1, (n — 1)0), and the polar coordinates of f" = 1 are (1, nO) — (1,0). Thus, the nth roots 
of unity are equally spaced around the unit circle. Figure 1.5 shows the 8th roots of unity 
(here, 9 — 2n/% = Jt/4). 


i 



Figure 1.5: 8th Roots of Unity 
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Corollary 1.35. Every nth wot of unity is equal to 

e 2^/n = C os(2|*) +J - s in(2|fc), 
for some k = 0, 1, 2, ... ,n — 1, and hence it has modulus 1. 

Proof Note that e 2?r ' = cos 2ti-\- i sinln — 1. By De Moivre’s theorem, if t; — e 2jn /" — 
cos(2it/n) + i sm(2jr/n), then 

j- n _ (glni/ny _ g 2 ni _ ^ 

so that £ is an nth root of unity. Since = 1, it follows that (£ k ) n = (£")* = l k = 1 
for all /r = 0, 1, 2,..., n — 1, so that t, k = e 2lz,k l n is also an nth root of unity. We have 
exhibited n distinct nth roots of unity; there can be no others, for it will be proved in 
Chapter 3 that a polynomial of degree n with rational coefficients (e.g., x" — 1) has at most 
n complex roots. • 

Just as there are two square roots of a number a , namely, ~/a and —-Ja, there are n 
different nth roots of a , namely, e 27Tlk ' n ffa for k — 0, 1, ..., n — 1. 

Every nth root of unity is, of course, a root of the polynomial x" — 1. Therefore, 

x n -1= n - o. 

f ,, =i 

If f is an nth root of unity, and if n is the smallest positive integer for which f" = 1, we 
say that £ is a primitive nth root of unity. Thus, i is an 8th root of unity, but it is not a 
primitive 8th root of unity; however, i is a primitive 4th root of unity. 

Lemma 1.36. If an nth root of unity $ is a primitive dth root of unity, then d must be a 
divisor ofn. 

Proof The division algorithm gives n — qd + r, where q are r are integers and the 
remainder r satisfies 0 < r < d. But 

1 £.n f.qd-\-r ^qd^r ^r 

because t; qd — ( i; d ) q — 1. If r ^ 0, we contradict d being the smallest exponent for which 
t, d = 1. Hence, n = qd, as claimed. • 

Definition. If d is a positive integer, then the c/th cyclotomic 3 polynomial is defined by 

$</(*) = Y\ (x - ^)> 

where t, ranges over all the primitive c/th roots of unity. 

The following result is almost obvious. 

3 The roots of x" — 1 are the nth roots of unity: 1, f, f 2 ,_where f = e 2ni l n = cos(2n-/n) + 

i sin(2jr/n). Now these roots divide the unit circle {f € <C : |z| = 1} into n equal arcs (see Figure 1.5 on 
page 19). This explains the term cyclotomic, for its Greek origin means “circle splitting." 



Sec. 1.2 Roots of Unity 


21 


Proposition 1.37. For every integer n > 1, 

x" -!=]"[ <t> d (x), 

d\n 

where d ranges overall the divisors d ofn [in particular, <I>i(x) and 4>„(x) occur]. 

Proof. In light of Corollary 1.35, the proposition follows by collecting, for each divisor d 
of n, all terms in the equation x" — 1 = ]~[(x — £) with £ a primitive d/th root of unity. • 

For example, if p is a prime, then x p — 1 = <t>] fx)0 ; , (x). Since O ] (x) = x — 1, it 
follows that 

$> p (x) = x p ~ l +x p ~ 2 H-hx + 1. 

Definition. Define the Euler 0 -function as the degree of the nth cyclotomic polynomial: 

000 = deg(<J>„(x)). 

We now give another description of the Euler 0-function that does not depend on roots 
of unity. 

Proposition 1.38. If n > 1 is an integer, then <p(n) is the number of integers k with 
l < k < n and ( k, n) = 1. 

Proof. It suffices to prove that e 2:Tll< /" j s a primitive nth root of unity if and only if k and 
n are relatively prime. 

If k and n are not relatively prime, then n — dr and k — ds, where d , r, and ,v are 
integers, and d > 1; it follows that r < n. Hence, ^ ^ so that (e 2lz,k l n ) r — 

(f 2nls l r y — 1, and hence e 2nlk ^ n is not a primitive nth root of unity. 

Conversely, suppose that C, — e 2jIlk / n is not a primitive nth root of unity. Lemma 1.36 
says that f must be a d/th root of unity for some divisor d of n with d < n; that is, there is 
1 < m < d with 

£ ^2nik/n ^2 nim/d ^2nimr/dr ^Inimr/n 

Since both k and mr are in the range between 1 and n, it follows that k — mr fif 0 < 
x, y < 1 and e l7Tlx — e 2niy , then x = y)\ that is, r is a divisor of k and of n, and so k and 
n are not relatively prime. • 

Corollary 1.39. For every integer n > 1, we have 

n = 0 (d/). 

d\n 

Proof. Note that 0 (n) is the degree of <t>„ (x), and use the fact that the degree of a product 
of polynomials is the sum of the degrees of the factors. • 

Recall that the leading coefficient of a polynomial /(x) is the coefficient of the high¬ 
est power of x occurring in /(x); we say that a polynomial /(x) is monic if its leading 
coefficient is 1. 
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Proposition 1.40. For every positive integer n, the cyclotomic polynomial <t>„ (x ) is a 
monic polynomial all of whose coefficients are integers. 

Proof The proof is by induction on n > 1. The base step holds, for Oj (x) = x — 1. For 
the inductive step, we assume that <t>,/(x) is a monic polynomial with integer coefficients. 
From the equation x" — 1 = \\ d <3><;(x), we have 

x n - 1 = <J>„ (*)/(*), 

where /(x) is the product of all 4></(x), where d < n and d is a divisor of n. By the 
inductive hypothesis, /(x) is a monic polynomial with integer coefficients. Because /(x) 
is monic, long division (i.e., the division algorithm for polynomials) shows that all the 
coefficients of <t>„(x) = (x" — l)//(x) are also integers , 4 as desired. • 

The following corollary will be used in Chapter 8 to prove a theorem of Wedderburn. 

Corollary 1.41. If q is a positive integer, and ifd is a divisor of an integer n with d < n, 
then <J >„{q) is a divisor of both q n — 1 and (q n — 1 )/{q d — 1 ). 

Proof We have already seen that x" — 1 = <t>„(x)/(x), where /(x) is a monic poly¬ 
nomial with integer coefficients. Setting x — q gives an equation in integers: q" — 1 = 
d),, (q) f(q)\ that is, <t> n (q) is a divisor of q n — 1 . 

If d is a divisor of n and d < n, consider the equation x d — \ = \\{x — if), where f 
ranges over the c/th roots of unity. Notice that each such if is an nth root of unity, because 
d is a divisor of n. Since d < n, collecting terms in the equation x" — 1 = ]~[(x — f) gives 

x" - 1 = <t>„(x)(x d - l)g(x), 

where g(x) is the product of all the cyclotomic polynomials <t^(x) for all divisors S of n 
with 8 < n and with 8 not a divisor of d. It follows from the proposition that g(x) is a 
monic polynomial with integer coefficients. Therefore, g(q) e Z and 

x" - 1 

— -- = 0 /? (x)g(x) 

x a — 1 

gives the result. • 

Here is the simplest way to find the reciprocal of a complex number. If z = a + ib e C, 
where a, b e R, define its complex conjugate z — a — ib. Note that zz, = a 2 + b 2 = \z\ 2 , 
so that 0 if and only if zz ^ 0. If z ^ 0, then 

z~ l — 1 /z — z/zz — ( a/zz ) — i(b/zz)\ 

that is, 

__ 

4 If this is not clear, look at the proof of the division algorithm on page 131. 


1 / a 

a + ib \a 2 + b 2 



Sec. 1.2 Roots of Unity 


23 


If |z| = 1, then z~ l = Z- In particular, if z is a root of unity, then its reciprocal is its 
complex conjugate. 

Complex conjugation satisfies the following identities: 

z + w = z + ui\ 

Tw — zw; 
z = z; 

z = Z if and only if z is real. 

We are regarding complex numbers as points in the plane and, as in vector calculus, a 
point z is identified with the vector represented by the arrow Oz from the origin O to z. 
Let us define the dot product of z = a + ib and w = c + id to he 

z ■ w = ac + bd. 

Thus, z ■ w — |z||ty| cos#, where 9 is the angle between Oz and Ow [since cos 6 — 
cos(2;r — 9), it makes no difference whether 9 is measured from Oz to Ow or from Ow 
to Oz\. Note that 

z-z = \z\ 2 = zz. 

It is clear that z ■ w — w ■ z, and it is easy to check that 

z ■ (w + w') = z ■ w + z ■ u/ 
for all complex numbers z, w, and w'. 

The following result will be used in Chapter 8 to prove a theorem of Burnside. 
Proposition 1.42. If s\, ..., e n are wots of unity, where n > 2, then 

n 

<Y.\ s i\= n - 

7 = 1 7 = 1 

Moreover, there is equality if and only if all the Sj are equal. 

Proof. The proof of the inequality is by induction on n > 2. The base step follows from 
the triangle inequality, for all complex numbers u and v, 

\u + v\ < \u\ + |u|. 

The proof of the inductive step is routine, for roots of unity have modulus 1. 

Suppose now that all the sj are equal, say sj — e for all then it is clear that there 
is equality |£^ =1 Sj — \ne\ = n\e\ = n. The proof of the converse is by induction on 
n > 2. For the base step, suppose that |ei + £ 2 ! —2.. Using the dot product, we have 

4 = |£i + £ 2 1~ 

= (£i + £2) • (£1 + £2) 

= |£i| 2 + 2fii • £2 + 1^21 2 
— 2 + 2e\ ■ £ 2 . 
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Hence, 2 = 1 + ei ■ S 2 , so that 


1 = £l ' E2 
— |e 1 11e 2 1 cost? 
= cos 9, 


where 0 is the angle between Oe\ and 0s2 (for |ei| = 1 = |£ 2 |). Therefore, 9 = 0 or 
6 — it, so that £2 = ±£i. Since £2 = —£1 gives |£i + £ 2 ! = 0, we must have £2 = £ 1 . 

For the inductive step, suppose that £j — n + 1. If |£”=l e ;l < n, then the 

triangle inequality gives 


(X» 


+ £»+l 


7=1 


+ 1 < n + 1, 


contrary to hypothesis. Therefore, |^" =1 £j| = n, and so the inductive hypothesis gives 
£ 1 , ..., £„ all equal, say, to co. Hence, YTj=i s j — w<w, and so 

I nco + £„+i| = n + 1. 


The argument concludes as that of the base step: 

in + l) 2 = (nco + £„+i) • (nco + £„+i) 

— n~ 2nco • £/i-(-i T 1, 

so that 1 — co ■ £„+i = |ftj||£„ + i| cost), where 9 is the angle between Oco and Oe„+i. 
Hence, co = ±£„ + i, and co — £„+i. • 


Exercises 


1.43 Evaluate (cos 3° + i sin3°) 40 . 

1.44 (i) Find (3 +40/(2-t). 

(ii) If 7 = re'®, prove thatc -1 = r~ l e~ is . 

(iii) Find the values of *Ji. 

(iv) Prove that e' 0 /" is an nth root of e is . 

1.45 Find <J> 6 (x). 

1.46 If o' is a number for which cos(jra) = j (where the angle no/ is in radians), prove that a is 
irrational. 

Hint. If a = j-, evaluate cos nna + i sin nna using De Moivre’s theorem. 

1.47 Let f{x) = tig + a\x + aix" + • • • + a n x n be a polynomial with all of its coefficients real 
numbers. Prove that if z is a root of fix), then z is also a root of fix). 
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1.48 (i) Prove that the quadratic formula holds for quadratic polynomials with complex coeffi¬ 

cients. 

(ii) Find the roots of x 2 + (2 + i)x + 2 i. Why aren’t the roots complex conjugates of one 
another? 

1.49 Prove that for every odd integer n > 1, there is a polynomial g n ( x ) with integer coefficients, 
such that 

sin nx = g„(sin.v). 

1.50 Every Pythagorean triple (a, h, c) determines a right triangle having legs a and b and hy¬ 
potenuse 5 c. Call two Pythagorean triples (a, b, c) and (a', b' , c') similar if the right triangles 
they determine are similar triangles; that is, if corresponding sides are proportional. 

(i) Prove that the following statements are equivalent for Pythagorean triples (a, b, c) and 
(a\ b' , c'). 


1.51 


(1) (a, b, c) and (a', b’ , c’) are similar. 

(2) There are positive integers m and l with ( ma, mb, me) = (la 1 , lb', lc') 

(3) § + '! = fr + i^r- 

(ii) Prove that every Pythagorean triple is similar to a primitive Pythagorean triple. 


(i) Call a complex number of modulus 1 rational if both its real and imaginary parts are 
rational numbers. If ^ + i | is a rational complex number with both a and b nonzero, 
prove that (|a|, \b\, |c|) is a Pythagorean triple. 


(ii) Prove that the product of two rational complex numbers is also a rational complex num¬ 
ber, and use this fact to define a product of two Pythagorean triples (up to similarity). 
What is the product of (3, 4, 5) with itself? 

(iii) Show that the square of a Pythagorean triple (a, b, c) is (a 2 — b 2 , 2ab, a 2 + b 2 ). 


1.3 Some Set Theory 

Functions are ubiquitous in algebra, as in all of mathematics, and we discuss them now. 
A set A is a collection of elements (numbers, points, herring, etc.); we write 

x e X 

to denote x belonging to X. Two sets X and Y are defined to be equal, denoted by 


if they are comprised of exactly the same elements; for every element x, we have x e X if 
and only if x e Y. 

A subset of a set A is a set S each of whose elements also belongs to A: If s e S, then 
s e X. We denote S being a subset of A by 

S C A; 

5 Hypotenuse comes from the Greek word meaning "to stretch.” 
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synonyms are “S is contained in X” and “S is included in XT Note that X C X is always 
true; we say that a subset S of X is a proper subset of X , denoted by S C X, if S C X and 
S X. It follows from the definitions that two sets X and Y are equal if and only if each 
is a subset of the other: 

X — Y if and only if X c Y and Y C. X. 

Because of this remark, many proofs showing that two sets are equal break into two parts, 
each half showing that one of the sets is a subset of the other. For example, let 

X — {a e R : a > 0} and Y — {r 2 : r e R], 

If a € X, then a > 0 and a = r 2 , where r — */a; hence, a e Y and X c Y. For the 
reverse inclusion, choose r 2 e Y. If r > 0, then r 2 > 0; if r < 0, then r = —s, where 
s > 0, and r 2 = (—l) 2 s 2 = ,v 2 > 0. In either case, r 2 > 0 and r 2 e X. Therefore, Y C. X, 
and so X = Y. 

Calculus books define a function fix) as a “rule” that assigns, to each number a, exactly 
one number, namely, f (a). This definition is certainly in the right spirit, but it has a defect; 
What is a rule? To ask this question another way, when are two rules the same? For 
example, consider the functions 

f(x) = (x + T) 2 and g(x) = x 2 + 2x + 1. 

Is f(x) = g (x ) ? The evaluation procedures are certainly different: for example, /(6) = 
(6 + l) 2 = 7 2 , while gi 6) = 6 2 + 2 • 6 + 1 = 36+12+1. Since the term rule has not been 
defined, it is ambiguous, and our question has no answer. Surely the calculus description 
is inadequate if we cannot decide whether these two functions are the same. 

The graph of a function is a concrete thing [for example, the graph of fix) — x 2 is 
a parabola], and the upcoming formal definition of a function amounts to saying that a 
function is its graph. The informal calculus definition of a function as a rule remains, but 
we will have avoided the problem of saying what a rule is. In order to give the definition, we 
first need an analog of the plane [for we will want to use functions fix) whose argument 
x does not vary over numbers]. 

Definition. If X and Y are (not necessarily distinct) sets, then their cartesian product 
X x Y is the set of all ordered pairs (x, y ), where x e X and y e Y. 

The plane is R x R. 

The only thing we need to know about ordered pairs is that 

(x, y) — (x\ y') if and only if x — x and y — y' 

(see Exercise 1.62 on page 37). 

Observe that if X and Y are finite sets, say, \X\ = m and Y\ = n (we denote the number 
of elements in a finite set X by |X|), then \X x F| = mn. 
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Definition. Let A and Y be (not necessarily distinct) sets. A function f from X to Y. 
denoted by 6 

f- X —*■ Y, 

is a subset / C X x Y such that, for each a e X, there is a unique b e Y with (a, b ) e /. 

For each a e X, the unique element b e Y for which (a, b) e f is called the value of 
/ at a , and b is denoted by f (a). Thus, / consists of all those points in X x Y of the form 
(a, f(a)). When / : R —> R, then / is the graph of fix). 

Call X the domain of /, call Y the target (or codomain) of /, and define the image (or 
range) of /, denoted by im /, to be the subset of Y consisting of all the values of /. 

Definition. Two functions /: X —> Y and g : X' —> Y' are equal if X — X',Y — Y', 
and the subsets / C X x Y and g C X' x Y' are equal. 

For example, if A is a set, then the identity function 1 x ■ X —> A is defined by 1 x(x) = 
x for all x e A; if A = R, then 1 r is the line with slope 1 that passes through the origin. 
If /: A —> Y is a function, and if 5 is a subset of A, then the restriction of / to S is the 
function f\S: S —> Y defined by (/|S)(s) = / (s) for all s € S. If S is a subset of a set A, 
define the inclusion i: S -> A to be the function defined by i(s ) = ,v for all ,v e S. If S is 
a proper subset of A, then the inclusion i is not the identity function 1 y because its target 
is A, not S ; it is not the identity function lx because its domain is .S', not A. 

A function /: A —» Y has three ingredients: its domain A, its target Y. and its graph, 
and we are saying that two functions are equal if and only if they have the same domains, 
the same targets, and the same graphs. 

It is plain that the domain and the graph are essential parts of a function. Why should 
we care about the target of a function when its image is more important? 

As a practical matter, when first defining a function, we usually do not know its image. 
For example, we say that /: R —> R, given by f(x) — x 2 + 3x — 8, is a real-valued 
function, and we then analyze / to find its image. But if targets have to be images, then 
we could not even write down /: A —» Y without having first found the image of / 
(and finding the precise image is often very difficult, if not impossible); thus, targets are 
convenient to use. 

In linear algebra, we consider a vector space V and its dual space V* = {all linear 
functionals on V} (which is also a vector space). Moreover, every linear transformation 
T: V —» W defines a linear transformation 

r*: W* -* V*, 

and the domain of T*, being W*, is determined by the target W of T. (In fact, if a matrix 
for T is A, then a matrix for T* is A' , the transpose of A.) Thus, changing the target of T 
changes the domain of T*, and so T* is changed in an essential way. 

Atoiti now on, we denote a function by / instead of by f(x). The notation fix) is reserved for the value of 
/ at x; there are a few exceptions: We will continue to write sin x, e x . and x~. for example. 
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Proposition 1.43. Let f : A —> Y and g: X —>■ Y be functions. Then f — g if and only 
if f (a) = g(a) for every a e X. 

Remark. This proposition resolves the problem raised by the ambiguous term rule. If /, 
g : R. —>■ R. are given by /(x) = (x + l) 2 and g(x) = x 2 + 2x + 1, then f — g because 
f(a ) = g(a ) for every number a. ◄ 

Proof. Assume that / = g. Functions are subsets of X x Y. and so f = g means that 
each of / and g is a subset of the other (informally, we are saying that / and g have the 
same graph). If a e X, then (a, f (a)) e / = g, and so (a, f(a)) e g. But there is 
only one ordered pair in g with first coordinate a. namely, (a, g(a )) (because the definition 
of function says that g gives a unique value to a). Therefore, (a, f(a )) = (a, g(a )), and 
equality of ordered pairs gives f(a) = g(a), as desired. 

Conversely, assume that f (a) — g(a ) for every a e X . To see that / = g, it suffices 
to show that / C g and g C f. Each element of / has the form (a, f{a)). Since 
f(a) = g(a). we have (a. f{a)) = (a, g(a)), and hence (a, f (a)) e g. Therefore, / C g. 
The reverse inclusion g C /'is proved similarly. • 

We continue to regard a function / as a rule sending x e X to /(x) e Y, but the 
precise definition is now available whenever we need it, as in the proof of Proposition 1.43. 
However, to reinforce our wanting to regard functions /: X —»• Y as dynamic things 
sending points in X to points in Y . we often write 

f'.xv+y 

instead of /(x) = y. For example, we may write /: x x 2 instead of /(x) = x 2 , and 
we may describe the identity function by x i->- x for all x. 

Instead of saying that values of a function / are unique, we usually say that / is well- 
defined (or single-valued). Does the formula g(a/b ) = ab define a function g: Q —> Q? 
There are many ways to write a fraction; since \ — g, we see that g(j) = 1 • 2 ^ 3 • 6 = 

g and so g is not a function because it is not well-defined. Had we said that the 
formula g(a/b) = ab holds whenever a/b is in lowest terms, then g would be a function. 

The formula f(a/b) = 3 a/b does define a function / : Q —> Q, for it is well-defined: 
If a/b — a'/b', we show that 

f(a/b) = 3 a/b = 3a’/b' = f(a’/b’)\ 


a/b — a'/b’ gives ab' — a’b. so that 3ab’ — 3a'b and 3a/b — 3a’/b’. Thus, / is a bona 
fide function; that is, / is well-defined. 

Example 1.44. 

Our definitions allow us to treat a degenerate case. If A is a set, what are the functions 
X -» 0? Note first that an element of A x 0 is an ordered pair (x, y) with x e A and 
y e 0; since there is no y € 0, there are no such ordered pairs, and so A x 0 = 0. Now 
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a function X —>■ 0 is a subset of X x 0 of a certain type; but X x 0 = 0, so there is only 
one subset, namely 0, and hence at most one function, namely, / = 0. The definition of 
function A —> 0 says that, for each x e X, there exists a unique y e 0 with (x, y) e /. 
If X ^0, then there exists x e A" for which no such y exists (there are no elements y 
at all in 0), and so / is not a function. Thus, if X ^ 0, there are no functions from X 
to 0. On the other hand, if X — 0, then / = 0 is a function. Otherwise, the negation 
of the statement “/ is a function” begins “there exists x e 0, etc.” We need not go on; 
since 0 has no elements in it, there is no way to complete the sentence so that it is a true 
statement. We conclude that / = 0 is a function 0 —> 0, and we declare it to be the 
identity function 1 0 . ■* 

The special case when the image of a function is the whole target has a name. 
Definition. A function /: X —» K is a surjection (or is onto) if 

im f = Y. 

Thus, / is surjective if, for each y e Y, there is some x e X (probably depending on y) 
with y = f(x). 

The following definition gives another important property a function may have. 

Definition. A function /: X —> Y is an injection (or is one-to-one ) if, whenever a and 
a' are distinct elements of X , then f(a) / / (a'). Equivalently (the contrapositive states 
that) / is injective if, for every pair a, a' e X, we have 

f (a) — f(a') implies a — a'. 

The reader should note that being injective is the converse of being well-defined: / 
is well-defined if a = a' implies f(a) — f is injective if f (a) = f(a') implies 

a = a'. 

There are other names for these functions. Surjections are often called epimorphisms 
and injections are often called monomorphisms. The notation A -» B is used to denote a 
suijection, and the notations A ^ B or A >—» B are used to denote injections. However, 
we shall not use this terminology or these notations in this book. 

Example 1.45. 

Consider the function /; M -* K., given by f(x) — 3x —4. To see whether / is surjective, 
take y e R and ask whether there is a e R with y = 3a — 4. We solve to obtain 
a — j(y + 4), and we conclude that / is surjective. Also, the function / is injective, for if 
3a — 4 — 3b — 4, then a — b. 

As a second example, consider the function g: R — {1} —> R given by 

3x -4 


8 O') = 


x — l 
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Now g is an injection, for if (3a— 4) /(a— 1) = (3 b — A)/ (b — 1), then cross multiplying 
gives a = b. On the other hand, g is not surjective. Given y e R, is there a e R with 
y — (3a — 4 )/(a — 1)? Solving, a = (4 — y)/(3 — y). This suggests that y = 3 is not a 
value of g, and, indeed, it is not: 3 = (3a — 4)/ (a — 1) is not solvable. < 


Definition. If /: X —> Y and g: Y —> Z are functions (note that the target of / is equal 
to the domain of g ), then their composite , denoted by g o /, is the function X -> Z given 
by 

g o f: x i-» g(f(x)); 

that is, first evaluate / on x, and then evaluate g on f (x). 

The chain rule in calculus is a formula for the derivative (g o /)' in terms of g' and /': 

(g ° fY = [g' ° /] • /'• 

For example, 

, 1 
(sin(lnv)) = cos(lnx) • —. 

A' 

Given a set X, let 

T(X) = {all functions X —>■ X}. 

We have just seen that the composite of two functions in T(X) is always defined; moreover, 
the composite is again a function in T{X). We may thus regard T(X) as being equipped 
with akind of multiplication. This multiplication is not commutative ; that is, fog and gof 
need not be equal. For example, if f(x) — x+1 and g(x) — v 2 , then fog: 1 1 2 +1 = 2 

while gof: 1 i-> (1 + l) 2 = 4; therefore, / o g g o f. 

Lemma 1.46. 

(i) Composition is associative: If 

f: X Y, g : Y Z, and h : Z W 


are functions, then 

ho(go f) = (h og)o f. 

(ii) Iff: X -> Y, then lyo/=/ = /ol x . 

Sketch of Proof Use Proposition 1.43. • 

Are there "reciprocals” in T(X): that is, are there any functions / for which there is 
g e T(X) with / o g = \ x and g o / = l z ? 

Definition. A function /: X -» Y is a bijection (or a one-to-one correspondence ) if it is 
both an injection and a surjection. 
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Definition. A function /: X —»■ Y has an inverse if there is a function g: Y —>• X with 
both composites g o / and fog being identity functions. 


Proposition 1.47. 

(i) If f : X —> Y and g : Y —> X are functions such that g o f = lx, then f is injective 
and g is surjective. 

(ii) A function f: X —> Y has an inverse g : Y -> X if and only if f is a bijection. 

Proof, (i) Suppose that f(x) — fix')', apply g to obtain gifix)) — gifix'))', that is, 
x — x' [because gifix)) — x], and so / is injective. If x e X, then x = gifix)), so that 
x e im g; hence g is surjective. 

(ii) If / has an inverse g. then part (i) shows that / is injective and surjective, for both 
composites g o f and / o g are identities. 

Assume that / is a bijection. For each y e Y, there is a e X with / (a) = y, since / 
is surjective, and this element a is unique because / is injective. Defining g{y) — a thus 
gives a (well-defined) function whose domain is Y, and it is plain that g is the inverse of 
/; that is, f(g(y)) = f(a) = y for all y e Y and g(f(a)) = g(y) = a for all a e X. • 


Remark. Exercise 1.59 on page 36 shows that if both / and g are injective, then so is 
their composite fog. Similarly, / o g is a surjection if both / and g are suijections. It 
follows that the composite of two bijections is itself a bijection. -4 

Notation. The inverse of a bijection / is denoted by f~ l (Exercise 1.54 on page 36 says 
that a function cannot have two inverses). 


Example 1.48. 

Here is an example of two functions / and g one of whose composites g o f is the identity 
while the other composite / o g is not the identity; thus, / and g are not inverse functions. 
If N = {n e Z : n > 0}, define /, g: N -> N as follows: 


f(n) — n + 1; 


gin) = 


0 

n — 1 


if n — 0 
if n > 1. 


The composite g o f = ljj, for gif in)) — g(n + 1) = n, because n + 1 > 1. On the other 
hand, / og^l N , because f(g(0 )) = /(0) = 1^0. 

Notice that / is an injection but not a surjection, and that g is a surjection but not an 
injection. 4 
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Two strategies are now available to determine whether or not a given function is a 
bijection: (i) use the definitions of injection and surjection; (ii) find an inverse. For ex¬ 
ample, if R :> denotes the positive real numbers, let us show that the exponential function 
/: R —> K^, defined by fix) — e x — 'ffx' 1 /n\, is a bijection. It is simplest to use the 
(natural) logarithm g(y) — lny = dt/t. The usual formulas e' ny = y and lne A = x 
say that both composites fog and go f are identities, and so / and g are inverse functions. 
Therefore, / is a bijection, for it has an inverse. (A direct proof that / is an injection would 
require showing that if e a — e h , then a — b; a direct proof showing that / is surjective 
would involve showing that every positive real number c has the form e a for some a.) 

Let us summarize the results just obtained. 

Theorem 1.49. If the set of all the bijections from a set X to itself is denoted by Sx, then 
composition of functions satisfies the following properties: 

(i) if f, 8 e S x , then f o g e S x \ 

(ii) h o (g o f) = (h o g) o f for all /, g, h e S x : 

(iii) the identity \x lies in Sx, and lx ° f — f — / o 1 x for every f e Sx: 

(iv) for every f e Sx, there is g e Sx with g o f = lx — fog. 


Sketch of Proof Part (i) follows from Exercise 1.59 on page 36, which shows that the 
composite of two bijections is itself a bijection. The other parts of the statement have been 
proved above. • 

If X and Y are sets, then a function f:X—>Y defines a “forward motion” carrying 
subsets of X into subsets of Y , namely, if S C X , then 

f(S) = {y e Y : y = f(s) for some s e 5}, 

and a “backward motion” carrying subsets of Y into subsets of X , namely, i ( W C Y , then 

f~\W) = {x € X : fix) e W}. 

We call f~\W) the inverse image of W. Formally, denote the family of all the subsets of 
a set X by VfX). If /: X —> Y, then there are functions 

U:ViX)^ViY), 


given by /,: S h- fiS), and 

/*: ViY) -» P(X), 

given by /*: ff h> f~ l (W). When / is a surjection, then these motions set up a bijection 
between all the subsets of Y and some of the subsets of X. 
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Proposition 1.50. Let X and Y be sets, and let f: X Y be a surjection. 

(i) |T C S are subsets ofX, then f(T)C. f(S), and ifU0V are subsets ofY, then 
f~ l (U) C f-\V). 

(ii) IfU c Y, then ff~\U) = U. 

(iii) The composite /*/*: V(Y) —»■ V1Y) = 1 -p(Y)> and so f*: W i->- is an 

injection. 

(iv) If S C X, then S C but strict inclusion is possible. 


Remark. If / is not a surjection, then W i->- / 1 (W) need not be an injection: There is 
some y e Y with y f(X), and / _1 ({>'}) — 0 — ◄ 

Proof, (i) If y e f(T), then y — fit) for some t e T. But t e S, because T c S, and so 
fit) e f(S). Therefore, f IT) C /(.S'). The other inclusion is proved just as easily. 

(ii) If u e U , then / being surjective says that there is x e X with fix) — u\ hence, 
x e / —1 (C/), and so u — fix) e ff~ l lU). For the reverse inclusion, let a e ff~ l lU)\ 
hence, a — fix') for some x' e f~ l lU). But this says that a — fix') e U, as desired. 

(iii) Part (ii) says that /*/* = 1 -pty), and so Proposition 1.47 says that /* is an injection. 

(iv) If s e S, then /(*) e flS), and so .v e f~ l fls) c f~ l flS). 

To see that there may be strict inclusion, let / : R —> C be given byx h> e 27T,x . If 
5 = {0}, then flS) = {1} and f~ l fl{l}) = Z. • 

In Exercise 1.68 on page 37, we will see that if /: X —> Y, then inverse image behaves 
better on subsets than does forward image; for example, f~ l IS fl T) = f~ l )S) fl f~\T), 
where S, T c Y, but for A, B c X, it is possible that /(A f!5)/ /(A) fl f(B). 

We will need cartesian products of more than two sets. One may view an element 
lxi,X 2 ) e Xi x X 2 as the function /; {1,2} —> X\ U X 2 with f(i) — Xj e X ,• for 
i = 1,2. 

Definition. Let I be a set and let [Xj ; i e /} be an indexed family of sets. Then the 
cartesian product is the set 

Y[ x i = {/= 1 -> U* 1 ' : /(*) e forall/ e 7 }- 

iel iel 

The elements x e P|, x i can be viewed as “vectors” x = (xj ) whose /th coordinate is 
Xi = f ( i ) for all i e I. If I is finite, say, I — {1,2,..., /;}, then it is not difficult to see 
that [] ■ Xj = X\ x • • • x X n , where the latter set is defined, inductively, by 

X\ x • • • x X n+ i = (Xi x • • • x X n ) x X n+l . 

If the index set I is infinite and all the X, are nonempty, it is not obvious that \\, r j X , is 
nonempty. Indeed, this assertion is equivalent to the axiom of choice (see the Appendix). 
The notion of relation , which generalizes that of a function, is useful. 
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Definition. If X and Y are sets, then a relation from X to Y is a subset R C X x Y. We 
usually write 

x R y 

to denote (x, y) e R. If X — Y, then we say that R is a relation on X. 

Let us give a concrete illustration to convince the reader that this definition is reasonable. 
One expects that < is a relation on R, and let us see that it does, in fact, realize the definition 
of relation. Let 

R = {(x, y) e R x R : (x, y) lies on or above the line y — x}. 

The reader should recognize that x R y holds if and only if, in the usual sense, x < y. 

Example 1.51. 

(i) Every function /: X —* Y is a relation. 

(ii) Equality is a relation on any set X; it is the diagonal 

Ax = {(x, x) € X x X}. 

(iii) The empty set 0 defines a relation on any set, but it is not very interesting. ◄ 


Definition. A relation x = y on a set X is 

reflexive : if x = x for all x e X\ 
symmetric: if x = y implies y = x for all x, y e X: 
transitive: if x = y and y = z imply x = z for all x. y. z e X. 

A relation that has all three properties—reflexivity, symmetry, and transitivity—is called 
an equivalence relation. 

Example 1.52. 

(i) Equality is an equivalence relation on any set X. We should regard any equivalence 
relation as a generalized equality. 

(ii) For any integer m > 0, congruence mod m is an equivalence relation on Z. a 

An equivalence relation on a set X yields a family of subsets of X. 

Definition. Let = be an equivalence relation on a set X. If a e X , the equivalence class 
of a, denoted by [a], is defined by 

[a] — [x e X : x = a} c. X. 

For example, under congruence mod m, the equivalence class [a] of an integer a is called 
its congruence class. 

The next lemma says that we can replace equivalence by honest equality at the cost of 
replacing elements by their equivalence classes. 
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Lemma 1.53. If = is an equivalence relation on a set X, then x = y if and only if 

[xi = [yl 

Proof. Assume that x = y. If z e [x], then z = x, and so transitivity gives z = y, hence 
[x] C [y], By symmetry, y = x, and this gives the reverse inclusion [y] C [x]. Thus, 
[x] - l.v]. 

Conversely, if [x] = [y], then x e [x], by reflexivity, and so x e [x] = [y]. Therefore, 
x = y. • 


Definition. A family of subsets A, of a set X is called pairwise disjoint if 


Aj n Aj = 0 


for all i f j . A partition of a set A is a family of pairwise disjoint nonempty subsets, 
called blocks, whose union is all of X. 


Proposition 1.54. If= is an equivalence relation on a set X, then the equivalence classes 
form a partition of X. Conversely, given a partition {A,- : i e 1} ofX, there is an equiva¬ 
lence relation on X whose equivalence classes are the blocks Aj. 

Proof. Assume that an equivalence relation = on X is given. Each x e X lies in the 
equivalence class [x] because = is reflexive; it follows that the equivalence classes are 
nonempty subsets whose union is X. To prove pairwise disjointness, assume that a e 
[x] IT [y], so that a = x and a = y. By symmetry, x = a, and so transitivity gives x = y. 
Therefore, [x] = [y], by the lemma, and the equivalence classes form a partition of X. 

Conversely, let {A, : i e 1} be a partition of X. If x, y e X, define x = y if there is 
i e I with both x e A, and y e A, . It is plain that = is reflexive and symmetric. To see 
that = is transitive, assume that x = y and y = z; that is, there are i, j e I with x, y e Al¬ 
and y, z e Aj. Since y e A/ IT Aj, pairwise disjointness gives A; = Aj, so that i = j and 
x, z e A,-; that is, x = z. We have shown that = is an equivalence relation. 

It remains to show that the equivalence classes are the A/’s. If x e X, then x e A/, for 
some i. By definition of =, if y e A/, then y = x and y e [x]; hence. A, C [x]. For the 
reverse inclusion, let z e [x], so that z = x. There is some j with x e A/ and z e Aj; 
thus, x e Aj IT Aj. By pairwise disjointness, i — j, so that z e A,-, and [x] C A/. Hence, 
[x] = Aj. • 


Example 1.55. 

(i) We have just seen that an equivalence relation can be defined on a set from a partition. 
Let I = [0, 1] be the closed unit interval, and define a partition of I whose blocks are the 2- 
point set {0, 1} and all the 1-point sets {a}, where 0 < a < 1. The family of all the blocks, 
that is, of all the equivalence classes, can be viewed as a circle, for we have identified the 
two endpoints of the interval. 
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Here is another construction of the circle, now from R instead of from I. Define a 
relation on M by a = b if a — b € Z. It is easy to see that this is an equivalence relation on 
]R, and the equivalence class of a number a is 

[a] = [r e R : r = a + n for some n e Z}. 

The family of all blocks is again the circle (we have identified the endpoints of any interval 
of length 1). 

(ii) Define an equivalence relation on the square Ixl in which the blocks are { (a , 0 ), (a, 1)}, 
one for each a e I, {( 0 , h), (1, /?)}, one for each h e I, as well as all the singleton sets 
{(a, b)} in the interior of the square. The family of all equivalence classes can be viewed 
as a torus (the surface of a doughnut): Identifying the left and right sides of the square 
gives a cylinder, and further identifying the top and bottom ends of the cylinder gives a 
torus. ■* 


Exercises 

1.52 Let X and Y be sets, and let / : X —»■ Y be a function. If 5 is a subset of X, prove that the 
restriction f\S is equal to the composite / o i, where i : S —> X is the inclusion map. 

Hint. Use Proposition 1.43. 

1.53 If / : X —> Y has an inverse g, show that g is a bijection. 

Hint. Does g have an inverse? 

1.54 Show that if /: X —>• Y is a bijection, then it has exactly one inverse. 

1.55 Show that /: R —» R, defined by f(x) — 3.v + 5, is a bijection, and find its inverse. 

1.56 Determine whether /: Q x Q -*■ Q, given by 

f(a/b , c/d) = (a + c)/(b + d ), 

is a function. 

1.57 Let X — {*], ..., x m J and Y = {yi..... y n ) be finite sets. Show that there is a bijection 
/: X —> Y if and only if |kj = |F|; that is, m — n. 

Hint. If / is a bijection, there are m distinct elements ..., f(x m ) in Y, and so m < n; 

using the bijection f~ l in place of / gives the reverse inequality n < m. 

1.58 If X and Y are finite sets with the same number of elements, show that the following conditions 
are equivalent for a function f: X -*■ Y: 

(i) / is injective; 

(ii) / is bijective; 

(iii) / is surjective. 

Hint. If A C X and |A| = n = |X|, then A = X\ after all. how many elements are in X but 
not in A? 

1.59 Let /: X —>• Y and g: Y —*■ Z be functions. 

(i) If both / and g are injective, then g o / is injective. 
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(ii) If both / and g are surjective, then g o / is surjective. 

(iii) If both / and g are bijective, then g o / is bijective. 

(iv) If g o / is a bijection, prove that / is an injection and g is a surjection. 

1.60 If /: (—jr/2, n/2) —> R is defined by a tan a, prove that / has an inverse function g; 
indeed, g = arctan. 

1.61 If A and B are subsets of a set X, define 

A — B = [a e A : a £ B }. 

Prove that A — B = A D B'. where B' = X — B is the complement of B \ that is, 

B' = {.x e X :x $ B). 

1.62 Let A and B be sets, and let a e A and h e B. Define their ordered pair as follows: 

(a, b) — (a, (a, b}}. 

If a' e A and b' e B, prove that (a', b') = (a, b) if and only if a' = a and b' = b. 

Hint. One of the axioms constraining the e relation is that the statement 

Cl £ X £ Cl 

is always false. 

1.63 (i) What is wrong with the following argument, which claims to prove that a symmetric 

and transitive relation R on a set X is reflexive? If x e X , then take y e X with x R y. 
By symmetry, we have y R x, and by transitivity, we have x R x. 

(ii) Give an example of a symmetric and transitive relation on a set that is not reflexive. 

1.64 (i) Let X be a set, and let R C X x X. Define R = flfi's,? ■ w h ere £ is the family of all 

the equivalence relations R' on X containing R. Prove that R is an equivalence relation 
on X (R is called the equivalence relation generated by R). 

(ii) Let R be a reflexive and symmetric relation on a set X. Prove that R, the equivalence 
relation generated by R, consists of all (.v, y) e X x X for which there exist finitely 
many ( x , y) € R. say, (x \, yj), ..., ( x n , >’«), with x = x\, y n = and y; = for 
alii > 1. 

1.65 Let X = {(a, b) : a, b e Z and b ^ Oj. Prove that the relation on X, defined by (a, b) = (c, d) 
if ad = be, is an equivalence relation on X. What is the equivalence class of (1, 2)? 

1.66 Define a relation on C by z = if if |z| = |tu|. Prove that this is an equivalence relation on C 
whose equivalence classes are the origin and the circles with center the origin. 

1.67 (i) Let /: X —> Y be a function (where X and Y are sets). Prove that the relation on X, 

defined by x = x' if fix) = fix'), is an equivalence relation. 

(ii) Define /: R — > .S’ 1 , where S 1 C C is the unit circle, by fix) = e 2nlx . What is the 
equivalence class of 0 under the equivalence relation in part (i)? 

1.68 Let /: X —> Y be a function and let V, W C Y. 

(i) Prove that 

rhv nw) = r l iV)nf- l iW) and f-\WJW) = f~hV)\J f~\W). 
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(ii) Prove that f(V U W) = f(V) U f(W). 

(iii) Give an example showing that f(V 0 IV) 7 ^ /(V) fl /(IV). 

(iv) Prove that / _1 (IV') = (/ _1 (W/, where W' = {y € Y : y $ W] is the complement of 
IV, and give an example of a function / such that f(S') 7 ^ (f(S)Y for some SCI. 
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Groups I 


2.1 Introduction 

One of the major open problems, following the discovery of the cubic and quartic formulas 
in the 1500s, was to find a formula for the roots of polynomials of higher degree, and it 
remained open for almost 300 years. For about the first 100 years, mathematicians recon¬ 
sidered what number means, for understanding the cubic formula forced such questions 
as whether negative numbers are numbers and whether complex numbers are legitimate 
entities as well. By 1800, P. Ruffini claimed that there is no quintic formula (which has 
the same form as the quadratic, cubic, and quartic formulas; that is, it uses only arith¬ 
metic operations and nth roots), but his contemporaries did not accept his proof (his ideas 
were, in fact, correct, but his proof had gaps). In 1815, A. L. Cauchy introduced the 
multiplication of permutations and proved basic properties of what we call the symmetric 
group S„ ; for example, he introduced the cycle notation and proved the unique factoriza¬ 
tion of permutations into disjoint cycles. In 1824, N. Abel (1802-1829) gave an acceptable 
proof that there is no quintic formula; in his proof, Abel constructed permutations of the 
roots of a quintic, using certain rational functions introduced by J. L. Lagrange in 1770. 
E. Galois (1811-1832), the young wizard who was killed before his 21st birthday, mod¬ 
ified the rational functions but, more important, he saw that the key to understanding the 
problem involved what he called groups: subsets of S„ that are closed under multiplica¬ 
tion - in our language, subgroups of S„. To each polynomial fix), he associated such 
a group, nowadays called the Galois group of fix). He recognized conjugation, normal 
subgroups, quotient groups, and simple groups, and he proved, in our language, that a 
polynomial (over a field of characteristic 0) has a formula for its roots, analogous to the 
quadratic formula, if and only if its Galois group is a solvable group (solvability being a 
property generalizing commutativity). A good case can be made that Galois was one of the 
most important founders of modern algebra. For an excellent account of the history of this 
problem we recommend the book, Galois’ Theory of Algebraic Equations , by J.-R Tignol. 
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Along with results usually not presented in a first course, this chapter will also review 
some familiar results whose proofs will only be sketched. 


2.2 Permutations 

For Galois, groups consisted of certain permutations (of the roots of a polynomial), and 
groups of permutations remain important today. 

Definition. A permutation of a set A is a bijection from A to itself. 

In high school mathematics, a permutation of a set X is defined as a rearrangement of 
its elements. For example, there are six rearrangements of A = {1,2, 3}: 

123; 132; 213; 231; 312; 321. 

Now let A = {1,2A rearrangement is a list, with no repetitions, of all the 
elements of A. All we can do with such lists is count them, and there are exactly n\ 
permutations of the n -element set A. 

Now a rearrangement i \, i 2 , ...,/„ of A determines a function a: X -* A, namely, 
a(l ) = f 1 , a(2) = 12 , • • •, oc(n) = i n . For example, the rearrangement 213 determines the 
function a with cr(l) = 2, a(2) = 1, and a (3) = 3. We use a two-rowed notation to denote 
the function corresponding to a rearrangement; if a(j) is the j th item on the list, then 

_ / 1 2 ... j ... n \ 

01 \a(l) a( 2 ) ... a( 7 ) ... a(n)J ' 

That a list contains all the elements of A says that the corresponding function a is surjec¬ 
tive, for the bottom row is ima; that there are no repetitions on the list says that distinct 
points have distinct values; that is, a is injective. Thus, each list determines a bijection 
a: X -> A; that is, each rearrangement determines a permutation. Conversely, every per¬ 
mutation a determines a rearrangement, namely, the list a(l), a( 2 ),..., a(n) displayed as 
the bottom row. Therefore, rearrangement and permutation are simply different ways of 
describing the same thing. The advantage of viewing permutations as functions, however, 
is that they can now be composed and, by Exercise 1.59 on page 36, their composite is also 
a permutation. 

Definition. The family of all the permutations of a set A, denoted by Sx , is called the 
symmetric group on A. When A = {1,2,..., n}, Sx is usually denoted by S„ , and it is 
called the symmetric group on n letters. 

Let us simplify notation by writing fia instead of /3 o a and (1) instead of lx- 
Notice that composition in S 3 is not commutative. Aside from being cumbersome, there 
is a major problem with the two-rowed notation for permutations. It hides the answers to 
elementary questions such as. Do two permutations commute? Is the square of a permuta¬ 
tion the identity? The special permutations introduced next will remedy this defect. 
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Definition. Let / 1 , / 2 , - - -, i r be distinct integers in {1, 2If a e S n fixes the other 
integers (if any) and if 

a(i't) = h, 01(h) — h, ■■■, udr-i) = ir , ot(i r ) = L, 
then a is called an r-cycle. We also say that a is a cycle of length r, and we denote it by 

OL = (;I i 2 ... i r ). 

A 2-cycle interchanges if and L and fixes everything else; 2-cycles are also called 
transpositions. A 1-cycle is the identity, for it fixes every i ; thus, all 1-cycles are equal: 
0 ) = ( 1 ) for all i. 

The term cycle comes from the Greek word for circle. Picture the cycle 0t h ■ ■ ■ ir) 
as a clockwise rotation of the circle, as in Figure 2.1. 

h 

ii 2 

Figure 2.1 

Any i j can be taken as the “starting point,” and so there are r different cycle notations for 
any r-cycle: 

0i h ■■■ ir) = O 2 h ■■■ i>- h) = ■■■ = ( ir h h ■■■ ir- 1 ). 

Let us now give an algorithm to factor a permutation into a product of cycles. For example, 
take 

_(( 2345678 9\ 

4 7 2 5 1 8 9 3/ 

Begin by writing “(1.” Now a: 1 k 6 , so write “(1 6 .” Next, a: 6 i-> 1, and so the 
parentheses close: a begins “(1 6 ).” The first number not having appeared is 2, and so 
we write “(1 6)(2.” Now a: 2 4, so we write “(1 6)(2 4.” Since a: 4 2, the 

parentheses close once again, and we write “(1 6 ) (2 4).” The smallest remaining number 
is 3; now 3 1 -^ 7, 7 8 , 8 i-> 9, and 9 h- 3; this gives the 4-cycle (3 7 8 9). Finally, 

a (5) = 5; we claim that 


a = (1 6)(2 4)(3 7 8 9)(5). 
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Since multiplication in S„ is composition of functions, our claim is that 

«(«) = [(1 6)(2 4)(3 7 8 9)(5)](0 

for every i between 1 and 9 [after all, two functions / and g are equal if and only if 
f(i) — g(i) for every i in their domain]. The right side is the composite /8y<5, where 
P — (1 6 ), y — (2 4), and 8 = (3 7 8 9) [we may ignore the 1-cycle (5) when we are 
evaluating, for it is the identity function]. Now a(l) = 6; let us evaluate the composite on 
the right when i — 1. 


Py8(l) = 

= P(y( 1)) 5 = (3 7 8 9) fixes 1 

= P(l) y = ( 2 4) fixes 1 
= 6 p = (16). 

Similarly, a(i) — PyS(i) for every i, proving the claim. 

We multiply permutations from right to left, because multiplication here is composite 
of functions; that is, to evaluate o//3( 1), we compute «(/J (1)). Here is another example: Let 
us compute the product 

a = (1 2) (1 3 42 5)(2 5 1 3) 

in .S5. To find the two-rowed notation for cr, evaluate, starting with the cycle on the right: 

a \ 1 i-k 3 i-k 4 i-k 4; 
o\ 2 i—5 i—^ 1 i-> 2; 
o'. 3 i—2 i—5 i—^ 5; 
a: 4 k 4i-> 2 i-> 1; 
ct: 5 k 1 k 3 k 3. 


Thus, 1 



2 3 
2 5 



The algorithm given earlier, when applied to this two-rowed notation for cr, now gives 


cr = (l 4)(2)(5 3). 


In the factorization of a permutation into cycles, given by the preceding algorithm, we 
note that the family of cycles is disjoint in the following sense. 

1 There are authors who multiply permutations differently, so that their a o f3 is our f3 oa. This is a consequence 
of their putting “functions on the right”: Instead of writing a (i) as we do, they write (i)a. Consider the composite 
of permutations a and (3 in which we first apply (3 and then apply a. We write i i—>• fi(i) i—>• a(ft(i)). In the right¬ 
sided notation, i i—>• (i)(3 i—>• ((i)/3)a. Thus, the notational switch causes a switch in the order of multiplication. 
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Definition. Two permutations a, f> e S„ are disjoint if every i moved by one is fixed by 
the other: If a(i) ^ i, then P(i) = i, and if f{j) ^ j, then a(j) — j. A family Pi ..., p t 
of permutations is disjoint if each pair of them is disjoint. 

Lemma 2.1. Disjoint permutations a, p e S n commute. 

Proof. It suffices to prove that if 1 < f < n, then afi(i) — fa(i). If p moves i, say, 
fi(i) — j / i, then ft also moves j [otherwise, fi(j) — j and ft(i) — j contradicts 
yS’s being an injection]; since a and ft are disjoint, a(i) — i and a (j ) = j. Hence 
fia(i) — j — af(i). The same conclusion holds if a moves i. Finally, it is clear that 
af(i) = fa(i) if both a and p fix i. • 

Proposition 2.2. Every permutation a e S n is either a cycle or a product of disjoint 
cycles. 

Proof. The proof is by induction on the number k of points moved by a. The base step 
k = 0 is true, for now a is the identity, which is a 1 -cycle. 

Iffc > 0, let ii be a point moved by a. Define L = cr (/ |)- M = <x(i 2 ),..., tV+t = ot(i r ), 
where r is the smallest integer for which i r +\ e {i i, ij, ■ ■ ■, i r } (since there are only n 
possible values, the list t'i, h, h, ■ ■ ■, ik> ■ ■ ■ must eventually have a repetition). We claim 
that a(i r ) = i\. Otherwise, a(i r ) — ij for some j > 2; but a(ij- 1 ) = ij , and this 
contradicts the hypothesis that a is an injection. Let o be the r-cycle (i\ (4 (3 ... i r ). 
If r = n, then a = o. If r < n , then a fixes each point in Y, where Y consists of the 
remaining n — r points, while a(Y ) = Y. Define a' to be the permutation with a'(i) — u{i) 
for i e Y that fixes all / j Y. and note that 

a — oa'. 

The inductive hypothesis gives a' — f>\ - ■ ■ f t , where fi \,..., f t are disjoint cycles. Since 
o and a' are disjoint, a — of \ ■ ■ ■ f) t is a product of disjoint cycles. • 

Usually we suppress the 1-cycles in this factorization [for 1-cycles equal the identity 
(1)]. However, a factorization of a in which we display one 1-cycle for each i fixed by a, 
if any, will arise several times. 

Definition. A complete factorization of a permutation a is a factorization of a into dis¬ 
joint cycles that contains exactly one 1 -cycle (i) for every i fixed by a. 

For example, the complete factorization of the 3-cycle a = (1 3 5) in S 5 is a = 
(1 3 5) (2) (4). 

There is a relation between an r-cycle ft = (i\ i 2 ... i r ) and its powers f J > k . where 
f k denotes the composite of ft with itself k times. Note that L = f(i 1 ), is — fiih) = 
P(PUi)) = yS 2 (;'i), M = fids) = P(P 2 dt)) = p 3 d l), and, more generally, 

ik +1 = P k (i 1 ) 


for all k < r. 
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Theorem 2.3. Let a e S n and let a = Pi ■ ■ ■ f$ t be a complete factorization into disjoint 
cycles. This factorization is unique except for the order in which the cycles occur. 

Sketch of Proof. Since every complete factorization of a has exactly one 1-cycle for each 

i fixed by a, it suffices to consider (not complete) factorizations into disjoint cycles of 
length > 2. Let a — y\ ■ ■ ■ y s be a second such factorization of a into disjoint cycles. 

The theorem is proved by induction on l , the larger of t and .v. The inductive step begins 
by noting that if f, moves i \ , then Pj (i i) = a k (i\ ) for all k > 1. Some yj must also move 

i i and, since disjoint cycles commute, we may assume that y s moves i\. It follows that 
ft = Ys\ right multiplying by gives Pi • • • P t -i = Yi • • • Ys-i- • 

Every permutation is a bijection; how do we find its inverse? In the pictorial representa¬ 
tion of a cycle f as a clockwise rotation of a circle, the inverse f~ l is just a counterclock¬ 
wise rotation. The proof of the next proposition is straightforward. 

Proposition 2.4. 

(i) The inverse of the cycle a = (i\ h ... i r ) is the cycle (i r i r - 1 ... (’i): 

(f 1 12 ■■■ ir)~ l = C Ir ir -1 • ■ • D- 

(ii) Ify e S n and y — P 1 • • • fk, then 



Definition. Two permutations a, f e S„ have the same cycle structure if their complete 
factorizations have the same number of r-cycles for each r. 

According to Exercise 2.4 on page 50, there are 

(1 /r)[n(n — 1) • • • (n — r + 1)] 

r-cycles in S„. This formula can be used to count the number of permutations having any 
given cycle structure if we are careful about factorizations having several cycles of the 
same length. For example, the number of permutations in 54 of the form (a b)(c d) is 
^[^(4 x 3)] x [^(2 x 1)] = 3, the “extra” factor | occurring so that we do not count 
(o b)(c d) — (c d)(a b) twice. 

Example 2.5. 

(i) The types of permutations in G = S 4 are counted in Table 2.1. 


Cycle Structure 

Number 

(1) 

1 

(12) 

6 

(12 3) 

8 

(1 2 3 4) 

6 

(1 2)(3 4) 

3 

24 


Table 2.1. Permutations in .S 4 
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(ii) The types of permutations in G — S$ are counted in Table 2.2. 


Cycle Structure 

Number 

(1) 

1 

(12) 

10 

(12 3) 

20 

(1 2 3 4) 

30 

(1 2 3 4 5) 

24 

(1 2)(3 4 5) 

20 

(12)0 4) 

15 


120 


Table 2.2. Permutations in 5s 


Here is a computational aid. We illustrate its statement in the following example before 
stating the general result. 

Example 2.6. 

If y = (1 3)(2 4 7)(5)( 6 ) and a = (2 5 6)(1 4 3), then 

aya~' = (4 1)(5 3 7)(6)(2) = (al a3)(Q;2 a4 a7)(a5)(a6). ◄ 

Lemma 2.1. Ify, a e S„, then aya~ i has the same cycle structure as y. In more detail, 
if the complete factorization ofy is 

y = f]_p 2 •••O't h ...)■■■ ft, 

then aya~ l is the permutation that is obtained from y by applying a to the symbols in the 
cycles ofy. 

Proof. The idea of the proof is that yay~ l : y(i\) i->- i\ i-> i 2 i-> y(i 2 ). Let a denote 
the permutation defined in the statement. 

If y fixes i, then o fixes a (i), for the definition of a says that ot(i) lives in a 1-cycle in 
the factorization of a. On the other hand, aya~ l also fixes a(i): 

aya~ { {a{i)) = ay(i) — a (i), 

because y fixes i. 

Assume that y moves a symbol i\ , say, y(i\) — i 2 , so that one of the cycles in the 
complete factorization of y is 

O't h 

By the definition of a , one of its cycles is 

(kl ...), 

where a(i\) — k and a(i 2 ) = I\ hence, a: k i. But aya~ l : k i-> i\ i 2 i->- l, 
and so aya~ l (k) = <r(k). Therefore, a and aya~ l agree on all symbols of the form 
k = a(i i). Since a is surjective, every k is of this form, and so rr — aya~ l . • 
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Example 2.8. 

In this example, we illustrate that the converse of Lemma 2.7 is true; the next theorem will 
prove it in general. In S 5 , place the complete factorization of a 3-cycle ft over that of a 
3-cycle y, and define a to be the downward function. For example, if 

ft = (1 2 3) (4) (5) 

Y = (5 2 4) (1) (3), 

then 

(\ 2 3 4 5\ 
a= {5 2 4 1 3)' 

and so a = (1 5 3 4). Now a e S 5 and 

y — (al a2 a3), 

so that y = aft a -1 , by Lemma 2.7. Note that rewriting the cycles of ft, for example, as 
ft = (1 2 3)(5)(4), gives another choice for a. a 


Theorem 2.9. Permutations y and a in S n have the same cycle structure if and only if 
there exists a e S„ with a — aya~ l . 

Sketch of Proof. Sufficiency was just proved in Lemma 2.7. For the converse, place one 
complete factorization over the other so that each cycle below is under a cycle above of the 
same length: 


Y — <5i<52 ••■Li h •••)••• S t 
aya~ l = rjit ]2 ■■■ (k £■■■)■■■ rj t . 

Now define a to be the “downward" function, as in the example; hence, a(i 1 ) = k, a(h) = 
l, and so forth. Note that a is a permutation, for there are no repetitions of symbols in 
the factorization of y (the cycles ij are disjoint). It now follows from the lemma that 
a = aya _> . • 

There is another useful factorization of a permutation. 

Proposition 2.10. Ifn > 2, then every a e S n is a product of transpositions. 

Sketch of Proof. In light of Proposition 2.2, it suffices to factor an r-cycle ft into a product 
of transpositions, and this is done as follows: 




£ = (12 ... r) = (1 r)(l r — 1) • • • (1 3)(1 2). 
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Every permutation can thus be realized as a sequence of interchanges, but such a fac¬ 
torization is not as nice as the factorization into disjoint cycles. First, the transpositions 
occurring need not commute: (12 3) = (1 3)(1 2) ^ (1 2)(1 3); second, neither the 
factors themselves nor the number of factors are uniquely determined. For example, here 
are some factorizations of (1 2 3) in S 4 : 

(12 3) = (1 3) (1 2) 

= (2 3)(1 3) 

= (1 3)(4 2)(1 2)(1 4) 

= (1 3)(4 2)(1 2)(1 4)(2 3)(2 3). 

Is there any uniqueness at all in such a factorization? We now prove that the parity of 
the number of factors is the same for all factorizations of a permutation a; that is, the 
number of transpositions is always even or always odd (as suggested by the factorizations 
of a = (1 2 3) displayed above). 

Example 2.11. 

The 15-puzzle has a starting position that is a 4 x 4 array of the numbers between 1 and 
15 and a symbol #, which we interpret as “blank.” For example, consider the following 
starting position: 


3 

15 

4 

8 

10 

11 

1 

9 

2 

5 

13 

12 

6 

7 

14 

# 


A simple move interchanges the blank with a symbol adjacent to it; for example, there 
are two beginning simple moves for this starting position: Either interchange # and 14 or 
interchange # and 12. We win the game if, after a sequence of simple moves, the starting 
position is transformed into the standard array 1, 2, 3, ..., 15, #. 

To analyze this game, note that the given array is really a permutation a e .S’ 15 (if we 
now call the blank 16 instead of #). More precisely, if the spaces are labeled 1 through 16, 
then aii) is the symbol occupying the /th square. For example, the given starting position 
is 

/I 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16\ 

^3 15 4 8 10 11 1 9 2 5 13 12 6 7 14 16 )' 

Each simple move is a special kind of transposition, namely, one that moves 16 (remember 
that the blank is now 16). Moreover, performing a simple move (corresponding to a special 
transposition r) from a given position (corresponding to a permutation fi) yields a new 
position corresponding to the permutation rfi. For example, if a is the position above and 
r is the transposition interchanging 14 and 16, then ra(16) = t( 16) = 14 and ra(15) = 
r(14) = 16, while r a{i) — i for all other i. That is, the new configuration has all the 
numbers in their original positions except for 14 and 16 being interchanged. To win the 
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game, we need special transpositions Ti,T 2 ,, x m so that 

r m ■ ■ ■ r 2 ri a = (1). 

It turns out that there are some choices of a for which the game can be won, but there are 
others for which it cannot be won, as we shall see in Example 2.15. < 


Definition. A permutation a e S„ is even if it can be factored into a product of an even 
number of transpositions; otherwise, a is odd. The parity of a permutation is whether it is 
even or odd. 

It is easy to see that (1 2 3) and (1) are even permutations, for there are factorization 
(12 3) = (1 3)(1 2) and (1) = (1 2)(1 2) having two transpositions. On the other 
hand, we do not yet have any examples of odd permutations! If a is a product of an 
odd number of transpositions, perhaps it also has some other factorization into an even 
number of transpositions. The definition of odd permutation a, after all, says that there is 
no factorization of a into an even number of transpositions. 

Definition. If a e S n and a = fi\ ■ ■ ■ is a complete factorization into disjoint cycles, 
then signum a is defined by 

sgn (a) = (-1)"-'. 

Theorem 2.3 shows that sgn is a (well-defined) function, for the number t is uniquely 
determined by a. Notice that sgn(e) = 1 for every 1-cycle e because t — n. If r is a 
transposition, then it moves two numbers, and it fixes each of the n — 2 other numbers; 
therefore, t — (n — 2) + 1 = n — 1, and so sgn(r) = (—1)" _( " _1) — —1. 

Theorem 2.12. For all a, f e S n , 

sgn(a/3) = sgn(a) sgn03). 

Sketch of Proof. If k, l > 0 and the letters a, b, ci, d j are all distinct, then 

(a b)(a ci ... Ck b d\ ... df) — (a c\ ... Ck)(b d\ ... df)\ 

multiplying this equation on the left by (a b) gives 

(i a b){a ci ... Ck){b d i ... dc) = (a q ... Ck b d i ... di). 

These equations are used to prove that sgn(ro') = — sgn(a) for every a e S„, where r 
is the transposition (a b). If a € S n has a factorization a — n • • • r m , where each r,- is a 
transposition, we now prove, by induction on m, that sgn(aft) = sgn(a) sgn(/3) for every 
fi e S n . • 
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Theorem 2.13. 

(i) Let a € .S',,; ifsgn(cn) = 1, then a is even, and ifsgn(a) — — 1, then a is odd. 

(ii) A permutation a is odd if and only if it is a product of an odd number of transposi¬ 
tions. 

Proof, (i) If a. — x\ ■ ■ ■ x q is a factorization of a into transpositions, then Theorem 2.12 
gives sgn(a) = sgn(rj) • • • sgnlr^) = (— \) q . Thus, if sgn(cn) = 1, then q must always be 
even, and if sgn(a) = — 1 , then q must always be odd. 

(ii) If a is odd, then a is not even, and so sgn(a) 1; that is, sgn(a) = —1. Now 
a — x\ - ■ ■ x q , where the r; are transpositions, so that sgn(a) = —1 = (— l) q \ hence, q is 
odd (we have proved more; every factorization of a into transpositions has an odd number 
of factors). Conversely, if a = x\ ■ ■ ■ x q is a product of transpositions with q odd, then 
sgn(oi) = — 1 ; therefore, a is not even and, hence, a is odd. • 


Corollary 2.14. Let a, f> e S n . If a and f have the same parity, then a/3 is even, while if 
a and f have distinct parity, then a/3 is odd. 


Example 2.15. 

An analysis of the 15-puzzle in Example 2.11 shows that if a e .S’if, is the starting position, 
then the game can be won if and only if a is an even permutation that fixes 16. For a proof 
of this, we refer the reader to McCoy-Janusz, Introduction to Modern Algebra, pages 229- 
234. The proof in one direction is fairly clear, however. The blank 16 starts in position 16. 
Each simple move takes 16 up, down, left, or right. Thus, the total number m of moves 
is u + d + l + r, where u is the number of up moves, and so on. If 16 is to return home, 
each one of these must be undone: There must be the same number of up moves as down 
moves (i.e., u — d) and the same number of left moves as right moves (i.e., r — l). Thus, 
the total number of moves is even: m — 2u + 2 r. That is, if x m ■ ■ ■ x\a — (1), then m is 
even; hence, a = x\ ■ ■ ■ x m (because r -1 = r for every transposition r), and so a is an 
even permutation. Armed with this theorem, we see that if the starting position a is odd, 
the game starting with a cannot be won. In Example 2.11, 

a = (1 34892 15 14 7)(5 10)(6 11 13)(12)(16) 

[(12) and (16) are 1-cycles]. Now sgn(cn) = (— l ) 16-5 = —1, so that a is an odd permuta¬ 
tion. Therefore, it is impossible to win this game. ■* 


Exercises 


2.1 Find sgn(a) and a 1 , where 


/I 2 3 4 5 6 7 

y9 8 7 6 5 4 3 


8 9\ 

2 I)' 
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2.2 If a e S„, prove that sgn(a = sgn(a). 

2.3 If a € S n fixes some j, where 1 < j < n [that is, a(j) = j], define a' e S„-\ by 
a’(i) = a{i) for all i ^ j. Prove that 

sgn(o' / ) = sgn(cr). 

Hint. Use the complete factorizations of a and of a'. 

2.4 If 1 < r < n, show that there are 

\[n{n - 1 ) ■•■(« — r + 1 )] 

r-cycles in S n . 

Hint. There are r cycle notations for any r-cycle. 

2.5 (i) If a is an r-cycle, show that a r — (1). 

Hint. If a = (Jq ... i r -\), show that a k ( iq) = if,. 

(ii) If a is an r-cycle, show that r is the smallest positive integer k such that a k = (1). 
Hint. Use Proposition 2.2. 

2.6 Show that an r-cycle is an even permutation if and only if r is odd. 

2.7 Given X — {1, 2,.... n}, let us call a permutation r of X an adjacency if it is a transposition 
of the form (i i + 1) for i < n. 

(i) Prove that every permutation in S n , for n > 2. is a product of adjacencies. 

(ii) If i < j , prove that (i j) is a product of an odd number of adjacencies. 

Hint. Use induction on j — i. 

2.8 Define /: [0. 1, 2, ..., 10) -► {0, 1, 2,..., 10) by 

f(n) = the remainder after dividing An~ — 3« 7 by 11. 

(i) Show that / is a permutation. 2 

(ii) Compute the parity of /. 

(iii) Compute the inverse of /. 

2.9 If a is an r-cycle and 1 < k < r, is a k an r-cycle? 

2.10 (i) Prove that if a and f) are (not necessarily disjoint) permutations that commute, then 

(aP) k = ct k p k for all k > 1. 

Hint. First show that fia k = a k p by induction on k. 

(ii) Give an example of two permutations a and p for which (a/3) 2 ^ a~p 2 . 

2.11 (i) Prove, for all i, that a e S n moves i if and only if a 1 moves i. 

(ii) Prove that if a, p e S n are disjoint and if a/3 = (1). then a = (1) and p = (1). 

2.12 Prove that the number of even permutations in S n is !. 

Hint. Let r = (1 2), and define /: A n -*■ O n , where A„ is the set of all even permutations 
in S n and O n is the set of all odd permutations, by 

/: a ta. 

Show that / is a bijection, so that \A n \ = \O n \ and, hence, |A„ | = \n !. 

2 If k is a finite field, then a polynomial f(x ) with coefficients in k is called a permutation polynomial if 
the evaluation function /: k —^ k, defined by a i-> f (a ), is a permutation of k. A theorem of Hermite and 
Dickson characterizes permutation polynomials (see Lidl-Niederreiter, Introduction to Finite Fields and Their 
Applications). 
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2.13 (i) How many permutations in S 5 commute with a = (1 2 3), and how many even permu¬ 

tations in S 5 commute with a? 

Hint. There are 6 permutations in S$ commuting with a , only 3 of which are even. 

(ii) Same questions for (1 2) (3 4). 

Hint. There are 8 permutations in S 4 commuting with (1 2) (3 4), and only 4 of them 
are even. 

2.14 Give an example of a, f},y € S 5 , with a 7 ^ ( 1 ), such that afi = j5a, ay = ya and fiy 7 ^ yf). 

2.15 Ifn > 3, show that if a e S n commutes with every /S e S n , then a = (1). 

2.16 If a — f}\ ■ ■ ■ f} m is a product of disjoint cycles, prove that y = /S^ 1 ■ • • <5 commutes with 

a , where e,- > 0 for all /', and S is disjoint from a. 


2.3 Groups 

Since Galois’s time, groups have arisen in many areas of mathematics other than the study 
of roots of polynomials, for they are the way to describe the notion of symmetry, as we 
shall see. 

The essence of a “product” is that two things are combined to form a third thing of the 
same kind. For example, ordinary multiplication, addition, and subtraction combine two 
numbers to give another number, while composition combines two permutations to give 
another permutation. 

Definition. A binary operation on a set G is a function 

* : G x G -> G. 

In more detail, a binary operation assigns an element *(x, y) in G to each ordered 
pair (x, y) of elements in G. It is more natural to write x * y instead of *(x, y); thus, 
composition of functions is the function (g, /) i-> jo /; multiplication, addition, and 
subtraction are, respectively, the functions (x, y) i-> xy, (x, y) i->- x + y, and (x, y) i-> 
x — y. The examples of composition and subtraction show why we want ordered pairs, for 
x *y and y * x may be distinct. As with any function, a binary operation is well-defined; 
when one says this explicitly, it is usually called the law of substitution : 

If x = x' and y = y', then x *y = x' * y'. 

Definition. A group is a set G equipped with a binary operation * such that 

(i) the associative law holds: for every x, y, z e G, 

x * (y * z) — (x * y) * z; 

(ii) there is an element e e G, called the identity , with e*x — x—x*e for all x e G; 

(iii) every x e G has an inverse ; there is x' e G with x * x' — e — x' * x. 
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By Theorem 1.49, the set Sx of all permutations of a set X, with composition as the 
operation and lx — (1) as the identity, is a group (the symmetric group on X). In Ex¬ 
ercise 2.22 on page 61, the reader will see that some of the equations in the definition of 
group are redundant. This is a useful observation, for it is more efficient, when verifying 
that a set with an operation is actually a group, to check fewer equations. 

We are now at the precise point when algebra becomes abstract algebra. In contrast to 
the concrete group S n consisting of all the permutations of (1.2,...,;;}, we have passed 
to groups whose elements are unspecified. Moreover, products of elements are not ex¬ 
plicitly computable but are, instead, merely subject to certain rules. It will be seen that 
this approach is quite fruitful, for theorems now apply to many different groups, and it is 
more efficient to prove theorems once for all instead of proving them anew for each group 
encountered. In addition to this obvious economy, it is often simpler to work with the 
“abstract” viewpoint even when dealing with a particular concrete group. For example, 
we will see that certain properties of S„ are simpler to treat without recognizing that the 
elements in question are permutations (see Example 2.26). 

Definition. A group G is called abelian 3 if it satisfies the commutative law. 

x * y — y * x 


holds for every x, y E G. 

The groups S n , for n > 3, are not abelian because (12) and (1 3) are elements of S n 
that do not commute: (1 2)(1 3) = (1 3 2) and (1 3)(1 2) = (1 2 3). 

Lemma 2.16. Let G be a group. 

(i) The cancellation laws hold : If either x * a — x * b or a * x = b * x, then a — b. 

(ii) The element e is the unique element in G with e * x — x — x * e for all x E G. 

(iii) Each x E G has a unique inverse: There is only one element x' E G with x * x' — 
e = x' * x ( henceforth, this element will be denoted by x _1 ). 

(iv) (x -1 ) -1 = x for all x E G. 

Proof, (i) Choose x' with x' * x = e = x * x'\ then 

a = e * a = (x r * x) * a = x r * (x * a) 

= x' * (x * b) — (x r * x) * b = e * b — b. 

A similar proof works when x is on the right. 

(ii) Let eo E G satisfy eo * x = x = x * eo for all x E G. In particular, setting x = e in 
the second equation gives e = e * eo; on the other hand, the defining property of e gives 
e * eo — eo. so that e = eo- 

''the reason why commutative groups are called abelian can be found on page 236. 
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(iii) Assume that x" e G satisfies x * x" = e = x" * x. Multiply the equation e — x * x' 
on the left by x" to obtain 

x" = x" * e — x" * (x * x') — (. x" * x) * x' — e * x' = x'. 

(iv) By definition, (x -1 ) -1 * x _1 = e = x -1 * (x -1 ) -1 . But x * x -1 — e — x~ l * x, so 
that (x -1 ) -1 = x, by (iii). • 

From now on, we will usually denote the product x * y in a group by xy (we have 
already abbreviated a o fit to afl in symmetric groups), and we will denote the identity by 1 
instead of by e. When a group is abelian, however, we will often use the additive notation 
x + y; in this case, we will denote the identity by 0 , and we will denote the inverse of an 
element x by —x instead of by x _1 . 

Example 2.17. 

(i) The set Q x of all nonzero rationals is an abelian group, where * is ordinary multiplica¬ 
tion, the number 1 is the identity, and the inverse of r e Q x is 1 /r. Similarly, R x and C x 
are multiplicative abelian groups. 

Note that the set Z x of all nonzero integers is not a multiplicative group, for none of its 
elements (aside from ± 1 ) has a multiplicative inverse which is an integer. 

(ii) The set Z of all integers is an additive abelian group with a * b — a + b. with identity 
e = 0, and with the inverse of an integer n being — n. Similarly, we can see that Q, R, and 
C are additive abelian groups. 

(iii) The circle group. 

S l = {z e C : |z| = 1}, 

is the group whose operation is multiplication of complex numbers; this is an operation 
because the product of complex numbers of modulus 1 also has modulus 1, by Corol¬ 
lary 1.31. Complex multiplication is associative, the identity is 1 (which has modulus 1), 
and the inverse of any complex number of modulus 1 is its complex conjugate, which also 
has modulus 1. Therefore, .S ’ 1 is a group. 

(iv) For any positive integer n, let 

Pn — : 0 < k < n J 

be the set of all the nth roots of unity, where 

f = e 2lti l n = cos(^) + i sin(^). 

The reader may use De Moivre’s theorem to see that //„ is a group with operation multipli¬ 
cation of complex numbers; moreover, the inverse of any nth root of unity is its complex 
conjugate, which is also an nth root of unity. 
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(v) The plane f xlisa group with operation vector addition; that is, if a — (x, y) and 
a' = (x\ y'), then a + a' = (x + x', y + y'). The identity is the origin O = (0, 0), and 
the inverse of (x, y) is (—x, —y). ◄ 


Example 2.18. 

Let X be a set. If U and V are subsets of X, define 

U - V = {x e U : x i V}. 

The Boolean group B (X) [named after the logician G. Boole (1815-1864)] is the family 
of all the subsets of X equipped with addition given by symmetric difference A + B, where 

A + B = (A — B) U (B — A); 

symmetric difference is pictured in Figure 2.2. 




It is plain that A + B — B + A, so that symmetric difference is commutative. The 
identity is 0, the empty set, and the inverse of A is A itself, for A + A — 0. The reader 
may verify associativity by showing that both (A + B) + C and A + (B + C) are described 
by Figure 2.3. ◄ 


Example 2.19. 

An n x n matrix A with real entries is called nonsingular if it has an inverse; that is, there 
is a matrix B with AB — I — BA, where I — [< 5 i; ] (Sjj is the Kronecker delta) is the 
n x n identity matrix. Since ( AB)~ l — B~ l A~ l , the product of nonsingular matrices is 
itself nonsingular. The set GL(n, R) of all n x « nonsingular matrices having real entries, 
with binary operation matrix multiplication, is a (nonabelian) group, called the general 
linear group. [The proof of associativity is routine, though tedious; a “clean” proof of 
associativity can be given (Corollary 3.99) once the relation between matrices and linear 
transformations is known.] -4 
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A binary operation allows us to multiply two elements at a time; how do we multiply 
three elements? There is a choice. Given the expression 2x3x4, for example, we can 
first multiply 2x3 = 6 and then multiply 6x4 = 24; or, we can first multiply 3 x 4 = 12 
and then multiply 2x12 = 24; of course, the answers agree, for multiplication of numbers 
is associative. Thus, if an operation is associative, the expression abc is not ambiguous. 
Not all operations are associative, however. For example, subtraction is not associative: if 
c ^ 0, then 

a — (b — c) ^ (a — b) — c, 

and so the notation a — b — c is ambiguous. The cross product of two vectors in R 3 is 
another example of a nonassociative operation. 

Definition. If G is a group and if a e G, define the powers 4 a 11 , for n > 1, inductively: 

a 1 — a and a n+l — aa". 

Define a 0 = 1 and, if n is a positive integer, define 

a~ n = (a -1 )". 

The reader expects that (a -1 )" = (a") -1 ; this is a special case of the equation in Exer¬ 
cise 2.17 on page 61, but this is not so obvious to prove at this stage. For example, showing 
that a~ 2 a 2 — 1 amounts to doing the cancellation in the expression (a~ l a~ l )(aa)\ but as¬ 
sociativity is given to us only for products having three, not four, factors. 

Let us return to powers. The first and second powers are fine: a 1 = a and a 2 = aa. 
There are two possible cubes: We have defined a 2 — aa 2 — a(aa), but there is another 
reasonable contender: (aa)a — a 2 a. If we assume associativity, then these are equal: 

a 2 = aa 2 — a(aa) — (aa)a = a 2 a. 

There are several possible products of a with itself four times; assuming that the operation 
is associative, is it obvious that a 4 — a 2 a — a 2 a 2 l And what about higher powers? 

Define an expression a\U 2 - ■ ■ a n to be an /;-tuple in G x • • • x G (n factors). An 
expression yields many elements of G by the following procedure. Choose two adjacent 
a’s, multiply them, and obtain an expression with n — 1 factors: The new product just 
formed and n — 2 original factors. In this shorter new expression, choose two adjacent 
factors (either an original pair or an original one together with the new product from the first 
step) and multiply them. Repeat this procedure until there is an expression with only two 

4 The terminology x square and x cube for x 2 and x 3 is, of course, geometric in origin. Usage of the word 
power in this context arises from a mistranslation of the Greek dunamis (from which dynamo derives) used by 
Euclid. Power was the standard European rendition of dunamis; for example, the first English translation of 
Euclid, in 1570, by H. Billingsley, renders a sentence of Euclid as, “The power of a line is the square of the same 
line.” However, contemporaries of Euclid (e.g., Aristotle and Plato) often used dunamis to mean amplification, 
and this seems to be a more appropriate translation, for Euclid was probably thinking of a one-dimensional 
line sweeping out a two-dimensional square. (I thank Donna Shalev for informing me of the classical usage of 
dunamis .) 
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factors; multiply them and obtain an element of G; call this an ultimate product derived 
from the expression. For example, consider the expression abed. We may first multiply ab , 
obtaining ( ab)cd , an expression with three factors, namely, ab , c, d. We may now choose 
either the pair c, d or the pair ab , c; in either case, multiply these, obtaining expressions 
with two factors: ( ab)(cd ) having factors ab and cd or (( ab)c)d having factors ( ab)c 
and d. The two factors in either of these last expressions can now be multiplied to give 
an ultimate product from abed. Other ultimate products derived from the expression abed 
arise by multiplying be or cd as the first step. It is not obvious whether the ultimate 
products derived from a given expression are all equal. 

Definition. An expression a 102 ■ ■ ■ a n needs no parentheses if all the ultimate products 
it yields are equal; that is, no matter what choices are made of adjacent factors to multiply, 
all the resulting products in G are equal. 


Theorem 2.20 (Generalized Associativity). If G is a group and a 1,02, ■ ■a n e G, 

then the expression aia2 ■ ■ ■ a n needs no parentheses. 


Remark. This result holds in greater generality, for neither the identity element nor 
inverses will be used in the proof. ■* 

Proof. The proof is by (the second form of) induction. The base step n = 3 follows from 
associativity. For the inductive step, consider two ultimate products U and V obtained 
from an expression aia 2 • • • a n after two series of choices: 

(ai ■ ■ • a,)(a ( +i • • • a n ) and {a\ ■ ■ ■ aj)(aj+\-■ ■ a n )\ 

the parentheses indicate the last two factors which multiply to give U and V ; there are 
many parentheses inside each of these shorter expressions. We may assume that i < j. 
Since each of the four expressions in parentheses has fewer than n factors, the inductive 
hypothesis says that each needs no parentheses. It follows that U = V if i = j. If i < j, 
then the inductive hypothesis allows the first expression to be rewritten 

U = (ai ■ ■ ■ at) ([a i+ i ■ ■ ■ aj][a j+ i ■ ■ ■ a,,]) 

and the second to be rewritten 

V = ([ai • •■a i ][a i+ 1 • • -a,]) (aj + \ • • • a n ), 

where each of the expressions a\ ■ ■ ■ at, a ,+1 • • • aj, and a J+ \ ■ ■ ■ a„ needs no parentheses. 
Thus, these expressions yield unique elements A, B , and C of G, respectively. The first 
expression yields A(BC ), the second yields (AB)C, and these two expressions give the 
same element of G, by associativity. • 
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Corollary 2.21. If G is a group and a, b e G, then 

( ab)~ l — b~ l a~ l . 

Proof. By Lemma 2.16(iii), it suffices to prove that ( ab)(b~ l a ~ l ) = 1 = ( b~ l a~ l )(ab ). 
Using generalized associativity, 

( ab)(b~ l a ~ l ) = [a(bb~ l )]a~ l — ( a\)a~ l — aa~ l — 1. 

A similar argument proves the other equation. • 


Corollary 2.22. If G is a group, if a e G, and ifm, n > 1, then 

„m-\-n /„ni\n „mn 

a — ci ci avid (a ) — a 

Proof. In the first instance, both elements arise from the expression having m + n factors 
each equal to a; in the second instance, both elements arise from the expression having mn 
factors each equal to a. • 

It follows that any two powers of an element a in a group commute: 


Proposition 2.23 (Laws of Exponents). Let G be a group, let a, b e G, and let m and 
n be (not necessarily positive) integers. 

(i) If a and b commute, then ( ab )" = a"b n . 

(ii) ( a n ) m = a mn . 

(iii) a m a n = a m+n . 


Sketch of Proof. The proofs, while routine, are lengthy double inductions. • 

The notation a" is the natural way to denote a * a * ■ ■ ■ * a, where a appears n times. 
However, if the operation is +, then it is more natural to denote a + a + • • • + a by na. 
Let G be a group written additively; if a, b e G and m and n are (not necessarily positive) 
integers, then Proposition 2.23 is usually rewritten: 

(i) n(a + b) — na + nb 

(ii) m(na) = (mn)a 

(iii) ma + na — (m + n)a 
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Definition. Let G be a group and let a e G. If a k — 1 for some k > 1, then the smallest 
such exponent k > 1 is called the order of a\ if no such power exists, then one says that a 
has infinite order. 

The additive group of integers, Z, is a group, and 3 is an element in it having infinite 
order (because 3 + 3 + • • • + 3 is never 0). 

In any group G, the identity has order 1, and it is the only element of order 1; an element 
has order 2 if and only if it is equal to its own inverse. 

The definition of order says that if x has order n and x m — 1 for some positive integer 
m, then n < m. The next theorem says that n must be a divisor of m. 

Theorem 2.24. If a e G is an element of order n, then a m = 1 if and only if n \ m. 

Proof Assume that a m — 1. The division algorithm provides integers q and r with 
m = nq + r. where 0 < r < n. It follows that a' = a m ~ nq — a' n a~ nq — 1. If r > 0, then 
we contradict n being the smallest positive integer with a n = 1. Hence, r = 0 and n \ m. 
Conversely, if m = nk, then a m — a nk — (a n ) k — l k — 1. • 

What is the order of a permutation in S n ? 

Proposition 2.25. LetaeS,,- 

(i) If a is an r-cycle, then a has order r. 

(ii) If a — fi\ ■■■ ft is a product of disjoint r,-cycles ft,, then a has order lcm{ri, ..., r, }. 

(iii) If p is a prime, then a has order p if and only if it is a p-cycle or a product of disjoint 
p-cycles. 

Proof (i) This is Exercise 2.5 on page 50. 

(ii) Each has order r,-, by (i). Suppose that a M — (1). Since the fj commute, (1) = 
a M = (/Si • • • ft) M — /6f • • • /S f M . By Exercise 2.11 on page 50, disjointness of the /Ts 
implies that yS ( M = (1) for each i, so that Theorem 2.24 gives r, \ M for all i; that is, M 
is a common multiple of rj, ..., r,. On the other hand, if m — lcm{i'|, ..., r,\, then it is 
easy to see that a m = (1). Therefore, a has order m. 

(iii) Write a as a product of disjoint cycles and use (ii). • 

For example, a permutation in S n has order 2 if and only if it is a transposition or a 
product of disjoint transpositions. 

Example 2.26. 

Suppose a deck of cards is shuffled, so that the order of the cards has changed from 

1, 2, 3, 4.52 to 2, 1,4, 3.52, 51. If we shuffle again in the same way, then the 

cards return to their original order. But a similar thing happens for any permutation a of 
the 52 cards: If one repeats ot sufficiently often, the deck is eventually restored to its orig¬ 
inal order. One way to see this uses our knowledge of permutations. Write a as a product 
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of disjoint cycles, say, a = P 1 P 2 ■ ■ ■ ft. where fti is an r,-cycle. By Proposition 2.25, a has 
order k , where k is the least common multiple of the r,. Therefore, a k — (1). 

Here is a more general result with a simpler proof (abstract algebra can be easier than 
algebra): If G is a finite group and a e G, then a k — 1 for some k > 1. Consider the 
subset {1, a, a 2 , ..., a", ...}. Since G is finite, there must be a repetition occurring on this 
infinite list: There are integers m > n with a m — a n , and hence 1 = a m a~ n — a' n ~". We 
have shown that there is some positive power of a equal to 1. [Our original argument that 
a k — (1) for a permutation a of 52 cards is not worthless, because it gives an algorithm 
computing k.] ◄ 

Let us state what we have just proved in Example 2.26. 

Proposition 2.27. If G is a finite group, then every x e G has finite order. 

Table 2.3 augments the table in Example 2.5(ii). 


Cycle Structure 

Number 

Order 

Parity 

(1) 

1 

1 

Even 

(12) 

10 

2 

Odd 

(12 3) 

20 

3 

Even 

(1 2 3 4) 

30 

4 

Odd 

(1 2 3 45) 

24 

5 

Even 

(1 2)(3 4 5) 

20 

6 

Odd 

(1 2) (3 4) 

15 

120 

2 

Even 


Table 2.3. Permutations in S 5 

Here are some geometric examples of groups. 

Definition. A motion is a distance preserving bijection tp : R 2 -» R 2 [it can be shown 
that q> is a linear transformation if (pi 0) = 0]. If n is a polygon in the plane, then its 
symmetry group Yin) consists of all the motions (p for which (pin) — n. The elements of 
Yin) are called symmetries of n. 

Example 2.28. 

(i) Let 7 t 4 be a square having sides of length 1 and vertices [ui, V 2 , U 3 , V 4 }; draw 7 T 4 in 
the plane so that its center is at the origin O and its sides are parallel to the axes. It can 
be shown that every q> e Yinf) permutes the vertices; indeed, a symmetry tp of 774 is 
determined by \q>ivj) : 1 < i < 4}, and so there are at most 24 = 4! possible symmetries. 
Not every permutation in .S 4 arises from a symmetry of 774 , however. If v, and Vj are 
adjacent, then || Vj — vj || = 1, but ||ui — U 31 | = V2 = ||i >2 — 1 ) 41 |; it follows that <p must 
preserve adjacency (for motions preserve distance). The reader may now check that there 
are only eight symmetries of 774 . Aside from the identity and the three rotations about O 
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by 90 °, 180 °, and 270 °, there are four reflections, respectively, in the lines V2V4, the 
x-axis, and the 3;-axis (for a generalization to come, note that the v-axis is Om\, where m \ 
is the midpoint of v\V2 , and the x-axis is Omi, where m2 is the midpoint of U2V3). The 
group £(774) is called the dihedral group 5 with 8 elements, and it is denoted by Z)g. 



v 4 v 3 


Figure 2.4 


l 'i 



(ii) The symmetry group £(775) of a regular pentagon 715 with vertices v\, ..., 115 and 
center O has 10 elements: the rotations about the origin of ( 12 j)°, where 0 < j < 4 , as 
well as the reflections in the lines Ov^ for 1 < k < 5 . The symmetry group £(715) is 
called the dihedral group with 10 elements, and it is denoted by D\ o- < 


v i m 1 v 2 



5 F. Klein was investigating those finite groups occurring as subgroups of the group of motions of R.T Some of 
these occur as symmetry groups of regular polyhedra (from the Greek poly meaning “many” and hedron meaning 
“two-dimensional side”). He invented a degenerate polyhedron that he called a dihedron, from the Greek words di 
meaning “two” and hedron , which consists of two congruent regular polygons of zero thickness pasted together. 
The symmetry group of a dihedron is thus called a dihedral group. For our purposes, it is more natural to describe 
these groups as in the text. 
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Definition. If jr„ is a regular polygon with n vertices vi,v 2 ,, v„ and center O. then the 
symmetry group Y. (tt,,) is called the dihedral group with 2 n elements, and it is denoted 6 
by D 2n . 

The dihedral group D 2n contains the n rotations p J about the center by (360 j/n)°, 
where 0 < j < n - 1 . The description of the other n elements depends on the parity of n. 
If n is odd (as in the case of the pentagon; see Figure 2.5), then the other n symmetries are 
reflections in the distinct lines Ovj, for i = 1, 2,..., n. If n — 2 q is even (see the square in 
Figure 2.4 or the regular hexagon in Figure 2.6), then each line O v, coincides with the line 
Ovq+i, giving only q such reflections; the remaining q symmetries are reflections in the 
lines OiHj for i — 1,2 ,... ,q, where m; is the midpoint of the edge i>;U;+i. For example, 
the six lines of symmetry of ne are O v i, Ov 2 , and Ov 3 , and Om \, Omi, and Om 2 . 

Exercises 

2.17 If , a 2 , ■ ■ ., fl/-i, a t are elements in a group G, prove that 

(<7ia2 ''' — a 7^ a 7-\ ''' ■ 

2.18 Assume that G is a set with an associative binary operation. Prove that ( ab)(cd) = a[(bc)d] 
without using generalized associativity. 

2.19 (i) Compute the order, inverse, and parity of 

a = (1 2)(4 3)(1 3 5 4 2)(1 5)(1 3)(2 3). 

(ii) What are the respective orders of the permutations in Exercises 2.1 on page 49 and 2.8 
on page 50? 

2.20 (i) How many elements of order 2 are there in S 5 and in 5g? 

(ii) How many elements of order 2 are there in S n ? 

Hint. You may express your answer as a sum. 

2.21 If G is a group, prove that the only element g e G with g~ = g is 1. 

2.22 This exercise gives a shorter list of axioms defining a group. Let H be a set containing an 
element e. and assume that there is an associative binary operation * on H satisfying the 
following properties: 

1. e * x = x for all x e H : 

2. for every x e H, there is x' e H with x' * x = e. 

(i) Prove that if h e H satisfies h * h = /?, then h — e. 

Hint. If h' * h = e, evaluate h' * h * h in two ways. 

(ii) For all x e H, prove that x * x' = e. 

Hint. Consider (x * x’)~. 

(iii) For all x e H, prove that x * e — x. 

Hint. Evaluate x * x' * x in two ways. 

6 Some authors denote D 2n by D n . 
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(iv) Prove that if e' e H satisfies e' * x = x for all x e H, then e' = e. 

Hint. Show that ( e')~ = e'. 

(v) Let x e H. Prove that if x" e H satisfies x" * x = e, then x" — x'. 

Hint. Evaluate x' * x * x" in two ways. 

(vi) Prove that H is a group. 

2.23 Let y be a group element of order nr, if m = pt for some prime p, prove that y l has order p. 
Hint. Clearly, (y , ) p = 1. Use Theorem 2.24 to show that no smaller power of y l is equal 
to 1. 

2.24 Let G be a group and let a e G have order k. If p is a prime divisor of k, and if there is x e G 
with x p = a , prove that x has order pk. 

2.25 Let G = GL(2, Q), and let 


A = 


0 

1 


-1 

0 


and 



1 

1 


Show that A 4 = / = B 6 , but that (. AB) n ^ 1 for all n > 0, where 1 = [ J ^ ] is the 2 x 2 
identity matrix. Conclude that AB can have infinite order even though both factors A and B 
have finite order (this cannot happen in a finite group). 

2.26 If G is a group in which x~ = 1 for every x e G, prove that G must be abelian. [The Boolean 
groups B(X) of Example 2.18 are such groups.] 


2.27 If G is a group with an even number of elements, prove that the number of elements in G of 
order 2 is odd. In particular, G must contain an element of order 2. 

Hint. Pair each element with its inverse. 


2.28 What is the largest order of an element in S n , where n = 1,2,..., 10? (We remark that no 
general formula is known for arbitrary n, although, in 1903, E. Landau found the asymptotic 
behavior.) 


2.4 Lagrange’s Theorem 


A subgroup H of a group G is a group contained in G so that if h, h' e H, then the product 
hh' in H is the same as the product bit in G. The formal definition of subgroup, however, 
is more convenient to use. 

Definition. A subset H of a group G is a subgroup if 

(i) 1 e H- 

(ii) if x, y e H, then xy e H\ 

(iii) if x e //, thenx -1 e H. 

If H is a subgroup of G, we write H < G; if H is a proper subgroup of G, that is, H G, 
then we write H < G. 
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Observe that {1} and G are always subgroups of a group G, where {1} denotes the 
subset consisting of the single element 1. More interesting examples will be given soon. 
A subgroup // G is called a proper subgroup. 

Let us see that every subgroup H < G is itself a group. Property (ii) shows that H 
is closed ; that is, H has a binary operation. Associativity (xy)z — x(yz ) holds for all 
x, y, z e G, and so this equation holds, in particular, for all x,y,z e H. Finally, (i) gives 
the identity, and (iii) gives inverses. 

It is easier to check that a subset H of a group G is a subgroup (and hence that it is a 
group in its own right) than to verify the group axioms for H: Associativity is inherited 
from the operation on G and hence it need not be verified again. 

Example 2.29. 

(i) The four permutations 

V = {(1), (1 2)(3 4), (1 3)(2 4), (1 4)(2 3)} 

form a group, because V is a subgroup of 54 : (1) e V; a 2 — (1) for each a e V, 
and so a~ l — a e Y; the product of any two distinct permutations in V — {(1)} is the 
third one. The group V is called the four-group (V abbreviates the original German term 
Vierergruppe). 

Consider what verifying associativity a (fee) = (afe)c would involve: There are 4 choices 
for each of a , fe, and c, and so there are 4 3 = 64 equations to be checked. Plainly, the best 
way to prove that V is a group is to show that it is a subgroup of 54 . 

(ii) If R 2 is the plane considered as an (additive) abelian group, then any line L through 
the origin is a subgroup. The easiest way to see this is to choose a point (a, fe) ^ (0, 0) on 
L and then note that L consists of all the scalar multiples (ra, rb). The reader may now 
verify that the axioms in the definition of subgroup do hold for L. ■* 

We can shorten the list of items needed to verify that a subset is, in fact, a subgroup. 

Proposition 2.30. A subset H of a group G is a subgroup if and only if H is nonempty 
and, whenever x, y e H, then vy -1 e H. 

Proof. Necessity is clear. For sufficiency, take x e H (which exists because H f 0); 
by hypothesis, 1 = xx~ [ e H. If y e H , then y -1 = ly -1 e H and, if x, y e H , then 
xy — x(y -1 ) -1 e H. • 

Of course, the simplest way to check that a candidate H for a subgroup is nonempty is 
to check whether 1 e H. 

Note that if the operation in G is addition, then the condition in the proposition is that 
H is a nonempty subset such that x, y e H implies x — y e H. 

Proposition 2.31. A nonempty subset H of a finite group G is a subgroup if and only if 
H is closed; that is, if a, b e H, then ab e H. In particular, a nonempty subset of 5„ is a 
subgroup if and only if it is closed. 
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Sketch of Proof. Since G is finite. Proposition 2.27 says that each x e G has finite order. 
Hence, if x" = 1, then 1 e H and x _1 = x" _1 e H. • 

This last proposition can be false when G is an infinite group. For example, let G be 
the additive group Z; the subset H = N is closed, but it is not a subgroup of Z. 

For Galois, in 1830, a group was just a subset H of S„ that is closed under composition; 
that is, if a, f e H, then af e H. A. Cayley, in 1854, was the first to define an abstract 
group, mentioning associativity, inverses, and identity explicitly. He then proved (see Cay¬ 
ley’s theorem) that every abstract group with n elements is, essentially, a subgroup of S n 
(the notion of isomorphism, introduced in the next section, will enable us to state this more 
precisely). 

Example 2.32. 

The subset A„ of S„, consisting of all the even permutations, is a subgroup because it is 
closed under multiplication: even o even = even. This subgroup A„ < S n is called the 
alternating 7 group on n letters. < 


Definition. If G is a group and a e G. write 

(a) — {a n : n e Z} = {all powers of a}; 

(a) is called the cyclic subgroup of G generated by a. A group G is called cyclic if there 
exists a e G with G = (a), in which case a is called a generator of G. 

It is easy to see that (a) is, in fact, a subgroup: 1 = a 0 e (a); a"a m — a n+m e (a); 
a -1 e (a). Example 2.17(iv) shows, for every n > 1, that the multiplicative group of 
all nth roots of unity is a cyclic group with the primitive nth root of unity £ — e 27r '/" as a 
generator. 

No doubt, the reader has seen the example of the integers modulo m in an earlier course. 
We merely recall the definition. Given m > 0 and a e Z, the congruence class [a] of a 
mod m was defined on page 34: 

[a] = {b e Z : b = a mod m} 

= {a + km : k e Z} 

= {..., a — 2m, a — m, a, a + m, a + 2 m ,...}. 


7 The alternating group first arose in studying polynomials. If 

fix) = (x - U\){x - u 2 ) ■ ■ ■ (x - u„), 

then the number D = Y\i < ji u i ~ u j) can change sign when the roots are permuted: If a is a permutation of 
{«!, « 2 ,..., u n ], then it is easy to see that Yii<j [<* («f) — ot.(uj)\ = ±D. Thus, the sign of the product alternates 
as various permutations a are applied to its factors. The sign does not change for those a in the alternating group, 
and this last fact can be used to give another proof of Theorem 2.13(ii). 
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Definition. The integers mod m, denoted 8 by I m , is the family of all congruence classes 
mod m. 

Recall that [a] = [b] in I,„ if and only if a = b mod m. In particular, [a] = [0] in 
I m if and only if a = 0 mod m; that is, [a] = [0] in I m if and only if m is a divisor of a. 
The definition of congruence mod m makes sense for all m > 0, but the cases m — 0 and 
m = 1 are not very interesting: a = b mod 0 means 0 | (a — b), which says that a — b\ 
a = b mod 1 means 1 | (a — b), which says that a and b are always congruent; that is, 
there is only one congruence class mod 1. Recall Proposition 1.19, which we now rewrite 
in the bracket notation. 

Proposition 1.19. Let m >2 be a fixed integer. 

(i) If a & Z, then [a] = [r] for some r with 0 < r < m. 

(ii) If 0 < r' < r < m. then [/] [r], 

(iii) I,„ has exactly m elements, namely, [0], [1], ..., [m — 1], 

For every m > 2, I„, is an (additive) cyclic group, where 

[a] + [b] = [a + b]\ 

the identity is [0], the inverse of [a] is [—a], and a generator is [1], Part (iii) shows that I m 
has order m. 

A cyclic group can have several different generators. For example, (a) = [a~ l ). 

Theorem 2.33. 

(i) If G — (a) is a cyclic group of order n, then a k is a generator of G if and only if 
(, k, n) = 1. 

(ii) If G is a cyclic group of order n and gen(G) = {all generators of G }, then 

|gen(G)| = cp(n), 

where (p is the Euler (p-function. 

Proof, (i) If a k generates G, then a e (a k ), so that a — a kt for some t e Z. Hence, 
a k '~ l — 1; by Theorem 2.24, n \ (kt — 1), so there is v e Z with nv — kt — 1. Therefore, 
1 is a linear combination of k and m, and so (k, n) — 1. 

Conversely, if ( k, n) — 1, then nt + ku — 1 for t, u e Z; hence 

a = a n,+ku = a n, a ku = a ku e (, a k ). 

Therefore, every power of a also lies in (a k ) and G = {a k ). 

(ii) Proposition 1.38 says that (pin) = \{k < n : (k,n) = 1}|. The next proposition shows 
that G = {1, a ,..., a n ~ 1 }, and so this result follows from part (i). • 


8 We introduce this new notation because there is no commonly agreed one; the most popular contenders are 
Z m and Z/mZ. We have chosen I m because I is the initial letter of integers. The usual notation Z for the integers 
(it is the initial letter of the German Zahleri) is almost universally accepted, and so a change from Z to I would be 
consistent but too distracting. 
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Proposition 2.34. Let G be a finite group and let a € G. Then the order of a is (a) |, the 
number of elements in (a). 

Proof Since G is finite, there is an integer k > 1 with 1, a. a 2 ,..., a k ~ 1 consisting of k 
distinct elements, while 1, a, a 2 , ..., a k has a repetition; hence a k e {l, a. a 2 , a k ~ l }; 
that is, a k = a' for some i with 0 < i < k. If i > 1, then a k ~' = 1, contradicting the 
original list having no repetitions. Therefore, a k = a° — 1, and k is the order of a (being 
the smallest positive such k). 

If H — {1, a, a 2 , ..., a* -1 }, then \H\ — k\ it suffices to show that H = (a). Clearly, 
H c (a). For the reverse inclusion, take a 1 e (a). By the division algorithm, i — qk + r. 
where 0 < r < k. Hence a' — a qk+r = a qk a r = ( a k ) q a r — a r e H\ this gives (a) C H, 
and so (a) — H. • 

Definition. If G is a finite group, then the number of elements in G. denoted by |G|, is 
called the order of G. 

The word order is used in two senses: the order of an element a e G and the order |G| 
of a group G. Proposition 2.34 shows that the order of a group element a is equal to | (a) \. 

Proposition 2.35. The intersection P|;g/ Hi of any family of subgroups of a group G is 
again a subgroup of G. In particular, if H and K are subgroups of G, then II C\ K is a 
subgroup of G. 

Sketch of Proof This follows easily from the definitions. • 

Corollary 2.36. If X is a subset of a group G, then there is a subgroup {X) of G con¬ 
taining X that is smallest in the sense that (X) < H for every subgroup H of G that 
contains X. 

Proof There exist subgroups of G that contain X: for example, G itself contains X. 
Define (X) = (~] X ch H, the intersection of all the subgroups H of G that contain X. By 
Proposition 2.35, (X) is a subgroup of G; of course, (X) contains X because every H 
contains X. Finally, if H is any subgroup containing X. then H is one of the subgroups 
whose intersection is (X)\ that is, (X) < H. • 

Note that there is no restriction on the subset X in the last corollary; in particular, X — 0 
is allowed. Since the empty set is a subset of every set, we have 0 C H for every subgroup 
H of G. Thus, (0) is the intersection of all the subgroups of G; in particular, (0) < {1}, 
and so (0) = {1}. 

Definition. If X is a subset of a group G, then (X) is called the subgroup generated 
byX. 

If X is a nonempty subset of a group G, define a word 9 on X to be an element g e G 
of the form g — x^ 1 ■ ■ ■ x e f. where x; e X and <?, = ±1 for all i. 

®This term will be modified a bit when we discuss free groups. 
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Proposition 2.37. If X is a nonempty subset of a group G, then (X) is the set of all the 
words on X. 

Proof. We claim that W (X), the set of all the words on X, is a subgroup. If x e X, then 
1 = xx _1 e W(X)-, the product of two words on X is also a word on X: the inverse of a 
word on X is a word on X. It now follows that (X) < W(X ), for W(X) obviously contains 
X (and (X) is the intersection of all the subgroups of G containing X). On the other hand, 
any subgroup of G containing X must also contain W{X), and so (X) = W(X). • 


Example 2.38. 

(i) If G = (a) is a cyclic group with generator a, then G is generated by the subset X — {a}. 

(ii) The dihedral group TG,,, the symmetry group of a regular n-gon, is generated by p, er, 

where p is a rotation by (360/«)° and o is a reflection. Note that these generators satisfy 
the equations p n = 1, o 2 — 1, and opo — p . ◄ 

Perhaps the most fundamental fact about subgroups H of a finite group G is that their 
orders are constrained. Certainly, we have |//| < |G|, but it turns out that |//| must be a 
divisor of | G \. To prove this, we introduce the notion of coset. 

Definition. If H is a subgroup of a group G and a e G, then the coset aH is the subset 
aH of G, where 

aH — {ah : h e H}. 

The cosets defined are often called left cosets\ there are also right cosets of H, namely, 
subsets of the form Ha = {ha : h e H}. In general, left cosets and right cosets may be 
different, as we shall soon see. 

If we use the * notation for the operation in a group G, then we denote the coset aH by 
a * H, where 

a * H = {a * h : h G H]. 

In particular, if the operation is addition, then the coset is denoted by 

a T H = {a -{- h ; h € H }. 

Of course, a = al e aH. Cosets are usually not subgroups. For example, if a f //, 
then 1 aH (otherwise 1 = ah for some h e H, and this gives the contradiction a — 
h~ l e H). 

Example 2.39. 

(i) Consider the plane R 2 as an (additive) abelian group and let L be a line through the 
origin O (see Figure 2.7 on page 68); as in Example 2.29(ii), the line L is a subgroup of 
M 2 . If ft e R 2 , then the coset f + L is the line L' containing f that is parallel to L, for if 
ra e L, then the parallelogram law gives fi + ra <= L'. 
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(ii) Let A be an m x n matrix with real entries, and let Ax = b be a consistent linear system 
of equations; that is, there is a column vector s e R" with As = b. The solution space 
S — {x e M" : Ax = 0} of the homogeneous system Ax = 0 is an additive subgroup of 
R", and the solution set )x e 1" : Ax = b} of the original inhomogeneous system is the 
coset s + S. 

(iii) If G — S 3 and H — ((1 2)}, there are exactly three left cosets of H, namely 

H = {(1), (1 2)} = (1 2 )H, 

(1 3 )H = {(1 3), (1 2 3)} = (12 3 )H, 

(2 3 )H = {(2 3), (1 3 2)} = (13 2 )H, 

each of which has size 2. Note that these cosets are also “parallel;” that is, distinct cosets 
are disjoint. 

Consider the right cosets of H = ((1 2)} in S 3 : 

H = {(1), (1 2)} = H( 1 2), 

Hll 3) = {(1 3), (1 3 2)} = Ha 3 2), 

H(2 3) = {(2 3), (1 2 3)} = H (1 2 3). 

Again, we see that there are exactly 3 (right) cosets, each of which has size 2. Note that 
these cosets are “parallel”; that is, distinct (right) cosets are disjoint. ◄ 


Lemma 2.40. Let H be a subgroup of a group G, and let a, b e G. 

(i) aH — bH if and only ifb~ l a € H. In particular, aH = H if and only if a € H. 

(ii) If aH (T bH 0, then aH — bH. 

(iii) \aH\ = \H\ for all a e G. 
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Remark. In Exercise 2.29 on page 72, it is shown that Ha = Hb if and only if ab 1 e H , 
and hence Ha — H if and only if a e II. a 

Sketch of Proof. The first two statements follow from observing that the relation on G, 
defined by a = b if b~ l a e H , is an equivalence relation whose equivalence classes are 
the cosets; it follows from Proposition 1.54 that the cosets of H partition G. The third 
statement is true because h ah is a bijection H -> aH. • 

The next theorem is named after J. L. Lagrange, who saw, in 1770, that the order of 
certain subgroups of S n are divisors of «!. The notion of group was invented by Galois 60 
years afterward, and it was probably Galois who first proved the theorem in full. 

Theorem 2.41 (Lagrange’s Theorem). If H is a subgroup of a finite group G, then \H\ 
is a divisor of \ G\. 

Proof. Let {a\ II. agH, ..., a t H] be the family of all the distinct cosets of H in G. Then 

G = a\H U a 2 H U • • • U atH, 

because each g e G lies in the coset gH , and gH — ajH for some i. Moreover, 
Lemma 2.40(h) shows that the cosets partition G into pairwise disjoint subsets. It follows 
that 

\G\ = \a l H\ + \a 2 H\ + --- + \a t H\. 

But \ajH\ — \H\ for all i, by Lemma 2.40(iii), so that |G| = t\H\, as desired. • 


Definition. The index of a subgroup H in G , denoted by [G : H], is the number of left 10 
cosets of H in G. 

The index [G : H] is the number t in the formula |G| = t\H \ in the proof of Lagrange’s 
theorem, so that 

IG| = [G : H]\H\; 

this formula shows that the index [G : H] is also a divisor of | G |; moreover. 


[G:H] = \G\/\H\. 


Example 2.42. 

Recall that the dihedral group ZG/i = £ (tt,,). the group of symmetries of the regular n-gon 
7r„, has order 2 n and it contains a cyclic subgroup of order n generated by a rotation p. The 
subgroup ( p) has index [ZG>, : (p)] = 2. Thus, there are two cosets: (p) and o (p), where 
o is any reflection outside of (p). It follows that every element a e ZG,, has a factorization 
a — o' p J , where i = 0, 1 and 0 < j < n. ◄ 


l(J Hxercise 2.37 on page 72 shows that the number of left cosets of a subgroup is equal to the number of its 
right cosets. 
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Corollary 2.43. If G is a finite group and a e G, then the order of a is a divisor of |G|. 
Proof This follows at once from Proposition 2.34, for the order of a is | {a) |. • 

Corollary 2.44. IfG is a finite group, then a 0 — l for all a e G. 

Proof If a has order cl, then |G| = dm for some integer m, by the previous corollary, 
and so — a d ' n = ( a d ) m = 1. • 

Corollary 2.45. If p is a prime, then every group G of order p is cyclic. 

Proof If a e G and a f I, then a has order d > 1, and d is a divisor of p. Since p is 
prime, d — p, and so G — {a). • 

We have seen that I m , under addition, is a cyclic group of order m. Now multiplication 
p : I,„ x l m -> I m , given by 

[a][fo] = [ab], 

is also a binary operation on I,,, (which is well-defined, by Proposition 1.20); it is associa¬ 
tive, commutative, and [1] is an identity element. However, I,„ is not a group under this 
operation because inverses may not exist; for example, [0] has no multiplicative inverse. 

Proposition 2.46. The set U(I m ), defined by 

U (I m ) = {[r] e l m : (r, m) = 1 }, 

is a multiplicative group of order 0 im), where 0 is the Euler fi-function. In particular, if p 
is a prime, then U (Ip) = I*, the nonzero elements of Ip, is a multiplicative group of order 
P ~ 1. 

Proof By Exercise 1.14 on page 12, (r, m) = 1 = (r', m) implies (rr', in) — 1; hence 
U (I m ) is closed under multiplication. We have already mentioned that multiplication is 
associative and that [1] is the identity. If ( a , m ) = 1, then [a][x] = [1] can be solved for 
[x] in I,„. Now (x, m ) = 1, for rx + sm — 1 for some integer .y, and so Proposition 1.13 
on page 5 gives (x, m) = 1; therefore, [x] e U{ I m ), and so each [r] e U(I m ) has an 
inverse. Therefore, U(I m ) is a group; the definition of the Euler 0-function shows that 

|£/CU)|= *<-»). 

The last statement follows from <p(p) = p — 1 when p is a prime. • 

In Chapter 3, we will prove, for every prime p, that I* is a cyclic group. 

Here is a group-theoretic proof of Theorem 1.24, Fermat’s theorem. Our earlier proof 
used binomial coefficients and the fact that p \ ( p r ) for 0 < r < p. 

Corollary 2.47 [Fermat). If p is a prime and a e Z, then 


a p = a mod p. 
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Proof. It suffices to show that [ a p ] = [a] in l p . If [a] = [0], then [ a p ] = [ a\ p = [O]^ = 
[0] = [a]. If [a] ^ [0], then [a] e I*, the multiplicative group of nonzero elements in I p . 
By Corollary 2.44 to Lagrange’s theorem, [a] p ~ l — [1],because |I*| = p — 1. Multiplying 
by [a] gives the desired result [a p ~\ = [a) p — [a]. Therefore, a p = a mod p. • 

We now give a generalization of Fermat’s theorem due to Euler. 

Theorem 2.48 {Euler). If(r.m) — 1, then 

_ j moc j m 

Proof. Since |f/(I m )| = <p(m), Corollary 2.44 (essentially Lagrange’s theorem) gives 
= [1] for all [r] e U (I,„). In congruence notation, this says that if (r, m) = 1, then 

r 4>(m) — ] m() d m , 


Example 2.49. 

It is easy to see that 

C/(I 8 ) = {[1], [3], [5], [7]} 

is a group (resembling the four-group V) in which the square of each element is [1], while 

U (Ito) = {[1]> [3], [7], [9]} 

is a cyclic group of order 4 [after we introduce isomorphisms in the next section, we will 
say that U (Is) is isomorphic to Y and U (Iio) is isomorphic to I 4 ]. ◄ 


Theorem 2.50 (Wilson’s Theorem). An integer p is a prime if and only if 


{p — 1 )! = — 1 mod p. 


Proof. Assume that p is a prime. If a\, a 2 ,. ■ ■, a n is a list of all the elements of a finite 
abelian group G, then the product a\U 2 ■ ■ ■ a n is the same as the product of all elements 
a with a 2 — 1, for any other element cancels against its inverse. Since p is prime. Ex¬ 
ercise 1.37 on page 14 implies that I* has only one element of order 2, namely, [—1]. It 
follows that the product of all the elements in I*, namely, [(p — 1)!], is equal to [—1]; 
therefore, {p — 1)! = — 1 mod p. 

Conversely, assume that m is composite: there are integers a and b with m = ab and 

1 < a < b < m. If a < b. then m — ab is a divisor of (m—1)!, and so (m — 1)! = 0 mod m. 
If a = b, then m = a 2 . If a — 2, then (a 2 — 1)! = 3! = 6 = 2 mod 4 and, of course, 

2 # -1 mod 4. If 2 < a. then 2a < a 2 , and so a and 2a are factors of (a 2 — 1)!; therefore, 
{a 2 — 1)! = 0 mod a 2 . Thus, (a 2 — 1)! ^ — 1 mod a 2 , and the proof is complete. • 
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Remark. We can generalize Wilson’s theorem in the same way that Euler’s theorem 
generalizes Fermat’s theorem: Replace U(I p ) by U(I m ). For example, for all m > 3, it 
can be proved that U (ly ) has exactly 3 elements of order 2, namely, [—1], [1+ 2" 1-1 ], and 
[—(1+2'” -1 )]. It now follows that the product of all the odd numbers r, where 1 < r < 2 m 
is congruent to 1 mod 2"', because 

(— 1)(1 + 2 "' ;i )(-l -2"' l ) = (1 + 2 "' ') 2 

= 1 + 2 m + 2 lm ~ 2 = 1 mod 2 m . + 


Exercises 


2.29 Let H be a subgroup of a group G. 

(i) Prove that right cosets Ha and Hb are equal if and only if ab~ l e H. 

(ii) Prove that the relation a = b if ab~ l e H is an equivalence relation on G whose 
equivalence classes are the right cosets of H. 

2.30 (i) Define the special linear group by 

SL(2, R) = [A € GL(2, R) : det(A) = 1). 

Prove that SL(2, R) is a subgroup of GL(2, R). 

(ii) Prove that GL(2, Q) is a subgroup of GL(2, R). 

2.31 (i) Give an example of two subgroups H and AT of a group G whose union H U K is not a 

subgroup of G. 

Hint. Let G be the four-group V. 

(ii) Prove that the union H U K of two subgroups is itself a subgroup if and only if either H 
is a subset of K or K is a subset of H. 

2.32 Let G be a finite group with subgroups H and K. If H < K, prove that 

[G : H] = [G : K][K : H], 


2.33 If H and K are subgroups of a group G and if \H\ and | A" | are relatively prime, prove that 
h n K = {l}. 

Hint. If* e H n K, then*l ff l = 1 = *1*1. 

2.34 Prove that every subgroup 5 of a cyclic group G — (a) is itself cyclic. 

Hint. If 5 7 ^ 1, choose k to be the smallest positive integer with a k e S. 

2.35 Prove that a cyclic group G of order n has a subgroup of order d for every d dividing n. 

Hint. If G = {a) and n = dk, consider [ci'). 

2.36 Let G be a group of order 4. Prove that either G is cyclic or x~ — 1 for every x e G. 
Conclude, using Exercise 2.26 on page 62, that G must be abelian. 

2.37 If H is a subgroup of a group G, prove that the number of left cosets of H in G is equal to the 
number of right cosets of H in G. 

Hint. The function (p\ aH m- Ha~ 1 is a bijection from the family of all left cosets of H to 
the family of all right cosets of H. 
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2.38 Let p be an odd prime, and let aj, , a p _ j be a permutation of (1,2. p — 1}. Prove that 

there exist i ^ j with ia, = jaj mod p. 

Hint. Use Wilson’s theorem. 


2.5 Homomorphisms 


An important problem is determining whether two given groups G and H are somehow 
the same. For example, we have investigated S 3 , the group of all permutations of X — 
{1,2, 3}. The group Sy of all the permutations of Y — {a, b. c} is a group different from 
S3 because permutations of {1,2, 3} are different than permutations of {a, b, c}. But even 
though S3 and Sy are different, they surely bear a strong resemblance to each other (see 
Example 2.51). A more interesting example is the strong resemblance between S3 and Z> 6 , 
the symmetries of an equilateral triangle. The notions of homomorphism and isomorphism 
allow us to compare different groups, as we shall see. 

Definition. If (G, *) and (H, o) are groups (we have displayed the operation in each), 
then a function f: G —»■ H is a homomorphism 1 1 if 

fix * y) = fix) o fiy) 

for all x, y e G. If / is also a bijection, then / is called an isomorphism. Two groups G 
and H are called isomorphic , denoted by G A //, if there exists an isomorphism / : G —> 
H between them. 

A multiplication table of a group G displays every product ah for a. b e G. 


G 

a\ 

ci2 

d j 

d n 

a\ 

a\a\ 

a\a,2 

d\dj 

d\dn 

a,2 

Cl2Cl\ 

d2Cl2 

d2dj 

d2dn 

at 

aia\ 

diCl2 

did; 

did n 

a n 

a n a\ 

ClnCl 2 ' ' 

• dyid j * 1 

d n d n 


Definition. Let a \, 02 , ■ ■ ■, a n be a list with no repetitions of all the elements of a group 
G. A multiplication table for G is an n x n array whose ij entry is UjU r 

11 The word homomorphism comes from the Greek homo meaning "same" and morph meaning “shape” or 
“form.” Thus, a homomorphism carries a group to another group (its image) of similar form. The word isomor¬ 
phism involves the Greek iso meaning “equal,” and isomorphic groups have identical form. 
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A multiplication table of a group G of order n depends on how we list the elements of 

G, and so G has n \ different multiplication tables. (Thus, the task of determining whether 
a multiplication table of a group G is the same as some multiplication table of another 
group H is a daunting one: It involves about n ! comparisons, each of which involves com¬ 
paring n 2 entries.) If a\, ai,..., a n is a list of all the elements of G with no repetitions, 
and if /: G -> H is a bijection, then f(a\), fiai), . ■., f(a„) is a list of all the ele¬ 
ments of H with no repetitions, and this latter list determines a multiplication table for 

H. That / is an isomorphism says that if we superimpose the given multiplication table 
for G (determined by a\, ai, ■ ■ ■, a„) upon the multiplication table for H [determined by 
f (a \), fiai),. ■ ■, /(««)], then the tables match: If a,a; is the ij entry in the given multi¬ 
plication table of G, then f (ai) f (aj) — fiataj) is the ij entry of the multiplication table 
of H. In this sense, isomorphic groups have the same multiplication table. Thus, isomor¬ 
phic groups are essentially the same, differing only in the notation for the elements and the 
operations. 


Example 2.51. 

Let us show that G — S 3 , the symmetric group permuting {1, 2, 3), and H = Sy, the 
symmetric group of all the permutations of Y = {a, b, c}, are isomorphic. First, enumerate 


G: 


(1), (1 2), (1 3), (2 3), (1 2 3), (1 3 2). 


We define the obvious function cp: S 3 —> Sy that replaces numbers by letters: 


(1), (ab), (a c), (be), (a be), (acb ). 


Compare the multiplication table for S 3 arising from this list of its elements with the mul¬ 
tiplication table for Sy arising from the corresponding list of its elements. The reader 
should write out the complete tables of each and superimpose one on the other to see that 
they do match. We will check only one entry. The 4,5 position in the table for S 3 is the 
product (2 3)(1 2 3) = (1 3), while the 4,5 position in the table for Sy is the product 
(.b c)(a b c) = (a c). 

This result is generalized in Exercise 2.39 on page 80. ◄ 


Lemma 2.52. Let f : G —> H be a homomorphism of groups. 

(i) /CD = 1 

(ii) /(*-*) = /(*)-' 

(iii) f(x n ) — f(x) n for all n e Z 

Sketch of Proof, (i) 1-1=1 implies /(1)/(1) = /(1). 

(ii) 1 = xx~ l implies 1 = /(1) = f(x)f(x~ l ). 

(iii) Use induction to show that f(x n ) = f(x) n for all n > 0. Then observe that x~ n = 

(x -1 )", and use part (ii). • 
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Example 2.53. 

If G and H are cyclic groups of the same order m, then G and H are isomorphic. (It follows 
from Corollary 2.45 that any two groups of prime order p are isomorphic.) Although 
this is not difficult, it requires some care. We have G — {1, a, a 2 ,..., and H = 

{1, b, b 2 , ..., and the obvious choice for an isomorphism is the bijection /: G —> 

H given by f(a') = b‘. To check that / is an isomorphism, that is, f(a'a J ) — b l+J , 
involves two cases: i + j < m — 1; i + j > m — 1. We give a less computational proof in 
Example 2.71. ◄ 

A property of a group G that is shared by any other group isomorphic to it is called an 
invariant of G. For example, the order |G| is an invariant of G, for isomorphic groups 
have the same orders. Being abelian is an invariant [if / is an isomorphism and a and b 
commute, then ab — ba and 

f(a)f(b) = f (ab) = f(ba) = f(b)f(a); 

hence, f (a) and f(b) commute]. Thus, ]% and S 3 are not isomorphic, for Ig is abelian and 
S 3 is not. In general, it is a challenge to decide whether two given groups are isomorphic. 
See Exercise 2.42 on page 80 for more examples of invariants. 

Example 2.54. 

We present two nonisomorphic abelian groups of the same order. 

As in Example 2.29(i), let V be the four-group consisting of the following four permu¬ 
tations: 

V={(1), (1 2)(3 4), (1 3)(2 4), (1 4)(2 3)}, 

and let 1x4 = (i) — { 1 , i. — 1 , —i } be the multiplicative cyclic group of fourth roots of 
unity, where i 2 = —1. If there were an isomorphism / : V —» /r 4 , then surjectivity 
of / would provide some x e Y with i — f (x). But x 2 = (1) for all x e Y, so that 
i 2 = f(x) 2 = f (x 2 ) — /((1)) = 1, contradicting i 2 = — 1. Therefore, Y and /x 4 are not 
isomorphic. 

There are other ways to prove this result. For example, /r 4 is cyclic and V is not; /x 4 
has an element of order 4 and V does not; fx 4 has a unique element of order 2, but V 
has 3 elements of order 2. At this stage, you should really believe that /x 4 and V are not 
isomorphic! ◄ 

Definition. If /: G —> H is a homomorphism, define 

kernel 12 f — {x e G \ f(x) — 1} 


and 


image f = {h e H : h — fix ) for some x e G}. 


We usually abbreviate kernel / to ker / and image / to im /. 


12 Kernel comes from the German word meaning “grain” or “seed” (com comes from the same word). Its usage 
here indicates an important ingredient of a homomorphism. 
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Example 2.55. 

(i) If /Z 2 is the multiplicative group p .2 — {±1}, then sgn: S„ —> p 2 is a homomorphism, 
by Theorem 2.12. The kernel of sgn is the alternating group A n , the set of all even permu¬ 
tations. 

(ii) Determinant is a surjective homomorphism det: GL(n, R) —>• R x , the multiplicative 

group of nonzero real numbers, whose kernel is the special linear group SL(n, R) of all 
n x n matrices of determinant 1 . ◄ 


Proposition 2.56. Let f: G —> H be a homomorphism. 

(i) ker f is a subgroup of G and im / is a subgroup of H. 

(ii) If x e ker f and if a e G, then axa~ l € ker/. 

(iii) / is an injection if and only if ker / = {1}. 

Sketch of Proof (i) Routine. 

(ii) f(axa~ l ) = f(a)lf(a)~ l = 1. 

(iii) f(a) — f(b ) if and only if f(b~ l a ) =1. • 


Definition. A subgroup K of a group G is called a normal subgroup if A' e K and g e G 
imply gkg~ l e K. If K is a normal subgroup of G, we write K <] G. 

The proposition thus says that the kernel of a homomorphism is always a normal sub¬ 
group. If G is an abelian group, then every subgroup K is normal, for if A e K and g e G, 
then gkg~ l = kgg~ l — k e K. The converse of this last statement is false: In Exam¬ 
ple 2.63, we shall show that there is a nonabelian group (the quaternions ), each of whose 
subgroups is normal. 

The cyclic subgroup H = ((1 2)) of S 3 , consisting of the two elements (1) and (1 2), is 
not a normal subgroup of S3: If a — (1 2 3), then a -1 = (32 1), and 

»(1 2)a~ l =(1 2 3)(1 2)(3 2 1) = (2 3) $ H 

[by Theorem 2.9, «(1 2)a~ l — (a 1 «2) = (2 3)]. On the other hand, the cyclic subgroup 
K — ((1 2 3)) of S 3 is a normal subgroup, as the reader should verify. 

It follows from Examples 2.55(i) and 2.55(ii) that A„ is a normal subgroup of S„ and 
SL(n, R) is a normal subgroup of GL(», R) (however, it is also easy to prove these facts 
directly). 


Definition. If G is a group and a e G, then a conjugate of a is any element in G of the 
form 

gag~', 

where g e G. 

It is clear that a subgroup K < G is a normal subgroup if and only if K contains all the 
conjugates of its elements: If A e K , then gkg~ l e K for all g e G. 
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Example 2.57. 

(i) Theorem 2.9 states that two permutations in S„ are conjugate if and only if they have 
the same cycle structure. 

(ii) In linear algebra, two matrices A, B e GL(n, R) are called similar if they are conju¬ 
gate; that is, if there is a nonsingular matrix P with B = P AP~ 1 . ■* 


Definition. If G is a group and g e G. define conjugation y g : G —*■ G by 

Yg(a) = gag~ l 


for all a e G. 


Proposition 2.58. 

(i) If G is a group and g € G, then conjugation y g : G -> G is an isomorphism. 

(ii) Conjugate elements have the same order. 

Proof, (i) If g, h <= G, then 

(Yg ° Yh)(a) = Yg(hah ~ l ) = g(hah~ l )g~ l = ( gh)a(gh)~ l = y g h(a)\ 

that is, 

Yg ° Yh = Ygh • 

It follows that each y g is a bijection, for y g o y - 1 = y i = 1 = y g - 1 o y g . We now show 
that y g is an isomorphism: if a, b e G, 

Ygiab ) = g(ab)g~ l = ga(g~ l g)bg~ l = y g (a)y g (b). 


(ii) To say that a and b are conjugate is to say that there is g e G with b — gag~ [ \ that is, 
b — y g (a). But y g is an isomorphism, and so Exercise 2.42 on page 80 shows that a and 
b — y g (a) have the same order. • 

Example 2.59. 

Define the center of a group G, denoted by Z(G), to be 

Z(G) — {z e G : zg = gz for all g e G}; 

that is, Z(G) consists of all elements commuting with everything in G. 

It is easy to see that Z(G) is a subgroup of G; it is a normal subgroup because if 
z e Z(G) and g e G, then 

gzg~ l = zgg~ l = z e Z(G). 

A group G is abelian if and only if Z(G) = G. At the other extreme are centerless groups 
G for which Z(G) = {1}; for example, Z(.S 3 ) = {1}; indeed, all large symmetric groups 
are centerless, for Exercise 2.15 on page 51 shows that Z(S„) — {1} for all n > 3. ◄ 
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Example 2.60. 

If G is a group, then an automorphism of G is an isomorphism /: G -* G. For example, 
every conjugation y g is an automorphism of G (it is called an inner automorphism ), for 
its inverse is conjugation by g _1 . The set Aut(G) of all the automorphisms of G is itself a 
group, under composition, and the set of all conjugations, 

Inn(G) = {y g :ge G], 

is a subgroup of Aut(G). Exercise 2.64 on page 82 says that the function T: G -> Aut(G), 
given by g y g , is a homomorphism with im T = Inn(G) and ker T = Z(G); moreover, 
Inn(G) < Aut(G). ◄ 

Example 2.61. 

The four-group V is a normal subgroup of 54 . Recall that the elements of V are 

V= {(1), (1 2)(3 4), (1 3)(2 4), (1 4)(2 3)}. 

By Theorem 2.9, every conjugate of a product of two transpositions is another such. But 
we saw, in Example 2.5(i), that only 3 permutations in S 4 have this cycle structure, and so 
V is a normal subgroup of 54 . ◄ 


Proposition 2.62. 

(i) If H is a subgroup of index 2 in a group G, then g 2 e H for every g e G. 

(ii) If H is a subgroup of index 2 in a group G, then H is a normal subgroup of G. 

Proof, (i) Since H has index 2, there are exactly two cosets, namely, H and aH, where 
a f H. Thus, G is the disjoint union G — H U aH. Take g e G with g f H , so that 
g — ah for some h e H. If g 2 f H , then g 2 = ah', where h' e H. Hence, 

g — g~ l g 2 = h~ l a~ l alf — h~ [ h' e H , 


and this is a contradiction. 

(ii) 13 It suffices to prove that if h e //, then the conjugate ghg~ l e H for every g e G. 
Since H has index 2, there are exactly two cosets, namely, H and aH , where a f. H. Now, 
either g e H or g e aH. If g e H , then ghg~ l e H , because H is a subgroup. In 
the second case, write g — ax, where x e H. Then ghg~ l = a{xhx~ x )a~ 1 = ah'a~ l , 
where h’ — xhx~ l e H (for It is a product of three elements in H). If ghg~ 1 H, then 
ghg~ l — ah'a~ l e aH ; that is, ah'a~ l — ay for some v e H. Canceling a, we have 
h'a~ l — y, which gives the contradiction a — y~ l h' e H. Therefore, if h e H, every 
conjugate of h also lies in //; that is, H is a normal subgroup of G. • 


13 Another proof of this is given in Exercise 2.50 on page 81. 
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Definition. The group of quaternions 14 is the group Q of order 8 consisting of the fol¬ 
lowing matrices in GL(2, C): 

Q = { I, A, A 2 , A 3 , B. BA. BA 2 , BA 3 }, 

where I is the identity matrix. 



The element A e Q has order 4, so that (A) is a subgroup of order 4 and hence of 
index 2; the other coset is B (A) — {B, BA. BA 2 , BA 3 }. Thus, every element in Q has an 
expression of the form B' A 1 , where ; = 0, 1 and j = 0, 1, 2, 3. 

Example 2.63. 

In Exercise 2.59 on page 81, the reader will check that Q is a nonabelian group of order 8 
having exactly one element of order 2 , and hence only one subgroup of order 2 , namely, 
(— I). We claim that every subgroup of Q is normal. Lagrange’s theorem says that every 
subgroup of Q has order a divisor of 8 , and so the only possible orders of subgroups are 
1, 2, 4, or 8 . Clearly, the subgroup {/} and the subgroup of order 8 (namely, Q itself) are 
normal subgroups. By Proposition 2.62(ii), any subgroup of order 4 must be normal, for it 
has index 2. Finally, the subgroup (— I) is normal, for it is the center, Z(Q). < 

Example 2.63 shows that Q is a nonabelian group that is like abelian groups in that 
every subgroup is normal. This is essentially the only such example. A nonabelian finite 
group is called Hamiltonian if every subgroup is normal; every hamiltonian group has the 
form Q x A, where A is an abelian group with no elements of order 4 ( direct products 
will be introduced in the next section). A proof of this result can be found in Robinson, A 
Course in the Theory of Groups, page 139. 

Lagrange’s theorem states that the order of a subgroup of a finite group G must be a 
divisor of |G|. This suggests the question, given a divisor d of |G|, whether G must contain 
a subgroup of order d. The next result shows that there need not be such a subgroup. 

Proposition 2.64. The alternating group A4 is a group of order 12 having no subgroup 
of order 6. 

Proof. First, | A 4 1 = 12, by Exercise 2.12 on page 50. If A 4 contains a subgroup H of 
order 6 , then H has index 2, and so a 2 e H for every a e A4, by Corollary 2.62(i). If a is 
a 3-cycle, however, then a has order 3, so that a = a 4 = (a 2 ) 2 . Thus, H contains every 
3-cycle. This is a contradiction, for there are 8 3-cycles in A4. • 

1 1 \V. R. Hamilton invented a system having two operations, addition and multiplication, that he called quater¬ 
nions, for it was four-dimensional. The group of quaternions consists of 8 special elements in that system; see 
Exercise 2.60 on page 82. 
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Exercises 

2.39 Show that if there is a bijection f: X -*■ Y (that is, if X and Y have the same number of 
elements), then there is an isomorphism <p: Sx —>■ Sy- 

Hint. If a e Sx, define i p(a) = /oao/' 1 . In particular, show that if \X | =3, then (p takes 
a cycle involving symbols 1, 2, 3 into a cycle involving a, b, c, as in Example 2.51. 

2.40 (i) Show that the composite of homomorphisms is itself a homomorphism. 

(ii) Show that the inverse of an isomorphism is an isomorphism. 

(iii) Show that two groups that are isomorphic to a third group are isomorphic to each other. 

(iv) Prove that isomorphism is an equivalence relation on any set of groups. 

2.41 Prove that a group G is abelian if and only if the function /: G -*■ G, given by f(a) = a~ , 
is a homomorphism. 

2.42 This exercise gives some invariants of a group G. Let /: G —»■ H be an isomorphism. 

(i) Prove that if a e G has infinite order, then so does f(a), and if a has finite order n, then 
so does f(a). Conclude that if G has an element of some order n and H does not, then 
G^H. 

(ii) Prove that if G = H, then, for every divisor d of |G|, both G and H have the same 
number of elements of order d. 

2.43 Prove that A 4 and D \ 2 are nonisomorphic groups of order 12. 

2.44 (i) Find a subgroup H of 54 with H 7 # V and H = V. 

(ii) Prove that the subgroup H in part (i) is not a normal subgroup. 

2.45 Show that every group G with |G| < 6 is abelian. 

2.46 Let G = (/: R —>• R : f(x) = ax + b, where a 0). Prove that G is a group under 
composition that is isomorphic to the subgroup of GL(2, R) consisting of all matrices of the 

. Ta b 
form |^o , - 

2.47 (i) If /: G —> H is a homomorphism and x e G has order k . prove that f(x) e H has 

order m, where m \ k. 

(ii) If/: G —> H is a homomorphism and if (| G |, | H\) = 1, prove that/(.r) = 1 for all 
x e G. 

2.48 (i) Prove that 

-ik r 

cos 9 — sin 9 _ cos k9 — sin k9 

sind cos# sin k9 cos k9 

Hint. Use induction on k > 1. 

(ii) Prove that the special orthogonal group SO( 2, R), consisting of all 2 x 2 orthogonal 
matrices of determinant 1, is isomorphic to the circle group S' 1 . 

I’coso' -sina 

Hint. Consider <p : (cost*, sinct). 

sin o' cos o' 

2.49 Let G be the additive group of all polynomials in x with coefficients in Z, and let H be the 
multiplicative group of all positive rationals. Prove that G = H. 

Hint. List the prime numbers pg = 2, p\ = 3, P 2 = 5 .and define 

<p(eo + e\x + e 2 x 2 H- 1 - e n x n ) = p e ° ■ ■ ■ p e ". 
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2.50 

2.51 

2.52 

2.53 

2.54 

2.55 

2.56 

2.57 


2.58 

2.59 


(i) Show that if H is a subgroup with bH — Hb = {hb : h e H} for every b € G, then H 
must be a normal subgroup. 

(ii) Use part (i) to give a second proof of Proposition 2.62(h): If H < G has index 2, then 
H <\G. 

Hint. If a £ H, then aH = H r = Ha, where H' is the complement of H. 

(i) Prove that if a e S n , then a and a ~ 1 are conjugate. 

(ii) Give an example of a group G containing an element x for which x and x~‘ are not 
conjugate. 

Prove that the intersection of any family of normal subgroups of a group G is itself a normal 
subgroup of G. 

Define W = ((1 2)(3 4)), the cyclic subgroup of 54 generated by (1 2)(3 4). Show that W is 
a normal subgroup of V, but that W is not a normal subgroup of S 4 . Conclude that normality 
is not transitive: W <\ V and V <1 G do not imply W < G. 

Let G be a finite abelian group written multiplicatively. Prove that if |G| is odd, then every 
x e G has a unique square root; that is, there exists exactly one g e G with g 2 = x. 

Hint. Show that squaring is an injective function G —+ G, and use Exercise 1.58 on page 36. 

Give an example of a group G. a subgroup H < G. and an element g e G with [G : H\ = 3 
and g 3 ^ H. 

Hint. Take G = S 3 , H = ((1 2)), and g = (2 3). 

Show that the center of GL(2, R) is the set of all scalar matrices a I with a 5 ^ 0. 

Hint. Show that if A is a matrix that is not a scalar matrix, then there is some nonsingular 
matrix that does not commute with A. (The generalization of this to n x n matrices is true.) 

Let f = e- 71 '!" be a primitive nth root of unity, and define 


A = 


K 

0 


0 

r 1 


and B = 


0 

1 


1 

0 


(i) Prove that A has order n and that B has order 2. 

(ii) Prove that BAB — A -1 . 

(iii) Prove that the matrices of the form A' and BA ', for 0 < i < n, form a multiplicative 
subgroup G < GL(2, C). 

Hint. Consider cases A' A-' , A' BA ', BA' A' , and (BA' )(BA'). 

(iv) Prove that each matrix in G has a unique expression of the form B' A J , where ; = 0, 1 
and 0 < j < n. Conclude that |G| = 2 n. 

(v) Prove that G = Di n ■ 

Hint. Define a function G —»■ £> 2 ,, using the unique expression of elements in G in the 
form B' AJ. 

(i) Prove that every subgroup of Q x I 2 is normal. 

(ii) Prove that there exists a nonnormal subgroup of Q x I 4 . 

Recall that the group of quaternions Q consists of the 8 matrices in GL(2, C) 

Q = { /, A, A 2 , A 3 , B, BA, BA 2 . BA 3 }, 
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where 

. TO ll [0 il 

A= [-l 0 J and B = [i O' 

(i) Prove that —/is the only element in Q of order 2, and that all other elements M ^ 1 
satisfy M 2 = — I. 

(ii) Prove that Q is a nonabelian group with operation matrix multiplication. 

Hint. Note that A 2 = —1 = B 2 . 

(iii) Prove that Q has a unique subgroup of order 2, and it is the center of Q. 

2.60 Assume that there is a group G of order 8 whose elements 

±l,±i, ±j, ±k 

satisfy 

# 9 # 9 9 ## t t # 

i =j =k = —1, ij = k, jk = i, ki=j, 

ij = -ji, ik = -ki, jk = -kj. 

Prove that G = Q and, conversely, that Q is such a group. 

2.61 Prove that the quaternions Q and the dihedral group Dg are nonisomorphic groups of order 8. 
Hint. Use Exercise 2.42 on page 80. 

2.62 Prove that Aq is the only subgroup of Sq of order 12. 

Hint. Use Proposition 2.62(ii). 

2.63 Prove that the symmetry group E(jr„), where n n is a regular polygon with n vertices, is 
isomorphic to a subgroup of S n . 

Hint. The vertices X — {iq,..., v„] of n n are permuted by every motion a e 

2.64 (i) For every group G, show that the function T : G —> Aut(G), given by g y g (where 

Yx is conjugation by g), is a homomorphism. 

(ii) Prove that kerT = Z(G) and imP = Inn(G); conclude that Inn(G) is a subgroup of 
Aut(G). 

(iii) Prove that Inn(G) <1 Aut(G). 


2.6 Quotient Groups 

The construction of the additive group of integers modulo m is the prototype of a more 
general way of building new groups from given groups, called quotient groups. The homo¬ 
morphism tv : 2. — > I m , defined by tz : a t-> [a], is surjective, so that I„, is equal to im n. 
Thus, every element of I m has the form n (a ) for some a e Z, and it (a ) + 7r (b) — Tt(a + b). 
This description of the additive group I,„ in terms of the additive group Z can be general¬ 
ized to arbitrary, not necessarily abelian, groups. Suppose that f: G —> H is a surjective 
homomorphism between groups G and H. Since / is surjective, each element of H has the 
form f(a ) for some a e G, and the operation in H is given by f{a)f{b) — f(ab), where 



Sec. 2.6 Quotient Groups 


83 


a. b e G. Now K = ker / is a normal subgroup of G, and we are going to reconstruct 
H = im / (as well as a suijective homomorphism n : G -> II ) from G and K alone. 

We begin by introducing an operation on the set 

S(G) 

of all nonempty subsets of a group G. If X, Y e S(G). define 

XY = {xy : x e X and y e Y}. 

This multiplication is associative: X (YZ) is the set of all x(yz), where x e X, y e Y , and 
zeZ, (. XY)Z is the set of all such ( xy)z, and these are the same because of associativity 
in G. 

An instance of this multiplication is the product of a one-point subset {a} and a subgroup 
K < G, which is the coset aK. 

As a second example, we show that if H is any subgroup of G, then 

HH = H. 

If h, h' e H , then hit e H , because subgroups are closed under multiplication, and so 
HH C H. For the reverse inclusion, if h e H , then h = h 1 e HH (because 1 e H), and 
so H C HH. 

It is possible for two subsets X and Y in S(G) to commute even though their constituent 
elements do not commute. For example, let G = S 3 and K — ((1 2 3)}. Now (1 2) does 
not commute with (1 2 3) e K, but we claim that (1 2 )K — K (1 2). In fact, here is the 
converse of Exercise 2.50 on page 81. 

Lemma 2.65. A subgroup K of a group G is a normal subgroup if and only if 

gK = Kg 

for every g e G. Thus, every right coset of a normal subgroup is also a left coset. 

Proof. Let gk e gK. Since K is normal, gkg~ l e K, say gkg~ [ = k' e K, so that 
gk — (gkg~ l )g — k'g e Kg, and so gK C Kg. For the reverse inclusion, let kg e Kg. 
Since K is normal, (g _1 )A:(g _1 ) _1 = g~ l kg e K, say g~ l kg — k" e K. Hence, 
kg = g(g~ l kg ) = gk" e gK and Kg C gK. Therefore, gK — Kg when K <\G. 

Conversely, if gK = Kg for every g e G, then for each k e K, there is k' e K with 
gk — k'g ; that is, gkg~ l e K for all g e G, and so K <d G. • 

A natural question is whether HK is a subgroup when both H and K are subgroups. In 
general, HK need not be a subgroup. For example, let G — S 3 , let H = ((1 2)>, and let 
K = ((1 3)>. Then 

HK= {(1), (1 2), (1 3), (13 2)} 

is not a subgroup lest we contradict Lagrange’s theorem, for 4 \ 6. 
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Proposition 2.66. 

(i) If H and K are subgroups of a group G, and if one of them is a normal subgroup, 
then HK is a subgroup ofG: moreover, HK = KH in this case. 

(ii) If both H and K are normal subgroups, then HK is a normal subgroup. 


Remark. Exercise 2.72 on page 95 shows that if H and K are subgroups of a group G, 
then HK is a subgroup if and only if HK — KH. < 

Proof, (i) Assume first that K <1 G. We claim that HK = KH. If hk e HK, then 
k' = hkh~ x e K, because K < G, and 

hk = hkh~ 1 h = k'h e KH. 

Hence, HK c KH. For the reverse inclusion, write kh — hh~ l kh — hk" e HK. (Note that 
the same argument shows that HK — KH if // ■<] G.) 

We now show that HK is a subgroup. Since 1 e H and 1 e K , we have 1 = 1 • 1 e HK: 
if hk e HK, then ( hk)~ l — k~ l h~ l e KH = HK: if hk,h\k\ e HK, then hkhiki e 
HKHK = HHKK = HK. 

(ii) If g e G, then Lemma 2.65 gives gHK = HgK = HKg, and the same lemma now 
gives HK < G. • 

Here is a fundamental construction of a new group from a given group. 

Theorem 2.67. Let G/K denote the family of all the left cosets of a subgroup K of G. If 
K is a normal subgroup, then 

aKbK = abK 

for all a, b € G, and G/K is a group under this operation. 

Remark. The group G/K is called the quotient group G mod K: when G is finite, its 
order \G/K\ is the index [G : /if] = \G\/\K \ (presumably, this is the reason why quotient 
groups are so called). < 

Proof. The product of two cosets ( aK){bK ) can also be viewed as the product of 4 ele¬ 
ments in S(G). Hence, associativity in S(G ) gives 

(, aK)(bK ) = a(Kb)K = a(bK)K = abKK = abK, 

for normality of K gives Kb — bK for all b e K, by Lemma 2.65, while KK — K 
because K is a subgroup. Thus, the product of two cosets of K is again a coset of K, and 
so an operation on G/K has been defined. Because multiplication in S(G ) is associative, 
equality X(YZ) — (XY)Z holds, in particular, when X, Y, and Z are cosets of K, so that 
the operation on G/K is associative. The identity is the coset K = IK, for (1 K)(bK) = 
1 bK = bK — b\K — ( bK)(lK ), and the inverse of aK is a~ l K, for (a~ l K)(aK) = 
a~ l aK — K = aa~ [ K — (aK)(a~ l K). Therefore, G/K is a group. • 
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It is important to remember what we have just proved: The product aKbK — abK 
in G/K does not depend on the particular representatives of the cosets, and the law of 
substitution holds: If a K — a'K and bK — b' K, then 

aKbK — abK — a'b'K = a Kb'K. 


Example 2.68. 

We show that the quotient group G/K is precisely I„, when G is the additive group Z and 
K — (m), the (cyclic) subgroup of all the multiples of a positive integer m. Since Z is 
abelian, (m) is necessarily a normal subgroup. The sets Z/ (m) and I m coincide because 
they are comprised of the same elements: The coset a + (m) is the congruence class [a]: 

a + (m) — (a + km : k e Z} = [a]. 

The operations also coincide: Addition in Z/ (m) is given by 

(a + ( m )) + (b + (m)) = (a + b ) + (m ); 

since a + (m) = [a], this last equation is just [a] + [/?] = [a + b], which is the sum in I m . 
Therefore, I m is equal to the quotient group Z/ ( m ). ◄ 

There is another way to regard quotient groups. After all, we saw, in the proof of 
Lemma 2.40, that the relation = on G, defined by a = b if b~ [ a e K. is an equivalence 
relation whose equivalence classes are the cosets of K. Thus, we can view the elements of 
G/K as equivalence classes, with the multiplication aKbK — abK being independent of 
the choice of representative. 

We remind the reader of Lemma 2.40(i): If K is a subgroup of G, then two cosets a K 
and bK are equal if and only if b~ l a e K. In particular, if b — 1, then aK — K if and 
only if a e K. 

We can now prove the converse of Proposition 2.56(h). 

Corollary 2.69. Every normal subgroup K <i G is the kernel of some homomorphism. 

Proof. Define the natural map jt : G —> G/K by n(a) = aK. With this notation, the 
formula aKbK = abK can be rewritten as Tt(a)n(b) = Tt(ab)\ thus, Tt is a (surjective) 
homomorphism. Since K is the identity element in G/K, 

kerjr — {a e G : n(a) — K] = {a e G : aK — K} = K , 

by Lemma 2.40(i). • 

The next theorem shows that every homomorphism gives rise to an isomorphism and 
that quotient groups are merely constructions of homomorphic images. E. Noether (1882- 
1935) emphasized the fundamental importance of this fact. 

Theorem 2.70 (First Isomorphism Theorem). If f: G II is a homomorphism, then 
ker / < G and G / ker / = im /. 

In more detail, if ker / = K and cp: G/K -> im / < H is given by ip\ aK i —> f(a), then 
ip is an isomorphism. 
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Remark. The following diagram describes the proof of the first isomorphism theorem, 
where n : G -* G/K is the natural map tc : a i-> a K. 



G/K 


Proof. We have already seen, in Proposition 2.56(ii), that K — ker / is a normal sub¬ 
group of G. Now <p is well-defined: If a K — bK , then a = bk for some k e K, and so 
f(a) = f(bk) = f(b)f(k ) = f (b). because f(k) = 1. 

Let us now see that is a homomorphism. Since / is a homomorphism and cp(aK ) = 

f(a), 

< p(aKbK ) = (p(abK ) = f(ab) — f(a)f(b) = (p(aK)(p{bK). 

It is clear that mup < im /. For the reverse inclusion, note that if y e im /, then 
y = f{a) for some a e G, and so y — f(a ) = ip(aK). Thus, q> is surjective. 

Finally, we show that (p is injective. If <p(aK) = (p{bK ), then f(a) — fib). Hence, 
1 = / (b)~ l f (a) — f(b~ l a), so that b~ [ a e ker/ = K. Therefore, aK — bK, 
by Lemma 2.40(i), and so <p is injective. We have proved that (p : G/K im / is an 
isomorphism. • 

Given any homomorphism f: G H, we should immediately ask for its kernel and 
image; the first isomorphism theorem will then provide an isomorphism G/ker/ = im/. 
Since there is no significant difference between isomorphic groups, the first isomorphism 
theorem also says that there is no significant difference between quotient groups and ho¬ 
momorphic images. 

Example 2.71. 

Let us revisit Example 2.53, which showed that any two cyclic groups of order m are 
isomorphic. Let G = (a) be a cyclic group of order m. Define a function /: Z -* G by 
f(n) — a' 1 for all n e Z. Now / is easily seen to be a homomorphism; it is surjective 
(because a is a generator of G), while ker / = {neZ:a" = l} = (m>, by Theorem 2.24. 
The first isomorphism theorem gives an isomorphism Z/ (m) = G. We have shown that 
every cyclic group of order m is isomorphic to Z/ (m), and hence that any two cyclic groups 
of order m are isomorphic to each other. Of course. Example 2.68 shows that Z/ ( m) = I,„, 
so that every cyclic group of order m is isomorphic to I m . 

We point out that any two infinite cyclic groups are isomorphic to Z; the reader should 
have no difficulty proving this. -4 

Example 2.72. 

What is the quotient group R/Z? Define /: R —> S l . where S l is the circle group, by 

/: x e 2nix . 
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Now / is a homomorphism; that is, /(x + y) = f(x)f(y), by the addition formulas 
for sine and cosine. The map / is surjective, and ker / consists of all x e R for which 
e 2jTlx = cos 27 xx + i sin27rx = 1; that is, cos 2tcx — 1 and sin27rx = 0. But cos 2nx — 1 
forces x to be an integer; since 1 e ker /, we have ker / = Z. The first isomorphism 
theorem now gives 

R/Z = S 1 . 

This is the group-theoretic version of Example 1.55(i). ◄ 

Here is a useful counting result. 

Proposition 2.73 (Product Formula). If H and K are subgroups of a finite group G, 
then 

\HK\\HGK\ = \H\\K\, 
where HK — {hk : h e H and k e K}. 

Remark. The subset HK need not be a subgroup of G; however. Proposition 2.66 shows 
that if either H <] G or K <3 G, then HK is a subgroup (see also Exercise 2.72 on page 95). 


Proof. Define a function /: HxK -> HK by / : (h, k) hk. Clearly, / is a surjection. 
It suffices to show, for every x e HK. that |/ -1 (x)| = | H IT K |, where / -1 (x) = {(h.k) e 
H x K : hk = x}, [because HxK is the disjoint union Uas/za: r\x)i 
We claim that if x = hk , then 

/ -1 (x) = {(hd , d~ l k) : d e H D K}. 

Each ( lid,d~ l k ) e / _1 (x), for f(hd,d~ l k ) = hdd~ l k = hk — x. For the reverse 
inclusion, let (/;', k') e / -1 (x), so that h!k! — hk. Then h~ l h’ — kk'~ l e H (IK ; call this 
element d. Then /;' = hd and k' = d~ l k, and so (If. k f ) lies in the right side. Therefore, 

|/-‘(x)| = \{(hd, d~ l k) : d € H n K}\ = \H n K\, 

because d i-> (hd, d~ l k ) is abijection. • 

The next two results are consequences of the first isomorphism theorem. 

Theorem 2.74 (Second Isomorphism Theorem). If H and K are subgroups of a group 
G with II -<] G, then HK is a subgroup, II C\ K <] K, and 

K/(H HK) = HK/H. 

Proof. Since H <d G, Proposition 2.66 shows that HK is a subgroup. Normality of H in 
HK follows from a more general fact: If H < S < G and if H is normal in G, then H 
is normal in S (if ghg~ l e H for every g e G, then, in particular, ghg~ l e H for every 

8 e S). 
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We now show that every coset xH e HK/H has the form kll for some k e K. Of 
course, xH — hkH, where h e H and k e K. But hk — kk~ l hk — kh' for some 
h' e H, so that hkH — kh'H — kH. It follows that the function / : AT —> HK/H, given 
by f: k kll, is surjective. Moreover, / is a homomorphism, for it is the restriction 
of the natural map 7T: G -> G/H. Since ker7r = H, it follows that ker/ = H fl K, 
and so H fl K is a normal subgroup of K. The first isomorphism theorem now gives 
K/(H n K) = HK/H. • 

The second isomorphism theorem gives the product formula in the special case when 
one of the subgroups is normal: If K/(H(1K) = HK/H, then \K/{H fl ^T)| = \HK/H\, 
and so \HK\\H (T K\ = \H\\K\. 

Theorem 2.75 (Third Isomorphism Theorem). If H and K are normal subgroups of 
a group G with K < H, then H/K < G/K and 

(G/K)/(H/K) = G/H. 

Proof. Define /: G/K -> G/H by /: aK i-»- aH. Note that / is a (well-defined) 
function, for if a' e G and a'K — aK, then a~ l a r e K < H, and so aH = a'H. It is 
easy to see that / is a surjective homomorphism. 

Now ker / = H/K, for aH — H if and only if a € H, and so H/K is a normal 
subgroup of G/K. Since / is surjective, the first isomorphism theorem gives 

(G/K)/(H/K) = G/H. . 

The third isomorphism theorem is easy to remember: In the fraction ( G/K)/(H/K ), 
the K's can be canceled. We can better appreciate the first isomorphism theorem after 
having proved the third one. The quotient group ( G/K)/{H/K ) consists of cosets (of 
H/K ) whose representatives are themselves cosets (of G/K). A direct proof of the third 
isomorphism theorem could be nasty. 

The next result, which can be regarded as a fourth isomorphism theorem, describes the 
subgroups of a quotient group G/K. 

Proposition 2.76 (Correspondence Theorem). Let G be a group, let K <\ G, and let 

tc : G —> G/K be the natural map. Then 

S jt(S) = S/K 

is a bijection between Sub(G; K), the family of all those subgroups S of G that contain K, 
and Sub(G/W), the family of all the subgroups ofG/K. If we denote S/K by S*, then 

T < S < G if and only if T* < S*, in which case [S : T] = [5* : T*], 

and 

T <1 S if and only if T* <1 S*, in which case S/T = S*/T*. 
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Remark. The following diagram is a way to remember this theorem. 




Proof. Define <J>: Sub(G; K) -> Sub(G//f)by <t>: 5" i—>• S/K (it is routine to check that 
if S is subgroup of G containing K , then S/K is a subgroup of G/K). 

To see that O is injective, we begin by showing that if K < S < G. then tt “ 1 ;r ( S ) = S. 
As always, S C by Proposition 1.50(iv). For the reverse inclusion, let a e 

7T~ l 7t(S ), so that 7 r(a) — n(s) for some s e S. It follows that as~ [ e ker7r = K , so that 
a = sk for some k e K. But K < 5, and so a = sk e S. 

Assume now that n(S) — tt(S'), where S and 5' are subgroups of G containing K. 
Then tc~ 1 jt(S) = jt~ l Tt(S'), and so S — S' as we have just proved in the preceding 
paragraph; hence, <I> is injective. 

To see that <J> is surjective, let U be a subgroup of G/K. Now jr~ l (U) is a subgroup of 
G containing K — 7r —1 ({1}), and = U. by Proposition 1.50(ii). 

Proposition 1.50(i) shows that T < S < G implies T/K = ic(T) < tc(S) — S/K. 
Conversely, assume that T/K < S/K. If t e T, then tK e T/K < S/K and so tK — sK 
for some s e S. Hence, t — sk for some k e K < S. and so t e S. 

To prove that [5 ; T] — [5* : T*], it suffices to show that there is a bijection from the 
family of all cosets of the form sT, where ,v e S, and the family of all cosets of the form 
s*T *, where s* e S*, and the reader may check that sT ir(s)T* is such a bijection. 
When G is finite, we may prove [5:2"] = [5* : 7"*] as follows: 

[S* :T*] = \S*\/\T*\ 

= \S/K\/\T/K\ 

= (\S\/\K\) / (\T\/\K\) 

= |S|/|r| 

= IS--T]. 
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If T <1 S, then T/ K <1 S/K and ( S/K)/(T/ K) = S/T, by the third isomorphism theorem; 
that is, S*/T* = S/T. It remains to show that if T* <3 S*. then T < S', that is, if t e T and 
s e S, then sfs -1 e T. Now 

7 r(sfs -1 ) = 7r(i)7r(f)7r(s) _1 e tt(s)T*jz(s )~ 1 = T*, 
sothatsfs- 1 e = T. • 

When dealing with quotient groups, we usually say, without mentioning the correspon¬ 
dence theorem explicitly, that every subgroup of G/K has the form S/K for a unique 
subgroup S < G containing K. 


Example 2.77. 

Let G — (a) he a cyclic group of order 30. If ?r: 1 -» G is defined by tt(h) — a n , then 
ker7r = (30). The subgroups (30) < (15} < (5} < Z correspond to the subgroups 

{1} = (« 30 > < (a 15 ) < (a 5 ) < (a). 


Moreover, the quotient groups are 

(o^>^(15>^ i {a*)_ 

(a 30 } (30} 2 ’ (a 15 } 



and 


(a) _ Z _ 

-= - = -II5. ◄ 

(a 5 ) (5) 


Proposition 2.78. If G is a finite abelian group and d is a divisor of \G\, then G contains 
a subgroup of order d. 

Proof. We prove the result by induction on n = |G| for a prime divisor p of |G|. The 
base step n = 1 is true, for there are no prime divisors of 1. For the inductive step, choose 
a e G of order k > 1. If p \ k, say k — pi, then Exercise 2.23 on page 62 says that a 1 
has order p. If p \ k, consider the cyclic subgroup H = (a). Now H <\ G, because G is 
abelian, and so the quotient group G/H exists. Note that \G/H\ — n/k is divisible by p, 
and so the inductive hypothesis gives an element bH e G/H of order p. If b has order m, 
then Exercise 2.47(i) on page 80 gives p \ m. We have returned to the first case. 

Let d be any divisor of |G|, and let p be a prime divisor of d. We have just seen that 
there is a subgroup S < G of order p. Now S < G, because G is abelian, and G/S is 
a group of order n/p. By induction on |G|, G/S has a subgroup H* of order d/p. The 
correspondence theorem gives H* = H/S for some subgroup H of G containing S, and 
\H\ — |H*||5| = d. 

Here is a construction of a new group from two given groups. 

Definition. If H and K are groups, then their direct product, denoted by fix K, is the 
set of all ordered pairs (h, k) with h e H and k e K equipped with the operation 

(h,k)(h',k r ) = (hhfkk'). 

It is easy to check that the direct product H x K is a group [the identity is (1, 1) and 

(h,k)~ l = 

We now apply the first isomorphism theorem to direct products. 
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Proposition 2.79. Let G and G' be groups, and let K <i G and K' < G' be normal 
subgroups. Then K x K' <J G x G', and there is an isomorphism 

(G x G')/(K x K') = ( G/K) x ( G'/K '). 

Proof. Let tt: G —> G/S' and tt' : G' —> G'/K' be the natural maps. It is routine to 
check that /: GxG'-^ (G/K) x (G'/K'), given by 

/: (g, g') h* (jt(g), jt'(g')) = (gK, g'K') 

is a surjective homomorphism with ker f — K x K'. The first isomorphism theorem now 
gives the desired isomorphism. • 

Proposition 2.80. If G is a group containing normal subgroups H and K with H IT K = 
{1} and HK — G, then G = H x K. 

Proof. We show first that if g e G, then the factorization g = hk, where h e H and 
k e K, is unique. If hk = h'k', then h'~ X h — k'k~ l e H fl K — {1}. Therefore, h' = h 
and k! = k. We may now define a function <p: G -> H x K by cp(g) — (h, k), where 
g = hk, h e H, and k e K. To see whether <p is a homomorphism, let g' = h'k', so that 
gg' — hkh'k!. Hence, cp(gg') = cp(hkh'k'), which is not in the proper form for evaluation. 
If we knew that if h e H and k e K, then hk — kh, then we could continue: 

<p(hkh'k') — <p(hh'kk') 

= (hh’, kk’) 

= (h,k)(h',k') 

= V(g)<P(g')- 

Let h e H and k e K. Since K is a normal subgroup, (hkh~ l )k~ l e K ; since H is a 
normal subgroup, h(kh~ { k~ l ) e H. But H IT K = {1}, so that hkh~ l k~ l = 1 and hk = 
kh. Finally, we show that the homomorphism q> is an isomorphism. If (h, k) e H x K, 
then the element g e G defined by g — hk satisfies (p(g) — (h, k ); hence tp is surjective. 
If (p(g) — (1, 1), then g — 1, so that ker q> — 1 and tp is injective. Therefore, tp is an 
isomorphism. • 

Remark. We must assume that both subgroups H and K are normal. For example, .S 3 has 
subgroups H = ((1 2 3)> and K = ((1 2)). Now H < S 3 , H Fl K = {1}, and HK = .S 3 , 
but S 3 H x K (because the direct product is abelian). Of course, K is not a normal 
subgroup of S 3 . ◄ 

Theorem 2.81. If m and n are relatively prime, then 

Imo — IfM ^ 
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Proof. If a e Z, denote its congruence class in I,„ by [a] m . The reader can show that the 
function /: Z -> I m x I„, given by a i-> ([d\ m , [a]„), is a homomorphism. We claim 
that ker / = (mn). Clearly, (mn) < ker/. For the reverse inclusion, if a e ker /, then 
[a\ m = [ 0 ],„ and [a]„ = [ 0 ]„; that is, a = 0 mod m and a = 0 mod n\ that is, m \ a and 
n \ a. Since m and n are relatively prime, mn \ a, and so a e {mn), that is, ker / < {mn) 
and ker / = {mn). The first isomorphism theorem now gives Z/ {mn) = im / < I m x I„. 
But Z/ {mn) = I m „ has mn elements, as does I m x I„. We conclude that / is surjective. • 

For example, it follows that Ig = I 2 x I 3 . Note that there is no isomorphism if m and n 
are not relatively prime. For example, I 4 I 2 x I 2 , for I 4 has an element of order 4 and 
the direct product (which is isomorphic to the four-group V) has no such element. 

In light of Proposition 2.34, we may say that an element a e G has order n if (a) = I„. 
Theorem 2.81 can now be interpreted as saying that if a and b are commuting elements 
having relatively prime orders m and n, then ab has order mn. Let us give a direct proof of 
this result. 

Proposition 2.82. Let G be a group, and let a, b e G be commuting elements of orders 
m and n, respectively. If (m, n) = 1, then ab has order mn. 

Proof. Since a and b commute, we have (ab) r — a' b 1 for all r, so that ( ab) mn = 
a mnjjmn _ j j t su ffj ces t 0 p r0 ve that if {ab) k — 1, then mn \ k. If 1 = {ab) k — a k b k , 
then a k — b~ k . Since a has order m, we have 1 = a mk — b~ mk . Since b has order n, 
Theorem 2.24 gives n \ mk. As (m, n) = 1, however. Corollary 1.11 gives n \ k\ a similar 
argument gives m \ k. Finally, Exercise 1.19 on page 13 shows that mn \ k. Therefore, 
mn < k, and mn is the order of ab. • 

Corollary 2.83. If (in, n) — 1, then <p(mn) = <p(m)(p(n), where cp is the Euler (p-function. 

Proof. 15 Theorem 2.81 shows that the function /: I,„„ —> I m x I„, given by [a] 
([a] m , [a] n ), is an isomorphism. The result will follow if we prove that f(U(I mn )) = 
U (l m ) x U (I„), for then 

<p(mn) = \U(l mn )\ = \f(U(l mn ))\ 

— I U (I m ) x f/(I„)| = |f/(I m )| • |f/(I„)| = (p(m)<p(n). 

If [a] e U (I mn ), then [a][fo] = [1] for some [fo] e I m „, and 

f([ab]) = ([ ab ] m , [ab] n ) = ([a] m [b] m , [a]„[b] n ) 

= ([a] m ,[a] n )([b] m ,[b] n ) = ([l] m ,[l]„). 

Hence, [Y\ m — [a] m [b]„, and [1]„ = [a] n [b] n , so that f([a]) = ([a] m ,[a]„) e U(l m ) x 
U (I„), and f(U (I m „)) c U (I m ) x U (I„). 

For the reverse inclusion, if/([c]) = ([c] m , [c]„) e U (I m ) x U (I,,), then we must show 
that [c] e U There is [d] m e I m with [c]„,[d] m — [l] m , and there is [e]„ e I„ with 

1 \See Exercise 3.50 on page 150 for a less cluttered proof. 
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[c] n [e]n = [1]«- Since / is surjective, there is b e Z with ([ b] m , [b] n ) — i[d] m , [e] n ), so 
that 

/([!]) = ([l]m. M») = ([C] m [b] m , [ c] n [b ] n ) = f([c][b]). 

Since / is an injection, [1] = [c][&] and [c] e U (I mn ). • 

Corollary 2.84. 

(i) If p is a prime, then <p{p e ) — p e — p e ~ l = p e ^1 — 

(ii) If n — p e y ■ ■ ■ p e f is the prime factorization ofn, then 


( x _iv. 


V pj 

V p t ) 


Sketch of Proof. Part (i) holds because ( k, p e ) = 1 if and only if p \ k, while part (ii) 
follows from Corollary 2.83. • 

Lemma 2.85. A cyclic group of order n has a unique subgroup of order d, for each 
divisor d ofn, and this subgroup is cyclic. 

Proof. Let G — (a). If n — cd, we show that a c has order d (and so (a c ) is a subgroup 
of order d). Clearly ( a c ) d = a cd = a" = 1; we claim that d is the smallest such power. 
If ia c Y = 1, then n \ cr [Theorem 2.24]; hence cr — ns = dcs for some integer ,v, and 
r — ds > d. 

To prove uniqueness, assume that (x) is a subgroup of order d (recall that every sub¬ 
group of a cyclic group is cyclic, by Exercise 2.34 on page 72). Now x = a' n and 1 = 
x d = a md \ hence md — nk for some integer k. Therefore, x = a m = (a n ^ d ) k — ( a c ) k , so 
that (x) < ( a c ). Since both subgroups have the same order d, it follows that (x) = ( a c ). • 

Define an equivalence relation on a group G by x = y if (x) = (y); that is, x and y 
are equivalent if they are generators of the same cyclic subgroup. Denote the equivalence 
class containing an element x by gen(C), where C = (x); thus, gen(C) consists of all the 
generators of C. As usual, equivalence classes form a partition, and so G is the disjoint 
union: 

G = Ugen(C), 

c 

where C ranges over all cyclic subgroups of G. In Theorem 2.33(h), we proved that 

|gen(C)| = f{n). 


where (p is the Euler (/.-function. 

The next theorem will be used later to prove that the multiplicative group I* is cyclic. 

Theorem 2.86. A group G of order n is cyclic if and only if for each divisor d of n, 
there is at most one cyclic subgroup of order d. 
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Proof. If G is cyclic, then the result follows from Lemma 2.85. Conversely, write G as a 
disjoint union: 

G = Ugen(C). 

c 

Hence, n — |G| = |gen(C)|, where the summation is over all cyclic subgroups C of G: 

n = Yl lg en (OI = 

c c 

By hypothesis, for any divisor d of «, the group G has at most one cyclic subgroup of order 
d. Therefore, 

n = ^2 |gen(C)| = ^</>(|C|) < J2<p(d) = n, 

C C d\n 

the last equality being Corollary 1.39. Hence, for every divisor d of n, we must have 4>{d) 
arising as |gen(C)| for some cyclic subgroup C of G of order d. In particular, fin) arises; 
there is a cyclic subgroup of order n, and so G is cyclic. • 

Here is a proof of the abelian case of the preceding theorem (shown to me by D. Leep). 

Theorem. If G is an abelian group of order n having at most one cyclic subgroup of 
order p for each prime divisor p ofn, then G is cyclic. 

Proof. The proof is by induction on n — |G|, with the base step n = 1 obviously true. 
For the inductive step, note first that the hypothesis is inherited by subgroups of G. We 
claim that there is some element x in G whose order is a prime divisor p of |G|. Choose 
y e G with y f 1; its order A: is a divisor of |G|, by Lagrange’s theorem, and so k = 
pm for some prime p. By Exercise 2.23 on page 62, the element x — y m has order p. 
Define 6: G —> G by 6 : g i —> g p (9 is a homomorphism because G is abelian). Now 
x e ker@, so that | ker 6 > p. If | kcrd] > p. then there would be more than p elements 
g e G satisfying g p = 1, and this would force more than one subgroup of order p in G. 
Therefore, | ker@| = p. By the first isomorphism theorem, G/ kerb = im6 < G. Thus, 
im 9 is a subgroup of G of order n/p satisfying the inductive hypothesis, so there is an 
element z e im0 with im 9 — ( z)■ Moreover, since z e im 9, there is b e G with z = b p . 
There are now two cases. If p \ n/p , then xz has order p ■n/p — n, by Proposition 2.82, 
and so G — {xz). If p \ n/p , then Exercise 2.24 on page 62 shows that b has order n, and 
G = (b). • 

Exercises 

2.65 Prove that U (I 9 ) = Ig and U (I 15 ) = I 4 x 12- 

2.66 (i) Let H and K be groups. Without using the first isomorphism theorem, prove that H* = 

{(h, 1 ) : h e H } and K* = ((1, k ) : k e K) are normal subgroups of H x K with 
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H = H* and K = K*, and /: H (H x K)/K*. defined by f(h) = (li, 1 )K*, is 
an isomorphism. 

(ii) Use the first isomorphism theorem to prove that K* <1 H x K and that 

(H x K)/K* = H. 

Hint. Consider the function f:HxK^-H defined by /: (h, k) h. 

2.67 (i) Prove that Aut(V) = 53 and that Aut(S 3 ) = 53 . Conclude that nonisomorphic groups 

can have isomorphic automorphism groups. 

(ii) Prove that Aut(Z) = I 2 . Conclude that an infinite group can have a finite automorphism 
group. 

2.68 If G is a group for which Aut(G) = (1), prove that |G| < 2. 

2.69 Prove that if G is a group for which G/Z(G) is cyclic, where Z(G) denotes the center of G, 
then G is abelian. 

Hint. If G/Z(G) is cyclic, prove that a generator gives an element outside of Z(G) which 
commutes with each element of G. 

2.70 (i) Prove that Q/Z(Q) = V, where Q is the group of quaternions and V is the four-group; 

conclude that the quotient of a group by its center can be abelian. 

(ii) Prove that Q has no subgroup isomorphic to V. Conclude that the quotient Q/Z(Q) is 
not isomorphic to a subgroup of Q. 

2.71 Let G be a finite group with K <\ G. If (|A'|. [G : K}) = 1, prove that K is the unique 
subgroup of G having order | K \. 

Hint. If H < G and |H| = | AT|, what happens to elements of H in G/ Kl 

2.72 If H and K are subgroups of a group G, prove that HK is a subgroup of G if and only if 
H K = KH. 

Hint. Use the fact that H C HK and K C H K. 

2.73 Let G be a group and regard G x G as the direct product of G with itself. If the multiplication 
g: G x G -> Gisa group homomorphism, prove that G must be abelian. 

2.74 Generalize Theorem 2.81 as follows. Let G be a finite (additive) abelian group of order mn , 
where (m . n) = 1. Define 

G m = {g € G : order (g) | m} and G„ = [h e G : order ( h) \ n). 

(i) Prove that G m and G n are subgroups with G m fl G„ = (0). 

(ii) Prove that G = G m + G n = {g + h : g € G m and h e G„). 

(iii) Prove that G = G m x G„. 

2.75 Let G be a finite group, let p be a prime, and let H be a normal subgroup of G. Prove that if 
both \H\ and |G/H| are powers of p, then |G| is a power of p. 

2.76 If H and K are normal subgroups of a group G with HK — G, prove that 

G/(H H K) = ( G/H ) x (G/K). 

Hint. If (p\ G -*■ (G/H) x (G/K) is defined by x !->■ (xH, xK), then kerip = H fl K\ 
moreover, we have G = HK, so that 

(J aH = HK = (J bK. 

a b 
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Definition. If H\,..., H n are groups, then their direct product 

H\ x • • ■ x H n 

is the set of all n-tuples (h\, .... h n ), where hj e Hj for all i, with coordinatewise multiplication: 
dn,...,h n )(h\ . h' n ) = (h l h\,...ji n h' n ). 

2.77 (i) Generalize Theorem 2.81 by proving that if the prime factorization of an integer m is 

m = pj 1 ■ ■ • pa" , then 

I»i = I «i x ■ ■ ■ x I n . 

Pi Pi 

(ii) Generalize Corollary 2.83 by proving that if the prime factorization of an integer m is 
m — Pj 1 • ■ • p e f , then 

t/(I m )St/(I n tlJX-Xf/n e„). 

Pi Pi 


2.7 Group Actions 

Groups of permutations led us to abstract groups; the next result, due to A. Cayley, shows 
that abstract groups are not so far removed from permutations. 

Theorem 2.87 (Cayley). Every group G is isomorphic to a subgroup of the symmetric 
group Sq- In particular, if\G\ = n, then G is isomorphic to a subgroup of S n . 

Proof For each a e G, define “translation” t a : G -» G by x a (x) — ax for every x e G 
fif a 1, then r a is not a homomorphism). For a, b e G, (x a o r/,)(x) = x a (rb(x)) = 
xa(bx) — a(bx) — ( ab)x , by associativity, so that 


tatb — tab • 

It follows that each x a is a bijection, for its inverse is x a -i : 

FvTj-I = t aa -l X\ Ip 

and so x a e Sq. 

Define <p ; G -> Sq by tp(a) = x a . Rewriting, 

(p(a)tp(b) — x a xb = x a b — tp(ab), 

so that q> is a homomorphism. Finally, q> is an injection. If ip(a) = (p(b), then x a — xb, and 
hence x a (x) = xb(x) for all x e G; in particular, when x = 1, this gives a — b, as desired. 

The last statement follows from Exercise 2.39 on page 80, which says that if X is a set 
with |X| = n, then Sx = S n . • 

The reader may note, in the proof of Cayley’s theorem, that the permutation x a is just 
the <7th row of the multiplication table of G. 

To tell the truth, Cayley’s theorem itself is only mildly interesting. However, the identi¬ 
cal proof works in a larger setting that is more interesting. 
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Theorem 2.88 (Representation on Cosets). Let G be a group, and let H be a subgroup 
of G having finite index n. Then there exists a homomorphism : G —>■ S n withkercp < H. 

Proof. Even though H may not be a normal subgroup, we still denote the family of all 
the cosets of H in G by G/H. 

For each a e G, define “translation” r a : G/H -> G/H by r a (xH) — axH for every 
x e G. For a, b e G, 

( r a o Tb)(xH) = x a (xb(xH)) = r a (bxH) — a(bxH) — ( ab)xH , 
by associativity, so that 

tatb — tab- 

It follows that each r a is a bijection, for its inverse is x a -i: 

T at a -' = t aa -l =ti = 1 g/H = t a -it a , 
and so x a e Sq/h- Define cp : G -+ Sq/h by <p(a ) = x a . Rewriting, 

(p{a)(p(b ) = x a Xb = x a b — <P/ab), 

so that q> is a homomorphism. Finally, if a e ker^>, then (p{a) — 1 g/h, so that x a (xH ) = 
xH for all x e G; in particular, when x = 1, this gives aH = H, and a e H, by 
Femma 2.40(i). The result follows from Exercise 2.39 on page 80, for \G/H\ = n, and so 
Sg/h = S„. • 

When H = {1}, this is the Cayley theorem. 

We are now going to classify all groups of order up to 7. By Example 2.53, every group 
of prime order p is isomorphic to I p , and so, to isomorphism, there is just one group of 
order p. Of the possible orders through 7, four of them, 2, 3, 5, and 7, are primes, and so 
we need look only at orders 4 and 6 . 

Proposition 2.89. Every group G of order 4 is isomorphic to either I 4 or the four-group 
V. Moreover, I 4 and V are not isomorphic. 

Proof. By Fagrange’s theorem, every element in G, other than 1, has order either 2 or 4. 
If there is an element of order 4, then G is cyclic. Otherwise, x 2 — 1 for all x e G, so that 
Exercise 2.26 on page 62 shows that G is abelian. 

If distinct elements x and y in G are chosen, neither being 1, then we quickly check that 
xy f: { 1 , x, y}; hence, 

G = {1, 4 ', y, xy}. 

It is easy to see that the bijection /: G —»■ V, defined by /(1) = 1, f(x) — (1 2)(3 4), 
f(y) — (1 3)(2 4), and /(xy) = (1 4)(2 3), is an isomorphism. 

We have already seen, in Example 2.54, that I 4 '/f V. • 
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Proposition 2.90. If G is a group of order 6 , then G is isomorphic to either Ig or S 3 . 
Moreover, Ig and S 3 are not isomorphic . 16 

Proof. By Lagrange’s theorem, the only possible orders of nonidentity elements are 2, 3, 
and 6 . Of course, G = 16 if G has an element of order 6 . Now Exercise 2.27 on page 62 
shows that G must contain an element of order 2, say, t. We now consider the cases G 
abelian and G nonabelian separately. 

Case 1. G is abelian. 

If there is a second element of order 2, say, a , then it is easy to see, using at — ta, that 
H — {1 ,a,t, at} is a subgroup of G. This contradicts Lagrange’s theorem, because 4 is 
not a divisor of 6 . It follows that G must contain an element b of order 3. But tb has order 
6 , by Proposition 2.82. Therefore, G is cyclic if it is abelian. 

Case 2. G is not abelian. 

If G has no elements of order 3, then x 2 — 1 for all x e G, and G is abelian, by 
Exercise 2.26 on page 62. Therefore, G contains an element s of order 3 as well as the 
element t of order 2 . 

Now | (s) | = 3, so that [ G : (s)] = \G\/\ (s) \ = 6/3 = 2, and so (s) is a normal 
subgroup of G, by Proposition 2.62(h). Since t — t~ l , we have tst e (s); hence, tst — s l 
for i — 0. 1 or 2. Now i f 0, for tst — s° = 1 implies s = 1. If i = 1, then ,v and t 
commute, and this gives st of order 6, as in Case 1 (which forces G to be cyclic, hence 
abelian, contrary to our present hypothesis). Therefore, tst — s 2 = s 1 . 

We now use Theorem 2.88 to construct an isomorphism G -> S 3 . Let H — (f), and 
consider the homomorphism (p : G —»■ Sent) given by 

<P(g) ■ x ( t } i-> gx ( t). 


By the theorem, ker (p < (t), so that either ker (p = {1} (and cp is injective), or ker ip — (t ). 
Now G/ ( t ) = {(f) ,s(t), s 2 (f)}, and, in two-rowed notation. 


(pit) = 



s ( t) 
ts (t) 


s 2 (t)\ 
ts 2 {t)J 


If (p(t) is the identity permutation, then ts ( t) = s ( t }, so that s~ l ts e ( t) — {1, f}, by 
Lemma 2.40. But now s~ [ ts — t (it cannot be 1), hence ts — st, contradicting t and ,v 
not commuting. Therefore, t £ ker (p, and cp: G —> Sc/(t) = ^3 is an injective homomor¬ 
phism. Since both G and S 3 have order 6 , (p must be a bijection, and so G = S 3 . 

It is clear that 1^ and S 3 are not isomorphic, for one is abelian and the other is not. • 

16 Cayley states this proposition in an article he wrote in 1854. However, in 1878, in the American Journal of 
Mathematics, he wrote, “The general problem is to find all groups of a given order n; ... if n = 6, there are three 
groups; a group 

1, a, a 2 , a 3 , a 4 , a 5 (eft = 1), 

and two more groups 

1 , p 2 , a, otp, olP 2 ( a 2 = 1 , P 3 = 1 ), 

viz., in the first of these otp = Pa while in the other of them, we have ap = p 2 a, ap 2 = PaP Cayley’s list is 15 , 
E 2 x E 3 , and S 3 . Of course, I 2 x E 3 = 15 ; even Homer nods. 
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One consequence of this result is another proof that 16 = I 2 x I 3 (see Theorem 2.81). 
Classifying groups of order 8 is more difficult, for we have not yet developed enough 
theory. It turns out that there are 5 nonisomorphic groups of order 8 : Three are abelian: 
Igt I 4 x I 2 ; I 2 x I 2 x I 2 ; two are nonabelian: Z)g; Q. 

We can continue this discussion for larger orders, but things soon get out of hand, as 
Table 2.4 shows. Making a telephone directory of groups is not the way to study them. 


Order of Group 

Number of Groups 

2 

1 

4 

2 

8 

5 

16 

14 

32 

51 

64 

267 

128 

2,328 

256 

56, 092 

512 

10,494,213 

1024 

49,487,365,422 


Table 2.4. 

Groups arose by abstracting the fundamental properties enjoyed by permutations. But 
there is an important feature of permutations that the axioms do not mention: Permutations 
are functions. We shall see that there are interesting consequences when this feature is 
restored. 

Definition. If X is a set and G is a group, then G acts on X if there is a function GxX -> 
X , denoted by (g, x ) i-> gx, such that 

(i) (gh)x — g(hx ) for all g, h e G and x e X\ 

(ii) lx — x for all x e X. where 1 is the identity in G. 

We also call X a G-set if G acts on X. 

If a group G acts on a set X, then fixing the first variable, say g, gives a function 
a g : X —> X, namely, a g : x gx. This function is a permutation of X. for its inverse 
is a g -i: 

a g oig-i — cxi = lx — u g -ia g . 

It is easy to see that or. G —*■ Sx. defined by or. g cv„, is a homomorphism. Conversely, 
given any homomorphism <p: G — * Sx, define gx = <p(g)(x). Thus, an action of a group 
G on a set X is another way of viewing a homomorphism G -* Sx- 

Cayley’s theorem says that a group G acts on itself by (left) translation, and its general¬ 
ization, Theorem 2.88, shows that G also acts on the family of cosets of a subgroup H by 
(left) translation. 
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Example 2.91. 

We show that G acts on itself by conjugation: that is, for each g e G, define «„: G —>■ G 
to be conjugation 

<*g(x) = gxg~ l . 

To verify axiom (i), note that for each x e G, 


(a? ° a h)(x) — a g (ah(x)) 

— a g (hxh~ l ) 

= g(hxh ' l )g 1 
= (gh)x(ghy 1 
= otgh(x). 


Therefore, a g o a/, = a g /,. 

To prove axiom (ii), note that for each x e G, 

ai(x) = lxl -1 = x. 


and so ai = l(j. ◄ 

The following two definitions are fundamental. 

Definition. If G acts on X and x e X, then the orbit of x, denoted by 0(x), is the subset 
of X 

0{x) = {gx : g e G} c X; 
the stabilizer of x, denoted by G x , is the subgroup 

G x = {g e G : gx — x } < G. 

If G acts on a set X , define a relation on X by x = y in case there exists g e G with 
y — gx. It is easy to see that this is an equivalence relation whose equivalence classes are 
the orbits. 

Let us find some orbits and stabilizers. 

Example 2.92. 

(i) Cayley’s theorem says that G acts on itself by translations: x g : a i-^ ga. If a e G, then 
the orbit O(a) = G, for if b e G, then b — ( ba~ l )a — x ba -i{o). The stabilizer G a of 
a e G is {1), for if a = x g (a) — ga , then g = 1. We say that G acts transitively on X if 
there is only one orbit. 

(ii) When G acts on G/H (the family of cosets of a subgroup H) by translations r„ : 

aH gaH, then the orbit Q(aH) = G/H, for if bH e G/H, then x ba -i : aH i-^ bH. 
Thus, G acts transitively on G/H. The stabilizer G a n of aH is aHa~ l , for gaH — aH 
if and only if a~ l ga e H if and only if g e aHa~ l . < 
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Example 2.93. 

When a group G acts on itself by conjugation, then the orbit 0(x) is 
{y e G : y — axa~ l for some a e G}; 

in this case, O(x ) is called the conjngacy class of x, and it is commonly denoted by x G . 
For example. Theorem 2.9 shows that if a e S n , then the conjugacy class of a consists of 
all the permutations in S n having the same cycle structure as a. As a second example, an 
element z lies in the center Z(G) if and only if z G = {z}; that is, no other elements in G 
are conjugate to z. 

If x e G, then the stabilizer G x of x is 

Cg(x) = {g e G : gxg~ l = x}. 

This subgroup of G, consisting of all g e G that commute with x, is called the centralizer 
of x in G. a 

Example 2.94. 

Every group G acts on the set X of all its subgroups, by conjugation: If a e G, then a acts 
by H i->- aHa~ l , where H < G. 

If H is a subgroup of a group G, then a conjugate of H is a subgroup of G of the form 

aHa~ l — {aha -1 : h e //}, 

where a e G. 

Since conjugation li ho* aha -1 is an injection H -> G with image aHa~ l , it follows 
that conjugate subgroups of G are isomorphic. For example, in .S 3 , all cyclic subgroups of 
order 2 are conjugate (for their generators are conjugate). 

The orbit of a subgroup H consists of all its conjugates; notice that H is the only element 
in its orbit if and only if H <\G\ that is, aHa~ l — H for all a e G. The stabilizer of H is 

Ng(H) = {g e G : gHg~ l — H}. 

This subgroup of G is called the normalizer of H in G. ◄ 

Example 2.95. 

Let X = the vertices { v 1 , V 2 , V3, V4 1 of a square, and let G be the dihedral group D% 
acting on X, as in Figure 2.8 on page 102 (for clarity, the vertices in the figure are labeled 
1, 2, 3, 4 instead of iq, i>2, V3, V 4 ). 

G = {rotations : (1), (1 2 3 4), (1 3)(2 4), (1 4 3 2); 

reflections : (2 4), (1 3), (1 2)(3 4), (1 4)(2 3)}. 

For each vertex v,- e X, there is some g e G with g iq = vp, therefore, O(v 1 ) = X and 
D 8 acts transitively. 

What is the stabilizer G VI of iq? Aside from the identity, there is only one g e D% 
fixing iq, namely, g = (2 4); therefore G„, is a subgroup of order 2. (This example can be 
generalized to the dihedral group TL„ acting on a regular n-gon.) ■* 
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1 2 4 1 3 4 2 3 



1 4 3 2 2 1 4 3 



Figure 2.8 


Example 2.96. 

Let X — {1,2.let « e S„, and regard the cyclic group G — ( a) as acting on X. If 
i e X, then 

O(i) = [a k (i) : k e Z}. 

Let the complete factorization of a be a — ■ ■ ■ fi ,( a ) , and let i = i\ be moved by a. If 

the cycle involving z'i is fij — (ij h ... z r ). then the proof of Theorem 2.3 shows that 
ik +1 = a k (i i) for all k < r. Therefore, 

O(i) = {z'i, z 2 . i r ], 

where i = i i. It follows that \0(i)\ = r. The stabilizer G ( > of a number £ is G if a fixes 
£; however, if a moves l , then G( depends on the size of the orbit (D(£). For example, if 
a = (1 2 3)(4 5)(6), then G 6 = G,Gi= (a 3 ), and G 4 = (a 2 ). ◄ 

Proposition 2.97. If G acts on a set X, then X is the disjoint union of the orbits. If X is 
finite, then 

\X\ = Y J \0{x i )\, 

i 

where one xi is chosen from each orbit. 

Proof. As we have mentioned earlier, the relation on X, given by x = y if there exists 
g e G with y = gx, is an equivalence relation whose equivalence classes are the orbits. 
Therefore, the orbits partition X. 

The count given in the second statement is correct: Since the orbits are disjoint, no 
element in X is counted twice. • 

Here is the connection between orbits and stabilizers. 

Theorem 2.98. If G acts on a set X and x e X, then 

\0(x)\ = [G : G x ] 


the index of the stabilizer G x in G. 
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Proof. Let G/G x denote the family of all the left cosets of G x in G. We will exhibit 
a bijection (p: G/G x —*■ 0(x), and this will give the result, since \G/G X \ = [G : G x ], 
Define <p: gG x \—r gx. Now <p is well-defined: If gG x — hG x , then h = gf for some 
/ e G A ; that is, fx — x; hence, hx = gfx — gx. Now <p is an injection: if gx = 
<p(gG x ) — <p(hG x ) = hx, then h~ l gx — x; hence, h~ l g e G x , and gG x = hG x . Lastly, 
ip is a surjection: if y e O(x), then y — gx for some g e G, and so y — ip(gG x ). • 

In Example 2.95, Z)g acting on the four corners of a square, we saw that \0(v\)\ = 4, 
IGujI = 2, and [G : G v J = 8/2 = 4. In Example 2.96, G — {a) < S n acting on 
X — {1,2, ..., n], we saw that if, in the complete factorization of a into disjoint cycles 
a = f\ ■■■ fir (a), the r -cycle f>j moves l, then r = \0(£)\ for any l occurring in f>j. 
Theorem 2.98 says that r is a divisor of the order k of a. (But Theorem 2.25 tells us more: 
k is the 1 cm of the lengths of the cycles occurring in the factorization.) 

Corollary 2.99. If a finite group G acts on a set X, then the number of elements in any 
orbit is a divisor of \ G\. 

Proof. This follows at once from Lagrange’s theorem. • 

In Example 2.5(i), there is a table displaying the number of permutations in 54 of each 
cycle structure; these numbers are 1, 6 , 8 , 6 , 3. Note that each of these numbers is a divisor 
of IS 4 I = 24, In Example 2.5(ii), we saw that the corresponding numbers are 1, 10, 20, 30, 
24, 20, and 15, and these are all divisors of | S$ \ = 120. We now recognize these subsets 
as being conjugacy classes, and the next corollary explains why these numbers divide the 
group order. 

Corollary 2.100. If x lies in a finite group G, then the number of conjugates of x is the 
index of its centralizer: 

|x G | = [G : C G (x)], 

and hence it is a divisor of | G |. 

Proof. As in Example 2.93, the orbit of x is its conjugacy class x G , and the stabilizer G x 
is the centralizer C G (x). • 

Proposition 2.101. IfH is a subgroup of a finite group G, then the number of conjugates 
of H in G is [G : N c (H)]. 

Proof. As in Example 2.94, the orbit of H is the family of all its conjugates, and the 
stabilizer is its normalizer Nq(H). • 

There are some interesting applications of group actions to counting problems, which 
we will give at the end of this section. Let us first apply group actions to group theory. 

When we began classifying groups of order 6 , it would have been helpful to be able to 
assert that any such group has an element of order 3 (we were able to use an earlier exercise 
to assert the existence of an element of order 2). We now prove that if p is a prime divisor 
of | G |, where G is a finite group G, then G contains an element of order p. 
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Theorem 2.102 {Cauchy). If G is a finite group whose order is divisible by a prime p, 
then G contains an element of order p. 

Proof. We prove the theorem by induction on m > 1, where |G| = pm. The base step 
m — 1 is true, for Lagrange’s theorem shows that every nonidentity element in a group of 
order p has order p. 

Let us now prove the inductive step. If x € G, then the number of conjugates of x 
is |x G | = [G : Cg(x )], where Cc{x) is the centralizer of x in G. As noted earlier, if 
x f Z{G), then x G has more than one element, and so \Cg{x)\ < |G|. If p \ \Cg(x)\ 
for some noncentral x, then the inductive hypothesis says there is an element of order p 
in Cg(x) < G, and we are done. Therefore, we may assume that p \ \Cg(x)\ for all 
noncentral x e G. Better, since p is a prime and |G| = [G : Cg(x)]\Cg(x)\, Euclid’s 
lemma gives 

P I [G : Cg(x)]. 

After recalling that Z(G) consists of all those elements x e G with \x G \ — 1, we may 
use Proposition 2.97 to see 

|G| = |Z(G)| + £ [G:C G ( Xi )i, 

i 

where one x\ is selected from each conjugacy class having more than one element. Since 
|G| and all [G : Cc{xi )] are divisible by p. it follows that |Z(G)| is divisible by p. But 
Z(G) is abelian, and so Proposition 2.78 says that Z(G), and hence G, contains an element 
of order p. • 

Definition. The class equation of a finite group G is 

IG| = \Z(G)\ + J2 IG : C G (xi)i, 

i 

where one Xj is selected from each conjugacy class having more than one element. 

Definition. If p is a prime, then a finite group G is called tip-group if | G | = p" for some 
n > 0. (See Exercise 2.81 on page 112 for the definition of an infinite p-group.) 

We have seen examples of groups whose center is trivial; for example, Z{S-}) — {1}. 
For p-groups, however, this is never true. 

Theorem 2.103. If p is a prime and G is a p-group, then Z(G) f {1}. 

Proof. Consider the class equation 

IG| = \Z(G)\ + J2 [G ■ C G (x ,)]. 

i 

Each Cc(xi) is aproper subgroup of G, for x,- f Z{G). Since G is a p-group, [G : Cg(xi )] 
is a divisor of |G|, hence is itself a power of p. Thus, p divides each of the terms in the 
class equation other than |Z(G)|, and so p | |Z(G)| as well. Therefore, Z(G) ^ {11. • 
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Corollary 2.104. If p is a prime, then every group G of order p 2 is abelian. 

Proof. If G is not abelian, then its center Z(G) is a proper subgroup, so that |Z(G)| = 1 
or p, by Lagrange’s theorem. But Theorem 2.103 says that Z(G) f {1}, and so |Z(G)| = 
p. The center is always a normal subgroup, so that the quotient G/Z(G) is defined; it has 
order p, and hence G/Z(G) is cyclic. This contradicts Exercise 2.69 on page 95. • 

Example 2.105. 

Who would have guessed that Cauchy’s theorem (if G is a group whose order is a multiple 
of a prime p , then G has an element of order p) and Fermat’s theorem (if p is prime, 
then a p = a mod p) are special cases of some common theorem? The elementary yet 
ingenious proof of Cauchy’s theorem is due to J. H. McKay in 1959 (see Montgomery 
and Ralston, Selected Papers in Algebra)', A. Mann showed me that McKay’s argument 
also proves Fermat’s theorem. If G is a finite group and p is a prime, denote the cartesian 
product of p copies of G by G p , and define 

X = {(urn a\,..., ap— i) € G p ; aoa\ ... ap— \ — 1}. 

Note that |X| = |G| P_1 , for having chosen the last p — 1 entries arbitrarily, the 0th entry 
must equal (a \ an ■ ■ ■ a p - 1 ) -1 . Introduce an action of I p on X by defining, for 0 < i < 
P - 1. 

[/] (^o. ti \,..., ap~ i) = (ai. ai \ \..... a p—t , ao, a \,..., ai ). 

The product of the entries in the new /5-tuple is a conjugate of aoa\ ■ ■ ■ a p - 1 : 

a, a, ( i • • - a p _ jug^t • • • £t; = (uo a t • ■ • af) ^ (flo^i ■ • • cip—\ )(uo a i • • • fl;). 

This conjugate is 1 (for g _1 lg = 1), and so [i](ao> «i,..., a p -\) e X. By Corollary 2.99, 
the size of every orbit of X is a divisor of |I p | = p\ since p is prime, these sizes are either 
1 or p. Now orbits with just one element consist of a /5-tuple all of whose entries a, are 
equal, for all cyclic permutations of the /5-tuple are the same. In other words, such an orbit 
corresponds to an element a e G with ci p = 1. Clearly, (1,1,..., 1) is such an orbit; if it 
were the only such, then we would have 

\G\ p ~ l = \X\ = 1 +kp 

for some k > 0; that is, \G\ p ~ l = 1 mod p. If p is a divisor of |G|, then we have a 
contradiction, for |G| P_1 = 0 mod p. We have thus proved Cauchy’s theorem: If a prime 
p is a divisor of | G |, then G has an element of order p. 

Suppose now that G is a group of order n, say, G = I„, and that p is not a divisor of 
n. By Lagrange’s theorem, G has no elements of order p, so that if a p = 1, then a = 1. 
Therefore, the only orbit in G p of size 1 is (1, 1,..., 1), and so 

n p_1 = \G\ p ~ l = \X\ = 1 +kp; 

that is, if p is not a divisor of n, then n p ~ l = 1 mod p. Multiplying both sides by n, we 
have n p = n mod p, a congruence also holding when p is a divisor of n\ this is Fermat’s 
theorem. < 
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We have seen, in Proposition 2.64, that A 4 is a group of order 12 having no subgroup 
of order 6 . Thus, the assertion that if d is a divisor of |G|, then G must have a subgroup of 
order d, is false. However, this assertion is true when G is a p-group. 

Proposition 2.106. If G is a group of order |G| = p e , then G has a normal subgroup of 

order p k for every k < e. 

Proof We prove the result by induction on e > 0. The base step is obviously true, and so 
we proceed to the inductive step. By Theorem 2.103, the center of G is a nontrivial normal 
subgroup: Z(G) f {1}. Let Z < Z(G) be a subgroup of order p\ as any subgroup of 
Z(G), the subgroup Z is a normal subgroup of G. If k < e, then P k ~ X < P e ~ l = \G/Z\. 
By induction, G/Z has a normal subgroup H* of order p k 1 . The correspondence theorem 
says there is a subgroup H of G containing Z with H* = H/Z\ moreover, H* <1 G/Z 
implies H < G. But \H/Z\ — p k ~ [ implies \H\ = p k , as desired. • 

Abelian groups (and the quaternions) have the property that every subgroup is normal. 
At the opposite pole are groups having no normal subgroups other than the two obvious 
ones: {1} and G. 

Definition. A group G f {1} is called simple if G has no normal subgroups other than 
{1} and G itself. 


Proposition 2.107. An abelian group G is simple if and only if it is finite and of prime 
order. 

Proof If G is finite of prime order p , then G has no subgroups H other than {1} and 
G, otherwise Lagrange’s theorem would show that |H| is a divisor of p. Therefore, G is 
simple. 

Conversely, assume that G is simple. Since G is abelian, every subgroup is normal, and 
so G has no subgroups other than {1} and G. Choose x e G with x f 1. Since (x) is a 
subgroup, we have (x) — G. If x has infinite order, then all the powers of x are distinct, 
and so (x 2 ) < (x) is a forbidden subgroup of {x ), a contradiction. Therefore, every x e G 
has finite order. If x has (finite) order m and if m is composite, say m — kl, then (x k ) is a 
proper nontrivial subgroup of (x), a contradiction. Therefore, G = (x) has prime order. • 

We are now going to show that A 5 is a nonabelian simple group (indeed, it is the smallest 
such; there is no nonabelian simple group of order less than 60). 

Suppose that an element x e G has k conjugates; that is 

\x G \ = \{gxg~ l : g eG}\=k. 

If there is a subgroup H < G with x e H < G, how many conjugates does x have in HI 
Since 

x H — {hxir 1 : h e H] c {gxg -1 :geG) = x G , 
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we have \x H \ < \x G \. It is possible that there is strict inequality \x H \ < \x G \. For example, 
take G = S 3 , x = (1 2), and H — (x). We know that \x G = 3 (because all transpositions 
are conjugate), whereas \x H \ = 1 (because H is abelian). 

Now let us consider this question, in particular, for G — S$, x = (1 2 3), and H — A 5 . 

Lemma 2.108. All 3-cycles are conjugate in A 5 . 

Proof. Let G = S 5 , a — (1 2 3), and H — A 5 . We know that a Ss = 20, for there 
are twenty 3-cycles in S 5 , as we saw in Example 2.5(h). Therefore, 20 = | 5 , 5 |/|Cs J (o')| = 
120/1CT^(or)|, by Corollary 2.100, so that |Cs 5 (a)| = 6 ; that is, there are exactly six 
permutations in S 5 that commute with a. Here they are: 

(1), (1 2 3), (1 3 2), (4 5), (4 5)(1 2 3), (4 5)(1 3 2). 

The last three of these are odd permutations, so that |Ca 5 (g:)| = 3. We conclude that 

|a As | = |A 5 |/|Ca 5 («)| = 60/3 = 20; 

that is, all 3-cycles are conjugate to a — (1 2 3) in A 5 . • 

This lemma can be generalized from A 5 to all A n for n > 5; see Exercise 2.91 on 
page 113. 

Lemma 2.109. Ifn > 3, every element in A„ is a 3-cycle or a product of 3-cycles. 

Proof. If a e A„, then a is a product of an even number of transpositions: 


Ot — X 1 T 2 * * • T2q— 1 T 2 q • 

Of course, we may assume that adjacent r’s are distinct. As the transpositions may be 
grouped in pairs T 2 /- 1 T 2 ;, it suffices to consider products rr', where r and r' are transpo¬ 
sitions. If r and r' are not disjoint, then r = (i j ), r' = (i k ), and rr' = (i k j)\ if r and 
r' are disjoint, then rr' = (i j)(k l) — (i j)(j k)(j k)(k l) — (i j k)(j k £). • 

Theorem 2.110. A 5 is a simple group. 

Proof. We shall show that if PI is a normal subgroup of A 5 and PI / {(1)}, then H = A 5 . 
Now if H contains a 3-cycle, then normality forces PI to contain all its conjugates. By 
Lemma 2.108, PI contains every 3-cycle, and by Lemma 2.109, PI — A 5 . Therefore, it 
suffices to prove that PI contains a 3-cycle. 

As PI {(1)}, it contains some o / (1). We may assume, after a harmless relabeling, 
that either a = (1 2 3), 0 = (1 2) (3 4), or a — (1 2 3 4 5). As we have just remarked, we 
are done if a is a 3-cycle. 

If er = (1 2)(3 4), define r = (1 2)(3 5). Now PI contains (rar _ 1 )cr _1 , because 
it is a normal subgroup, and rar _ 1 er _1 = (3 5 4), as the reader should check. If a = 
(1 2 3 4 5), define p = (1 3 2); now PI contains pap - 1 or -1 = (13 4), as the reader 
should also check. 

We have shown, in all cases, that PI contains a 3-cycle. Therefore, the only normal 
subgroups in A 5 are {(1)} and A 5 itself, and so A 5 is simple. • 
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Theorem 2.110 turns out to be the basic reason why the quadratic formula has no gen¬ 
eralization giving the roots of polynomials of degree 5 or higher (see Theorem 4.27). 

Without much more effort, we can prove that the alternating groups A„ are simple for 
all n > 5. Observe that A 4 is not simple, for the four-group V is a normal subgroup of A 4 . 


Lemma 2.111. A(, is a simple group. 

Proof. Let H {(1)} be a normal subgroup of Ag; we must show that H — A&. Assume 
that there is some a e H with a f ( \) that fixes some ;, where 1 < i < 6 . Define 

F = {0 e A(, : o(i) = i}. 

Note that a e H IT F, so that H D F {(1)}- The second isomorphism theorem gives 
H H F <\ F . But F is simple, for F = A 5 , and so the only normal subgroups in F are {(1)} 
and F. Since H C\ F {(1)}, we have H IT F = F\ that is, F < H. It follows that H 
contains a 3-cycle, and so H = A(„ by Exercise 2.91 on page 113. 

We may now assume that there is no a e FI with a f (1) that fixes some i with 
1 < i < 6 . If we consider the cycle structures of permutations in A 5 , however, any such 
a must have cycle structure (1 2)(3 4 5 6 ) or (1 2 3)(4 5 6 ). In the first case, a 2 e H 
is a nontrivial permutation that fixes 1 (and also 2), a contradiction. In the second case, H 
contains a(fta~ l where ft = (2 3 4), and it is easily checked that this is a nontrivial 
element in H which fixes 1, another contradiction. Therefore, no such normal subgroup H 
can exist, and so Ag is a simple group. • 


Theorem 2.112. A„ is a simple group for all n > 5. 

Proof. If H is a nontrivial normal subgroup of A„, that is, H f {(1)}, then we must show 
that H — A„; by Exercise 2.91 on page 113, it suffices to prove that H contains a 3-cycle. 
If f e H is nontrivial, then there exists some i that f moves; say, f{i) — j ^ i. Choose 
a 3-cycle a that fixes i and moves j. The permutations a and ft do not commute: fta(i) = 
ft(i) — j , while aft(i) — a(j) j. It follows that y — (afta~ l )ftl~ l is a nontrivial 
element of H. But /la -1 /! -1 is a 3-cycle, by Theorem 2.9, and so y = ft~ l ) is a 

product of two 3-cycles. Hence, y moves at most 6 symbols, say, / 1 (if y moves 
fewer than 6 symbols, just adjoin others so we have a list of 6 ). Define 

F — {cr e A„ : o fixes all i f=i 1 ,..., ie}- 

Now F = A(, and y e H fl F. Hence, H IT F is a nontrivial normal subgroup of F. But 
F is simple, being isomorphic to A 5 , and so H fl F = F; that is, F < H. Therefore, H 
contains a 3-cycle, and so H = A„; the proof is complete. • 

We now use groups to solve some difficult counting problems. 
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Theorem 2.113 (Burnside’s Lemma 17 ). Let G act on a finite set X. If N is the number 
of orbits, then 

1 \ - 

N = - > Fix(r), 

' G ' h 

where Fix(r) is the number of x e X fixed by x. 

Proof List the elements of X as follows: Choose x\ e X , and then list all the ele¬ 
ments xi, X 2 , ... ,x r in the orbit 0 (x i); then choose x r +\ f 0(x i), and list the elements 
x r +i, x r +2, ■ • • in 0(x r + 1 ); continue this procedure until all the elements of X are listed. 
Now list the elements T\, xj, ■ •., t„ of G, and form the following array, where 

I I if t; fixes Xj 
n f 

0 tl Xj moves xj. 



X] 

X 2 

X r +1 

Xr+2 

Tl 

fl.l 

ft.2 ■■■ 

/l,r+l 

fl.r+2 ■■■ 

T2 

fl, 1 

fl,2 

/2,r+l 

f2.r+2 ■ ■ ■ 

T i 

fi.l 

fi,2 ' ' ' 

fi,r+1 

fi.r+2 ■ ■ ■ 

T„ 

fn,\ 

fn,2 ■ ■ 

fn.r+1 

fn.r+2 


Now Fix(r,), the number of x fixed by ry, is the number of l’s in the ;th row of the 
array; therefore, Fix(r) is the total number of l’s in the array. Let us now look at 

the columns. The number of l’s in the first column is the number of r,• that fix x\\ by 
definition, these Xj comprise G X1 . Thus, the number of 1 ’s in column 1 is | G X] \. Similarly, 
the number of 1 ’s in column 2 is | G X2 \. By Exercise 2.99 on page 114, | G X] \ — \G X1 \. By 
Theorem 2.98, the number of l’s in the r columns labeled by the x; e 0(x i) is thus 

r\G xl \ = \0{x\)\ ■ \G XI \ = (|G|/|G X1 |) \G X1 \ = |G|. 

The same is true for any other orbit: Its columns contain exactly |G| l’s. Therefore, if 
there are N orbits, there are A' |G l’s in the array. We conclude that 

^Fix(r) = (V|G|. • 

reG 


We are going to use Burnside’s lemma to solve problems of the following sort. How 
many striped flags are there having six stripes (of equal width) each of which can be colored 
red, white, or blue? Clearly, the two flags in Figure 2.9 are the same: The bottom flag is 
just the top one turned over. 

17 Burnside himself attributed this lemma to F. G. Frobenius. To avoid the confusion that would be caused by 
changing a popular name. P. M. Neumann has suggested that it be called "not-Burnside’s lemma.” W. Burnside 
was a fine mathematician, and there do exist theorems properly attributed to him. For example, Burnside proved 
that if p and q are primes, then there are no simple groups of order p'"q n . 
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Figure 2.9 


Let X be the set of all 6 -tuples of colors; if x e X , then 

X — (c 1 , C2, C 3 , C4, C 5 , C 6 ), 


where each c, denotes either red, white, or blue. Let r be the permutation that reverses all 
the indices; 


/I 2 3 4 5 
V 6 5 4 3 2 


= (1 6 )(2 5)(3 4) 


(thus, r “turns over” each 6 -tuple x of colored stripes). The cyclic group G — (r) acts 
on X; since |G| =2, the orbit of any 6 -tuple x consists of either 1 or 2 elements: Either 
r fixes a - or it does not. Since a flag is unchanged by turning it over, it is reasonable to 
identify a flag with an orbit of a 6 -tuple. For example, the orbit consisting of the 6 -tuples 


(r, w, b, r, w, b) and (b, w, r, b, w, r) 


describes the flag in Figure 2.9. The number of flags is thus the number N of orbits; by 
Burnside’s lemma, N = ^[Fix((l)) + Fix(r)]. The identity permutation (1) fixes every 
a - e X , and so Fix((l)) = 3 6 (there are 3 colors). Now r fixes a 6 -tuple x if it is a 
“palindrome,” that is, if the colors in x read the same forward as backward. For example, 

x — (r, r, w , w, r, r) 


is fixed by r. Conversely, if 


A' = (Cl, C2, C 3 , C4, C 5 , Cfi) 

is fixed by r = (1 6 )(2 5)(3 4), then c i = eg, C 2 — C 5 , and C 3 = C 4 ; that is, a' is a 
palindrome. It follows that Fix(r) = 3 3 , for there are 3 choices for each of ci, C 2 , and C 3 . 
The number of flags is thus 

N= i(3 6 + 3 3 ) = 378. 

Let us make the notion of coloring more precise. 

Definition. If a group G acts on X = {1, and if C is a set of q colors , then G acts 

on the set C" of all n -tuples of colors by 

r(ci,..., c„ ) = (c T i,..., c T „) for all r e G. 

An orbit of (ci,... ,c n ) e C” is called a (q, G)-coloring of X. 
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Example 2.114. 

Color each square in a 4 x 4 grid red or black (adjacent squares may have the same color; 
indeed, one possibility is that all the squares have the same color). 

If X consists of the 16 squares in the grid and if C consists of the two colors red and 
black, then the cyclic group G = ( R) of order 4 acts on X, where R is clockwise rotation 
by 90°; Figure 2.10 shows how R acts: The right square is R’s action on the left square. In 
cycle notation, 

R = ( 1, 4, 16, 13)(2, 8, 15, 9)(3, 12, 14, 5)(6, 7, 11, 10), 

R 2 = ( 1, 16)(4, 13)(2, 15)(8, 9)(3, 14)(12, 5)(6, 11)(7, 10), 

R 3 = ( 1, 13, 16, 4)(2, 9, 15, 8)(3, 5, 14, 12)(6, 10, 11, 7). 

A red-and-black chessboard does not change when it is rotated; it is merely viewed from a 

different position. Thus, we may regard a chessboard as a 2-coloring of X\ the orbit of a 
16-tuple corresponds to the four ways of viewing the board. 

By Burnside’s lemma, the number of chessboards is 

±[Vix((l)) + Fi x(R) + Fi x(R 2 ) + Fix(/? 3 )]. 

Now Fix((l)) = 2, for every 16-tuple is fixed by the identity. To compute Fix(7?), note 
that squares 1,4, 16, 13 must all have the same color in a 16-tuple fixed by R. Similarly, 
squares 2, 8, 15, 9 must have the same color, squares 3, 12, 14, 5 must have the same color, 
and squares 6, 7, 11, 10 must have the same color. We conclude that Fi x(R) — 2, note 
that the exponent 4 is the number of cycles in the complete factorization of R. A similar 
analysis shows that Fi\(R 2 ) — 2 8 , for the complete factorization of R 2 has 8 cycles, and 
Fi x(R 3 ) — 2 , because the cycle structure of R 3 is the same as that of R. Therefore, the 
number N of chessboards is 

N=\ [2 16 + 2 4 + 2 8 + 2 4 ] = 16,456. ◄ 
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We now show, as in Example 2.114, that the cycle structure of a permutation r allows 
one to calculate Fix(r). 

Lemma 2.115. Let C be a set of q colors, and let G be a subgroup of S n . //r e G, then 

Fix(r) = q ,(r \ 

where t(r) is the number of cycles in the complete factorization of r. 

Proof. Since r(ci, ..., c„) — (c T i,..., c rn ) — (ci, ..., c„), we see that c T( - = c/ for all 
i, and so r i has the same color as i. It follows, for all k, that r k i has the same color as 
i, that is, all points in the orbit of i acted on by (r) have the same color. If the complete 
factorization of r is x — fi\ ■ ■ ■ r). and if i occurs in (ij , then Example 2.96 shows that 

the orbit containing i is the set of symbols occurring in f j. Thus, for an n -tuple to be fixed 
by r, all the symbols involved in each of the t (r) cycles must have the same color; as there 
are q colors, there are thus q ,lj hi -tuples fixed by r. • 


Corollary 2.116. 

X, then 


Let G act on a finite set X. If N is the number of (q, G)-colorings of 


N = —Yq ,(T) 
\a\ 2-^ 


T€ G 


where t(x) is the number of cycles in the complete factorization oft. 


There is a generalization of this technique, due to G. Polya (see Biggs, Discrete Math¬ 
ematics ), giving a formula, for example, that counts the number of red, white, blue, and 
green flags having 20 stripes exactly 7 of which are red and 5 of which are blue. 


Exercises 


2.78 If a and h are elements in a group G. prove that ab and ba have the same order. 

Hint. Use a conjugation. 

2.79 Prove that if G is a finite group of odd order, then no x e G, other than x = 1, is conjugate to 
its inverse. 

Hint. If x is conjugate to x~f how many elements are in x G ? 

2.80 Prove that no pair of the following groups of order 8, 

Ig; I4 x IF; 1 2 * 1 2 ^ I21 7^8» Q- 

are isomorphic. 

2.81 Prove that if p is a prime and G is a finite group in which every element has order a power 
of p. then G is a p-group. (A possibly infinite group G is called a p-group if every element in 
G has order a power of p.) 

Hint. Use Cauchy's theorem. 
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2.82 Define the centralizer Cq{H) of a subgroup H < G to be 

Cq(H) = [x e G : xh = /lx for all h e H). 

(i) For every subgroup H < G, prove that Cq(H) <1 N G (H). 

(ii) For every subgroup H < G, prove that N G (H)/C G (H) is isomorphic to a subgroup of 
Aut (H). 

Hint. Generalize the homomorphism F in Exercise 2.64 on page 82. 

2.83 Show that 64 has a subgroup isomorphic to Dg. 

2.84 Prove that S 4 /V = S 3 . 

Hint. Use Proposition 2.90. 

2.85 (i) Prove that A 4 ^ D[ 2 - 

Hint. Recall that A 4 has no element of order 6 . 

(ii) Prove that Dj 2 = S 3 x I 2 . 

Hint. Each element x e D \2 has a unique factorization of the form x — b'a, where 
b 6 = 1 and a 2 = 1 . 

2.86 (i) If G is a group, then a normal subgroup H <] G is called a maximal normal subgroup 

if there is no normal subgroup K of G with H < K < G. Prove that a normal subgroup 
H is a maximal normal subgroup of G if and only if G/H is a simple group. 

(ii) Prove that every finite abelian group G has a subgroup of prime index. 

Hint. Use Proposition 2.107. 

(iii) Prove that Ag has no subgroup of prime index. 

2.87 Prove that H <3 N G (H) and that N G (H) is the largest subgroup of G containing H as a normal 
subgroup. 

2.88 Find N G (H) if G = S 4 and H = ((12 3)). 

2.89 (i) If H is a subgroup of G and if x e H , prove that 

ChG) = H n C G (x). 

(ii) If H is a subgroup of index 2 in a finite group G and if x e H, prove that \x H \ — \x G \ 

or |.v^| = where x H is the conjugacy class of x in H. 

Hint. Use the second isomorphism theorem. 

(iii) Prove that there are two conjugacy classes of 5-cycles in A 5 , each of which has 12 
elements. 

1 1 120 

Hint. If a = (1 2 3 4 5), then |C<; 5 (ff)| = 5 because 24 = -—; hence 

Cs 5 (a ) = (a). Whatis C 4 5 (»)? 

(iv) Prove that the conjugacy classes in A 5 have sizes 1, 12, 12, 15, and 20. 

2.90 (i) Prove that every normal subgroup H of a group G is a union of conjugacy classes of G. 

one of which is { 1 }. 

(ii) Use part (i) and Exercise 2.89 to give a second proof of the simplicity of A 5 . 

2.91 (i) For all n > 5, prove that all 3- cycles are conjugate in A„. 

Hint. Show that (12 3) and (i j k) are conjugate, in two steps: First, if they are not 
disjoint (so the permutations move at most 5 letters); then, if they are disjoint. 
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(ii) Prove that if a normal subgroup H <1 A„ contains a 3-cycle, where n > 5, then H = A, 

( Remark. We have proved this in Lemma 2.109 when n = 5.) 

2.92 Prove that the only normal subgroups of 54 are {(1)}, V, A 4 , and S 4 . 

Hint. Use Theorem 2.9, checking the various cycle structures one at a time. 

2.93 Prove that A 5 is a group of order 60 that has no subgroup of order 30. 

Hint. Use Proposition 2.62(ii). 

2.94 (i) Prove, for all n > 5, that the only normal subgroups of S„ are ((D), A„ , and S n . 

(ii) Prove that if n > 3, then A n is the only subgroup of S n of order !. 

Hint. If H is a second such subgroup, then H is normal in S n and hence H IT A„ is 

normal in A n . 

(iii) Prove that S 5 has no subgroup of order 30. 

Hint. Use the representation on the cosets of a supposed subgroup of order 30, as well 
as the simplicity of A 5 . 

(iv) Prove that S 5 contains no subgroup of order 40. 

2.95 Let G be a subgroup of S„. 

(i) If G T A„ = {1}, prove that |G| < 2. 

(ii) If G is a simple group with more than 2 elements, prove that G < A n . 

2.96 (i) If n > 5, prove that S n has no subgroup of index r, where 2 < r < n. 

(ii) Prove that if n > 5, then A„ has no subgroup of index r, where 2 < r < n. 

2.97 (i) Prove that if a simple group G has a subgroup of index n > 1, then G is isomorphic to 

a subgroup of S n . 

Hint. Kernels are normal subgroups. 

(ii) Prove that an infinite simple group (such do exist) has no subgroups of finite index 
n > 1. 

Hint. Use part (i). 

2.98 Let G be a group with |G| = mp, where p is a prime and 1 < m < p. Prove that G is not 
simple. 

Hint. Show that G has a subgroup H of order p, and use the representation of G on the 
cosets of H. 

Remark. Of all the numbers smaller than 60, we can now show that all but 11 are not 
orders of nonabelian simple groups (namely, 12, 18, 24, 30, 36, 40, 45, 48, 50, 54. 56). 
Theorem 2.103 eliminates all prime powers (for the center is always a normal subgroup), and 
this exercise eliminates all numbers of the form mp, where p is a prime and m < p. (We 
can complete the proof that there are no nonabelian simple groups of order less than 60 using 
Sylow's theorem; see Proposition 5.41.) ◄ 

2.99 (i) Let a group G act on a set X, and suppose that x, y e X lie in the same orbit: y = gx 

for some g e G. Prove that G v = gG x g~^ • 

(ii) Let G be a finite group acting on a set X; prove that if x, y e X lie in the same orbit, 
then \G X \ — IG v I - 

2.100 How many flags are there with n stripes each of which can be colored any one of q given 
colors? 

Hint. The parity of n is relevant. 
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2.101 Let X be the squares in an n x n grid, and let p be a rotation by 90°. Define a chessboard 
to be a (q, G)-coloring, where the cyclic group G = (p) of order 4 is acting. Show that the 
number of chessboards is 

l(/+ ? L(» 2 +l)/2j +2(? L(» 2 +3)/4j^ 
where [vj is the greatest integer in the number .v. 

2.102 Let X be a disk divided into n congruent circular sectors, and let p be a rotation by (360/n)°. 

Define a roulette wheel to be a (q, G)-coloring, where the cyclic group G = (p) of order n is 
acting. Prove that if n — 6, then there are ^(2 cj + 2 q~ + + q^) roulette wheels having 6 

sectors. 

The formula for the number of roulette wheels with n sectors is 

\^ 4 >(n/d.)q d , 

d\n 

where <f> is the Euler 0-function. 

2.103 Let X be the vertices of a regular /?-gon, and let the dihedral group G = D 2 n act ( as th e 
usual group of symmetries [see Example 2.28]). Define a bracelet to be a (q, G)-coloring of a 
regular n-gon, and call each of its vertices a bead. (Not only can we rotate a bracelet, we can 
alsoflip it: that is, turn it upside down by rotating it in space about a line joining two beads.) 

(i) How many bracelets are there having 5 beads, each of which can be colored any one of 
q available colors? 

Hint. The group G = D\q is acting. Use Example 2.28 to assign to each symmetry a 
permutation of the vertices, and then show that the number of bracelets is 

JoO ? 5 + + 5 q 3 ). 

(ii) How many bracelets are there having 6 beads, each of which can be colored any one of 
q available colors? 

Hint. The group G = D\i is acting. Use Example 2.28 to assign to each symmetry a 
permutation of the vertices, and then show that the number of bracelets is 


T 2q 4 + 4 q'’ + 3q~ 4- 2 q'). 
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3.1 Introduction 

As in Chapters 1 and 2, this chapter contains some material usually found in an earlier 
course; proofs of such results are only sketched, but other theorems are proved in full. 
We begin by introducing commutative rings, the most prominent examples being Z, Q, 
R, and C, as well as I m , polynomials, real-valued functions, and finite fields. We will 
also give some of the first results about vector spaces (with scalars in any field) and linear 
transformations. Canonical forms, which classify similar matrices, will be discussed in 
Chapter 9. 


3.2 First Properties 


We begin with the definition of commutative ring. 

Definition. A commutative ring 1 R is a set with two binary operations, addition and 
multiplication, such that 

(i) R is an abelian group under addition; 

(ii) ( commutativity) ab — ba for all a,b <= R; 

(iii) (associativity) a(bc) — (ab)c for every a,b, c e R\ 

'This term was probably coined by D. Hilbert, in 1897, when he wrote Zahlring. One of the meanings of the 
word ring, in German as in English, is collection, as in the phrase “a ring of thieves.” (It has also been suggested 
that Hilbert used this term because, for a ring of algebraic integers, an appropriate power of each element “cycles 
back” to being a linear combination of lower powers.) 
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(iv) there is an element 1 e R with la — a for every a e R' 2 

(v) ( distributivity ) a(b + c) = ah + at: for every a, b, c e R. 

The element 1 in a ring R has several names; it is called one , the unit of R, or the 
identity in R. 

Addition and multiplication in a commutative ring R are binary operations, so there are 


functions 

a : R x R 

-» R 

with 

a(r, r') — r + r' e R 

and 

p. : R x R - 

-* R 

with 

jl(r, r') — rr' e R 

for all r, r e R. The law of substitution holds here, as it does for any operation: If r — r 
and s = s', then r + s = r' + s' and rs = r's'. 


Example 3.1. 

(i) Z, Q, E, and C are commutative rings with the usual addition and multiplication (the 
ring axioms are verified in courses in the foundations of mathematics). 

(ii) I m , the integers mod m, is a commutative ring. 

(iii) Let Z[i] be the set of all complex numbers of the form a + bi, where a, b e Z and 
i 2 = —1. It is a boring exercise to check that Z[i] is, in fact, a commutative ring (this 
exercise will be significantly shortened, in Exercise 3.8 on page 124, once the notion of 
subring has been introduced). Z[i] is called the ring of Gaussian integers. 

(iv) Consider the set R of all real numbers x of the form 

x — a + bco , 

where a, b e Q and co — \fl. It is easy to see that R is closed under ordinary addition. 
However, if R is closed under multiplication, then co 2 e R. and there are rationals a and b 
with 

co 2 — a + bco. 

Multiplying both sides by co and by b gives the equations 

2 = aco + bco 2 
bco 2 — ab + b 2 co. 


Hence, 2 — aco = ab + b 2 co, and so 

2 — ab = (b 2 + a)co. 

If b 2 + a ^ 0, then co — s/l is rational; if b 2 + a — 0, then this coupled with 2 — ab — 0 
yields 2 = (~b) 3 . Thus, either case forces \/2 rational, and this contradiction shows that 
R is not a commutative ring. -4 


2 Some authors do not demand that commutative rings have 1. For them, the set of all even integers is a 
commutative ring, but we do not recognize it as such. 
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Remark. There are noncommutative rings; that is, sets having an addition and a mul¬ 
tiplication satisfying all the axioms of a commutative ring except the axiom: ab — ba. 
[Actually, the definition replaces the axiom la = a by la = a = al, and it replaces the 
distributive law by two distributive laws, one on either side: a(b + c) = ab + ac and 
(b + c)a — ba + ca.] For example, it is easy to see that the set of all n x n real matrices, 
equipped with the usual addition and multiplication, satisfies all the new ring axioms. We 
shall study noncommutative rings in Chapter 8 . ◄ 

Here are some elementary results. 


Proposition 3.2. Let R be a commutative ring. 

(i) 0 ■ a = Ofor every a e R. 

(ii) If 1 =0, then R consists of the single element 0. In this case, R is called the zero 
ring. 3 

(iii) If—a is the additive inverse of a, then (—1)(— a) = a. 

(iv) (— 1 )a — —a for every a e R. 

(v) If n e N and n 1 =0, then na — Ofor all a e R. 

(vi) The binomial theorem holds'. If a , b e R, then 


(a + b) n 



Sketch of Proof, (i) 0 • a — (0 + 0) • a = 0 • a + 0 • a. 

(ii) a = l- a= 0 - a= 0 . 

(iii) 0 = (-1 + 1 )(—a) = (— 1 )(—a) + (-a). 

(iv) Since (—1)(— a) — a, we have (—1)(—1)(— a) = (—l)a. But (— 1)(— 1) = 1. 

(v) In Chapter 2, we defined the powers a" of an element in a group, where n > 0. In an 
additive group, na is a more appropriate notation than a ", and the notation na, for n e Z 
and a e R, has this meaning in R: that is, na is the sum of a with itself n times. 

If a e R and « e Z is positive, then n 1 = 0 implies 

na — n(la) = {nl)a = 0a — 0. 


(vi) Induction on n > 0 using the identity ("+ 1 ) = ( r "j) + (") for 0 < r < n + 1 . (We 
agree that c/° = 1 for all a e R, even for a — 0.) • 

A subring S of a commutative ring R is a commutative ring contained in a larger com¬ 
mutative ring R so that S and R have the same addition, multiplication, and unit. 

'The zero ring is not a very interesting ring, but it does arise occasionally. 
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Definition. A subset S of a commutative ring R is a subring of R if 

(i) 1 e S ; 4 

(ii) if a, b e S , then a — b e S', 

(iii) if a, b e S , then ab e S. 

Notation. In contrast to the usage H < G for a subgroup, the tradition in ring theory is 
to write S C R for a subring. We shall also write SCR to denote a proper subring; that 
is, S c R and S ^ R. 

Proposition 3.3. A subring S of a commutative ring R is itself a commutative ring. 

Sketch of Proof. The first condition says that S is a subgroup of the additive group R. The 
other conditions are identities that hold for all elements in R , and hence hold, in particular, 
in S. For example, associativity a (foe) = (ab)c holds for all a. fo. c e R, and so it holds, in 
particular, for all a, fo, c e S c R. • 

Of course, one advantage of the notion of subring is that fewer ring axioms need to be 
checked to determine whether a subset of a commutative ring is itself a commutative ring. 

Exercise 3.4 on page 124 gives a natural example of a commutative ring S contained in 
a commutative ring R in which both S and R have the same addition and multiplication, 
but whose units are distinct (and so S is not a subring of R). 

Example 3.4. 

If n > 3 is an integer, let f, = e 277 '/" be a primitive nth root of unity, and define 

Z[£„] = {z £ C : z = ao + a\^ n + U2£,7 + ■ ■ ■ + <foi-i£ ; " \ all a,- e Z}. 

(When n — 4, then Z[^] is the Gaussian integers Z[i].) It is easy to check that Z[£„] is a 
subring of C (to prove that Z[£„] is closed under multiplication, note that if m > n, then 
m — qn + r , where 0 < r < n, and £" ! = ◄ 


Definition. A domain (often called an integral domain) is a commutative ring R that 
satisfies two extra axioms: first, 

1 # 0 ; 

second, the cancellation law for multiplication: For all a, b,c e R, 

if ca — cb and c/0, then a — fo. 

The familiar examples of commutative rings, Z, Q, R, and C, are domains; the zero ring 
is not a domain. 

1 The even integers do not form a subring of Z because 1 is not even. Their special structure will be recognized 
when ideals are introduced. 
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Proposition 3.5. A nonzero commutative ring R is a domain if and only if the product of 
any two nonzero elements of R is nonzero. 

Sketch of Proof, ab — ac if and only if a(b — c) — 0. • 


Proposition 3.6. The commutative ring I m is a domain if and only ifm is a prime. 

Proof. If m = ab, where 1 < a,b < m, then [a] f [0] and \b] ^ [0] in I m , yet 
[a][b] = [m] = [0]. 

Conversely, if m is prime and [a][b] — [ab] = [0], then m \ ab, and Euclid’s lemma 
gives m \ a or m \ b. • 


Example 3.7. 

(i) Let tF(R) be the set of all the functions M —» R equipped with the operations of point- 
wise addition and pointwise multiplication'. Given f. g e (F(R), define functions f + g 
and fg by 


f + g: a h* f(a) + g(a) and fg: a f(a)g{a) 

(notice that fg is not their composite). 

We claim that (F(R) with these operations is a commutative ring. Verification of the 
axioms is left to the reader with the following hint: The zero element in (F(R) is the 
constant function z with value 0 [that is, z(a) — 0 for all a e M] and the unit is the 
constant function e with e{a) — 1 for all a el. We now show that (F(R) is not a domain. 



Define / and g as drawn in Figure 3.1: 


f(a) = 


if a < 0 
if a > 0; 


g(a) = 


if a < 0 
if a > 0. 


Clearly, neither / nor g is zero (i.e., f f z and g f z). On the other hand, for each a e R, 
fg: a i-^ f(a)g(a) = 0, because at least one of the factors f(a) or g(a) is the number 
zero. Therefore, fg = z, by Proposition 1.43, and JT(R) is not a domain. 
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(ii) All differentiable functions /: R —> R form a subring of (F(R). The identity e is 
a constant function, hence is differentiable, while the sum and product of differentiable 
functions are also differentiable. Hence, the differentiable functions form a commutative 
ring. ◄ 

Many theorems of ordinary arithmetic, that is, properties of the commutative ring Z, 
hold in more generality. We now generalize some familiar definitions from Z to arbitrary 
commutative rings. 

Definition. Let a and b be elements of a commutative ring R. Then a divides b in R for 
a is a divisor of b or b is a multiple of a), denoted by a h, if there exists an element c e R 
with b — ca. 

As an extreme example, if 0 | a. then a = 0 ■ h tor some b e R. Since 0 ■ b = 0, 
however, we must have a = 0. Thus, 0 | a if and only if a = 0. 

Notice that whether a \ b depends not only on the elements a and b but on the ambient 
ring R as well. For example, 3 does divide 2 in Q, for 2 = 3 x |, and | e Q; on the other 
hand, 3 does not divide 2 in Z, because there is no integer c with 3c = 2. 

Definition. An element u in a commutative ring R is called a unit if u | 1 in R. that is, 
if there exists v e R with uv — 1; the element v is called the inverse of u and v is often 
denoted by it . 

Units are of interest because we can always divide by them: If a e R and u is a unit in 
R (so there is i ’ e R with uv = 1), then 


a — u{va) 

is a factorization of a in R. for va e R \ thus, it is reasonable to define the quotient a/u as 
va = u~ l a. 

Given elements a and h. whether a \ b depends not only on these elements but also 
on the ambient ring R \ similarly, whether an element u & R is a unit also depends on the 
ambient ring R (for it is a question whether u | 1 in R). For example, the number 2 is a 
unit in Q, for ^ lies in Q and 2x j = 1, but 2 is not a unit in Z, because there is no integer 
v with 2v — 1. In fact, the only units in Z are 1 and —1. 

Proposition 3.8. Let R be a domain, and let a, b e R be nonzero. Then a \ b and b \ a if 
and only ifb — ua for some unit u e R. 

Sketch of Proof. If b — ua and a — vb, then b — ua = uvb. • 

There exist examples of commutative rings in which Proposition 3.8 is false, and so the 
hypothesis that R be a domain is needed. 

What are the units in I m ? 
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Proposition 3.9. If a is an integer, then [a] is a unit in I„, if and only if a and m are 
relatively prime. In fact, ifsa + tm = 1, then [a]~' = [s]. 

Sketch of Proof, sa = 1 mod m if and only if sa + tm — 1 for some integer t. • 

Corollary 3.10. If p is a prime, then every nonzero [a\ in I p is a unit. 

Sketch of Proof. If 1 < a < p, then (a, p) = 1. • 

Definition. If R is a commutative ring, then the group of units of R is 

U ( R ) = {all units in R}. 

It is easy to check that U{R) is a multiplicative group. It follows that a unit u in R has 
exactly one inverse in R. for each element in a group has a unique inverse. 

There is an obvious difference between Q and Z: every nonzero element of Q is a unit. 

Definition. A field 5 F is a commutative ring in which 1 f 0 and every nonzero element 
a is a unit; that is, there is a~ [ e F with a~ l a — 1. 

The first examples of fields are Q, R, and C. 

The definition of field can be restated in terms of the group of units; a commutative ring 
R is a field if and only if U(R ) = R x , the nonzero elements of R. To say this another way, 
R is a field if and only if R x is a multiplicative group [note that U(R X ) f 0 because we 
are assuming that 1^0]. 

Proposition 3.11. Every field F is a domain. 

Sketch of Proof. If ab — ac and a 0, then b = a~ l (ab) = a~ 1 (ac) — c. • 

The converse of this proposition is false, for Z is a domain that is not a field. 

Proposition 3.12. The commutative ring II,„ is a field if and only ifm is prime. 

Sketch of Proof. Corollary 3.10. • 

In Theorem 3.127, we shall see that there are finite fields having exactly p n elements, 
whenever p is prime and n > 1; in Exercise 3.14 on page 125, we construct a field with 
four elements. 

Every subring of a domain is itself a domain. Since fields are domains, it follows that 
every subring of a field is a domain. The converse of this exercise is true, and it is much 
more interesting: Every domain is a subring of a field. 

-The derivation of the mathematical usage of the English term field (first used by E. H. Moore in 1893 in 
his article classifying the finite fields) as well as the German term Korper and the French term corps is probably 
similar to the derivation of the words group and ring: Each word denotes a "realm” or a “collection of things.” The 
word domain abbreviates the usual English translation integral domain of the German word Integretdtsbereich, a 
collection of integers. 
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Given four elements a, b, c, and d in a field F with h f 0 and d ^ 0, assume that 
ab~ l — cd~ l . Multiply both sides by bd to obtain ad — be. In other words, were 
ab~ l written as a/b , then we have just shown that a/b = c/d implies ad — bc\ that is, 
“cross-multiplication” is valid. Conversely, if ad — be and both b and d are nonzero, then 
multiplication by b~ l d~ l gives ab~ l — cd~ l , that is, a/b — c/d. 

The proof of the next theorem is a straightforward generalization of the usual construc¬ 
tion of the field of rational numbers Q from the domain of integers Z. 

Theorem 3.13. If R is a domain, then there is a field F containing R as a subring. 
Moreover, F can be chosen so that, for each f e F, there are a, b e R with b 0 and 

f = ab~ l . 

Sketch of Proof. Let X = {(a, b) e RxR : h f ()}, and define a relation = on X by 
(a, b) = (c, d) if ad — be. We claim that = is an equivalence relation. Verifications of 
reflexivity and symmetry are straightforward; here is the proof of transitivity. If (a, b) = 
(c, d) and (c, d) = ( e, /), then ad — be and cf — de. But ad — be gives adf — b(cf) = 
bde. Canceling d, which is nonzero, gives af — be\ that is, (a, b) = (e, /). 

Denote the equivalence class of (a, b) by [a, b], define F as the set of all equivalence 
classes [a, b], and equip F with the following addition and multiplication (if we pretend 
that [a, b] is the fraction a/b, then these are just the usual formulas): 

[a, b] + [c, d] — [ad + be, bd] 


and 


[a, b][c, d] — [ac , bd]. 


First, since b 0 and d f 0, we have bd f 0, because R is a domain, and so the 
formulas make sense. Let us show that addition is well-defined. If [a, b] = [a f , If] (that is, 
ab' — a'b) and [c, d] — [c', d'] (that is, cd’ — c'd), then we must show that [ad+bc, bd] = 
[a'd' + b'c', b'd']. But this is true: 


(ad + bc)b'd f — ah' dd' + bb' cd' = a'bdd' + bb'c'd — (a'd' + b'c')bd. 


A similar argument shows that multiplication is well-defined. 

The verification that F is a commutative ring is now routine: The zero element is [0, 1], 
the one is [1, 1], and the additive inverse of [a, b] is [—a, b]. It is easy to see that the family 
R' = {[a, 1] : a e R] is a subring of F, and we identify a e R with [a, 1] e R'. 

To see that F is a field, observe that if [a,b] f [0, 1], then a f 0, and the inverse of 
[a, b] is [b, a]. 

Finally, if b ^ 0, then [1 , b] = [b, l] -1 , and so [a,b] = [a, 1 ][b, l] -1 . • 


Definition. The field F constructed from R in Theorem 3.13 is called the fraction field 
of R: we denote it by FraciW), and we denote [a, b] e Frac( R) by a/b\ in particular, the 
elements [a, 1] of R' are denoted by a/1 or, more simply, by a. 

Notice that the fraction field of Z is Q; that is, Frac(Z) = Q. 
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Definition. A subfield of a field K is a subring k of K that is also a field. 

It is easy to see that a subset k of a field K is a subfield if and only if k is a subring that 
is closed under inverses; that is, if a e k and a / 0, then a~ l e k. It is also routine to see 
that any intersection of subfields of K is itself a subfield of K (note that the intersection is 
not equal to {0} because 1 lies in every subfield). 

Exercises 

3.1 Prove that a commutative ring R has a unique 1. 

3.2 (i) Prove that subtraction in Z is not an associative operation. 

(ii) Give an example of a commutative ring R in which subtraction is associative. 

3.3 (i) If R is a domain and a e R satisfies a~ = a, prove that either a = 0 or a = 1. 

(ii) Show that the commutative ring JF(R) in Example 3.7 contains infinitely many elements 
/-<>• 1 with /'- - /'. 

3.4 (i) If A is a set, prove that the Boolean group B(X) in Example 2.18 with elements the 

subsets of X and with addition given by 

u + V = (U - V) U (V - U), 

where U — V = [x e U : x $ V }, is a commutative ring if one defines multiplication 

uv = unv. 

We call B(X) a Boolean ring. 

Hint. You may use some standard facts of set theory: the distributive law: 
U fl (V U VP) = ({/ PI V) U (U n W ); if V' denotes the complement of V, then 
u - V = U n V'\ and the De Morgan law: ( U n V)' = U' U V'. 

(ii) Prove that B(X) contains exactly one unit. 

(iii) If Y is a proper subset of X (that is, Y C X), show that the unit in B(Y) is distinct from 
the unit in B(X). Conclude that B(Y) is not a subring of B(X). 

3.5 Show that U{I m ) = {[it] e l m :{k,m)= 1). 

3.6 Find all the units in the commutative ring JT(R) defined in Example 3.7. 

3.7 Generalize the construction of JF(R) to arbitrary commutative rings R: Let J-(R) be the set of 
all functions from R to R, with pointwise addition, / + g: r f(r) + g(r), and pointwise 
multiplication, fg:r f(r)g(r) forr e R. 

(i) Show that iF(R) is a commutative ring. 

(ii) Show that T{R) is not a domain. 

(iii) Show that Ttji) h as exactly four elements, and that / + / = 0 for every / e T-XR)- 

3.8 (i) If R is a domain and 5 is a subring of R, then 5 is a domain. 

(ii) Prove that C is a domain, and conclude that the ring of Gaussian integers is a domain. 

3.9 Prove that the only subring of Z is Z itself. 

Hint. Every subring R of Z contains 1. 
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3.10 (i) Prove that R = (a + b\fl : a, b e Z) is a domain. 

(ii) Prove that R = {^ (a + bs/2) : a, b e Z) is not a domain. 

(iii) Using the fact that a — ^ (1 + V—19) is a root of x 2 — x + 5, prove that R - 
a , b e Z) is a domain. 

3.11 Prove that the set of all C 00 -functions is a subring of JF(R). (A function /: R 
C°° -function if it has an nth derivative /(”) for every n > 1.) 

Hint. Use the Leibniz rule (see Exercise 1.6 on page 12). 

3.12 (i) If R is a commutative ring, define the circle operation a o b by 

aob = a + b — ab. 

Prove that the circle operation is associative and that 0 o a = a for all a e R. 

(ii) Prove that a commutative ring R is a field if and only if [r e R : r ^ 1} is an abelian 

group under the circle operation. 

Hint. If a ^ 0, then a + 1/1. 

3.13 Find the inverses of the nonzero elements of Ij \. 

3.14 ( R. A. Dean) Define F 4 to be all 2 x 2 matrices of the form 

a b 
b a + b 

where a, b e lj. 

(i) Prove that F 4 is a commutative ring under the usual matrix operations of addition and 
multiplication. 

(ii) Prove that F 4 is a field with exactly four elements. 

3.15 Prove that every domain R with a finite number of elements must be a field. (Using Proposi¬ 
tion 3.6, this gives a new proof of sufficiency in Proposition 3.12.) 

Hint. If R x denotes the set of nonzero elements of R, prove that multiplication by r is an 
injection R x -> R x , where r e R x . 

3.16 Show that F = {a + bsfl : a, b e Q) is a field. 

3.17 (i) Show that F — [a + bi : a, b e Q) is a field. 

(ii) Show that F is the fraction field of the Gaussian integers. 

3.18 If R is a commutative ring, define a relation = on R by a = b if there is a unit u e R with 
b — ua. Prove that if a = b, then (a) = ( b ), where (a) = {ra : r e R). Conversely, prove 
that if R is a domain, then (a) — (b) implies a =b. 

3.19 (i) For any field k, prove that stochastic group E(2, k), the set of all nonsingular 2x2 ma¬ 

trices with entries in k whose column sums are 1 , is a group under matrix multiplication. 

(ii) Define the affine group Aff(l, k ) to be the set of all /: k -*■ k of the form f(x) — 
ax + b , where a.b e k and a 7 ^ 0. Prove that E(2, k) = Afffl, k). (See Exercise 2.46 
on page 80.) 

(iii) If k is a finite field with q elements, prove that |E(2, &)| = q(q — 1). 

(iv) Prove that E(2, 13 ) = S 3 . 
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3.3 Polynomials 

Even though the reader is familiar with polynomials, we now introduce them carefully. The 
key observation is that one should pay attention to where the coefficients of polynomials 
live. 

Definition. If R is a commutative ring, then a sequence o in R is 

(7 = (S 0 , St, 52,- Si, ...); 

the entries ,v; e R, for all i > 0, are called the coefficients of o. 

To determine when two sequences are equal, let us recognize that a sequence o is really 
a function o : N —> R, where N is the set of natural numbers, with o(i) — si for all i > 0. 
Thus, if r = (to, h, t 2 , ■ ■ ■, ti ,...) is a sequence, then a — x if and only if o(i) — x(i) 
for all i > 0; that is, o — r if and only if .v/ = f, for all i > 0. 

Definition. A sequence a — (so, si,..., s;,...) in a commutative ring R is called a 
polynomial if there is some integer m > 0 with s,- = 0 for all i > nr, that is, 

or = (so, St, ... , S m , 0, 0, . . . ). 

A polynomial has only finitely many nonzero coefficients. The zero polynomial, 
denoted by a — 0, is the sequence a = (0, 0, 0 ,...). 

Definition. If a — (so, si,..., s„, 0, 0,...) f 0 is a polynomial, then there is s„ f 0 
with Si = 0 for all i > n. We call s„ the leading coefficient of <r, we call n the degree of 
a, and we denote the degree n by deg(er). 

The zero polynomial 0 does not have a degree because it has no nonzero coefficients. 
Some authors define deg(0) = —oo, and this is sometimes convenient, for —oo < n for 
every integer n. On the other hand, we choose not to assign a degree to 0 because it is often 
a genuinely different case that must be dealt with separately. 

Notation. If R is a commutative ring, then the set of all polynomials with coefficients in 
R is denoted by R[x], 

Proposition 3.14. If R is a commutative ring, then /x’fx] is a commutative ring that 
contains R as a subring. 

Sketch of Proof. Define addition and multiplication of polynomials as follows: If a = 
(so,si ,...) and r = (to, t\, ...), then 

o' + t — (sq + tQ, si + ft, ..., s n + t n ,...) 

and 

err = (c 0 , C\,C 2 , ...), 



Sec. 3.3 Polynomials 


127 


where Ck = Yli+j=k s i t j = Xj=o s i t k-i- Verification of the axioms in the definition of 
commutative ring is routine. The subset {(r, 0, 0,...) : r e R} is a subring of R[x] that we 
identify with R. • 

Lemma 3.15. Let R be a commutative ring and let er, r e R[x] be nonzero polynomials. 

(i) Either or = 0 or deg(crr) < deg(er) + deg(r). 

(ii) If R is a domain, then or / 0 and 

deg(crr) = deg (cr) + deg(r). 

(iii) If R is a domain, then /?[x] is a domain. 

Sketch of Proof. Let a = (so, si,...) and r = (to, t\ , . ..) have degrees m and n, respec¬ 
tively. 

(i) If k > m + n, then each term in sfk-i is 0 (for either s,- = 0 or t^-i — 0). 

(ii) Each term in JV Sjt m +„-j is 0, with the possible exception of s m t n . Since R is a 
domain, s m f 0 and t n ^ 0 imply s m t n ^ 0. 

(iii) This follows from part (ii) because the product of two nonzero polynomials is now 
nonzero. • 


Definition. If R is a commutative ring, then R[x] is called the ring of polynomials 
over R. 

Here is the link between this discussion and the usual notation. 

Definition. Define the element x e /?[x] by 

x = (0, 1,0,0,...). 


Lemma 3.16. 

(i) If a = (so. Si, ... ), then 

xo — (0, so, si,...); 

that is, multiplying by x shifts each coefficient one step to the right. 

(ii) If n > 1, then x” is the polynomial having 0 everywhere except for 1 in the nth 
coordinate. 

(iii) If r e R, then 

(r , 0, 0, ... )(so, si- ,Sj,...) = (rs 0 , rs 1; - rsj,...). 

Sketch of Proof. Each is a routine computation using the definition of polynomial multi¬ 
plication. • 
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If we identify (r, 0, 0,...) with r. then Lemma 3.16(iii) reads 

r(s 0 , si, = (rs 0 , rs i,..., rs t ,...). 

We can now recapture the usual notation. 

Proposition 3.17. If o = (so, si,..., s n , 0, 0,...), then 

2 n 

O — So + Six + S 2 X~ + • • • + S n X , 

where each element s e R is identified with the polynomial (s, 0, 0, ... ). 

Proof 

o — (so, si,...,s„,0,0,...) 

= (so, 0, 0, ...) + (0, si, 0, ...) + ••• + (0, 0,..., s n , 0,...) 

= s 0 (l, 0, 0,...) + si(0,1, 0,...) + • • • + s„(0, 0,..., 1,0,...) 
= so + Six + S 2 X 2 H-|-s„x". • 


We shall use this familiar (and standard) notation from now on. As is customary, we 
shall write 


f(x) — So + SIX + S 2 X 2 H -h s„x" 


instead of a = (so, si,..., s„, 0, 0,...). 

Here is some standard vocabulary associated with polynomials. If fix) = so + six + 
S 2 X 2 + • • • + s„x" , where s„ f 0, then so is called its constant term and, as we have already 
said, s„ is called its leading coefficient. If its leading coefficient s„ = 1, then fix) is called 
monic. Every polynomial other than the zero polynomial 0 (having all coefficients 0) has 
a degree. A constant polynomial is either the zero polynomial or a polynomial of degree 
0. Polynomials of degree 1, namely, a + bx with h f 0, are called linear , polynomials of 
degree 2 are quadratic , 6 degree 3’s are cubic, then quartics , quintics. and so on. 


Corollary 3.18. Polynomials fix) = so + six + S 2 X 2 + • • • + s„x" and g(x) = fo + 
fix + f 2 X 2 + • • • + t m x m of degrees n and m, respectively, are equal if and only ifn — m 
and si = ti for all i. 

Proof This is merely a restatement of the definition of equality of sequences, rephrased 
in the usual notation for polynomials. • 

^’Quadratic polynomials are so called because the particular quadratic x~ gives the area of a square ( quadratic 
comes from the Latin word meaning “four,” which is to remind us of the four-sided figure); similarly, cubic 
polynomials are so called because x ^ gives the volume of a cube. Linear polynomials are so called because the 
graph of a linear polynomial in R[s] is a line. 
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We can now describe the usual role of x in / (x) as a variable. If R is a commutative ring, 
each polynomial fix) — .vo+.vi v+.s'2-v 2 + - • -+s n x n e f?[x] defines a polynomial function 
/: R —> R by evaluation: If a e R, define f(a) — so + + S 2 a 2 + • —b s n a n e R. The 
reader should realize that polynomials and polynomial functions are distinct objects. For 
example, if R is a finite ring (e.g., R — I m ), then there are only finitely many functions from 
R to itself, and so there are only finitely many polynomial functions. On the other hand, 
there are infinitely many polynomials: for example, all the powers 1, x, x 2 ,..., x n ,... are 
distinct, by Corollary 3.18. 

Definition. Let k be a field. The fraction field of k[x], denoted by k(x), is called the field 
of rational functions over k. 


Proposition 3.19. If k is afield, then the elements of k(x) have the form f(x)/g(x), 
where fix), g(x ) G k[x] and g(x) f 0. 

Sketch of Proof. Theorem 3.13. • 


Proposition 3.20. If p is a prime, then the field of rational functions l p (x ) is an infinite 
field containing I p as a subfield? 

Proof. By Lemma 3.15(iii), I p [x] is an infinite domain, for the powers x", for n e N, 
are distinct. Thus, its fraction field, I p (x), is an infinite field containing I ;) [x] as a subring. 
But I p [x] contains l p as a subring, by Proposition 3.14. • 

In spite of the difference between polynomials and polynomial functions (we shall see, 
in Corollary 3.28, that these objects coincide when the coefficient ring R is an infinite 
field), R[x) is often called the ring of all polynomials over R in one variable. If we write 
A = f?[x], then the polynomial ring A [y] is called the ring of all polynomials over R 
in two variables x and y, and it is denoted by R[x, y]. For example, the quadratic poly¬ 
nomial ax 2 + bxy + cy 2 + dx + ey + f can be written cy 2 + (bx + e)y + {ax 2 + 
dx + /), a polynomial in y with coefficients in R[x], By induction, we can form the 
commutative ring R[xi,X 2 , ..., x„] of all polynomials in n variables with coefficients 
in R. Lemma 3.15(iii) can now be generalized, by induction on n, to say that if R is 
a domain, then so is /?[xi, X 2 ,..., x n ]. Moreover, when k is a field, we can describe 
Frac(£[xi, X 2 ...., x n ]) as all rational functions in n variables', its elements have the form 
f(x i, x' 2 , ..., x n )/g(x i, x' 2 , ..., x„), where / and g lie in k[x\, X 2 , ■ ■ ■, x„]. 

Exercises 

3.20 Show that if R is a commutative ring, then f?[x] is never a field. 

Hint. If x -1 exists, what is its degree? 

7 In the future, we will denote Ip by when it is to be viewed as a field. 
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3.21 (i) If R is a domain, show that if a polynomial in /J[x] is a unit, then it is a nonzero constant 

(the converse is true if R is a field). 

Hint. Compute degrees. 

(ii) Show that (2x + l) 2 = 1 in Iqfx]. Conclude that the hypothesis in part (i) that R be a 
domain is necessary. 

3.22 Show that the polynomial function defined by fix ) = x p — .re I p [x] is identically zero. 
Hint. Use Fermat's theorem. 

3.23 If R is a commutative ring and /(r) = Yli'Lo s i xI e has degree n > 1, define its 
derivative fix) e R\x j by 

f'ix) = si + 2 sjx + 3s$x 2 + • • • + ns n x n ~ l \ 

if fix ) is a constant polynomial, define its derivative to be the zero polynomial. Prove that the 
usual rules of calculus hold: 

(/ + *)' = f + g'\ 

0 rfY = rif ) if ref?; 
ifg)' = fg' + f'g\ 

if")' =nf n - 1 f for all n > 1. 

3.24 Let R be a commutative ring and let fix) e f?[r]. 

(i) Prove that if (r — a) 2 \ fix), then x — a \ f'ix) in f?[r]. 

(ii) Prove that if x — a \ fix) and x — a \ f'ix), then (r — a) 2 \ fix). 

3.25 (i) If f(x) = ax 2p + bx p + c e Ip[r], prove that f'ix) — 0. 

(ii) Prove that a polynomial fix) e Ip[r] has f'ix) = 0 if and only if there is a polynomial 
gix) = I>„r" with fix) = gixPf, that is, fix) = J2 a nX np e l p [x p ). 

3.26 If R is a commutative ring, define f?[[r]] to be the set of all sequences (sq. H• •) with s, e R 
for all i (we do not assume here that s, = 0 for large i). 

(i) Show that the formulas defining addition and multiplication on f?[r] make sense for 
f?[[r]], and prove that f?[[r]] is a commutative ring under these operations (/?[[*]] is 
called the ring of formal power series over R.) 

(ii) Prove that f?[v] is a subring of f?[[x]]. 

(iii) Prove that if R is a domain, then f?[[x]] is a domain. 

Hint. If a = (sq, si, • • •) e f?[[x]] is nonzero, define the order of a, denoted by 
ord(cr), to be the smallest n > 0 for which s n 0. If R is a domain and a, re f?[[x]] 
are nonzero, prove that ord(crr) = ord(cr) + ord(r) 0, and hence ax 0. 

3.27 (i) Denote a formal power series a — (sq, , S 2 , ■ ■ ■, s tI , ...) by 

o 

a = sq + s\x + S 2 X~ + • • • . 

Prove that if a — 1 +x +x 2 + ■ ■ ■, then a = 1/(1 — x) in f?[[x]]; that is, (1 — x)a — 1. 

(ii) Prove that if k is a field, then a formal power series a e A [[x]] is a unit if and only if its 
constant term is nonzero; that is, ord(cr) = 0. 

(iii) Prove that if a e fc[[x]] and ord(ff) = n, then 

a — x n u. 


where u is a unit in fc[[x]]. 
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3.4 Greatest Common Divisors 


We are now going to see that, when k is a field, virtually all the familiar theorems proved 
for Z have polynomial analogs in k[x ]; moreover, we shall see that the familiar proofs can 
be translated into proofs here. 

The division algorithm for polynomials with coefficients in a field says that long division 
is possible. 


Theorem 3.21 (Division Algorithm). Assume that k is a field and that fix), g(x) e 
k[x] with fix) 0. Then there are unique polynomials q (x), r(x) e k[x) with 


g(x ) = q(x)f(x) + r(x) 


and either r(x) — 0 or deg(r) < deg(/). 

Proof. We first prove the existence of such q and r. If f \ g. then g — qf for some 
q\ define the remainder r — 0, and we are done. If / \ g, then consider all (necessarily 
nonzero) polynomials of the form g — q f as q varies over k [x ]. The least integer axiom 
provides a polynomial r — g — q f having least degree among all such polynomials. Since 
g — qf + r, it suffices to show that deg(r) < deg(/). Write fix) — s n x n H— • + ,v i x + so 
and r(x) = t m x m + ■ ■ ■ + t\x + to. Now s„ 0 implies that s n is a unit, because A: is a 
field, and so s„ exists in k. If deg(r) > deg(/), define 

h(x) = r(x) - t m s~ l x m ~ n fix); 


that is, if LT(/) = s, ,x", where LT abbreviates leading term , then 


h = r — 


LT(r) 

LT(/) 


note that /z = 0 or deg(/z) < deg(r). If h — 0, then r = [LT(r)/LT(/)]/ and 


g = qf + r 


= qf + 



LT(r) 
LT(/) 
LT(r) ' 
LT (/). 


contradicting f \ g. If h ^0, then deg(7z) < deg(r) and 


g~qf = r = h + 


LT(r) 
LT if) 


Thus, g — [q + LT(r)/LT(/)]/ = /z, contradicting r being a polynomial of least degree 
having this form. Therefore, deg(r) < deg(/). 
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To prove uniqueness of qix) and rix), assume that g — q' f + r', where deg(r') < 
deg(/). Then 

(q - q')f = r'~ r. 

If r' ^ r, then each side has a degree. But deg((g — q')f) = deg (q — q') + deg(/) > 
deg(/), while deg(r' — r) < max{deg(r'). deg(r)} < deg if), a contradiction. Hence, 
r' = r and (q — q')f = 0. As A:[x] is a domain and / ^ 0, it follows that q — q' = 0 and 
q=q'. • 


Definition. If fix) and g(x) are polynomials in k[x], where k is a field, then the poly¬ 
nomials qix) and r(x) occurring in the division algorithm are called the quotient and the 
remainder after dividing gix) by / (x). 

The hypothesis that A: is a field is much too strong; long division can be carried out in 
K[x] for every commutative ring R as long as the leading coefficient of fix) is a unit in 
R\ in particular, long division is always possible when fix) is a monic polynomial. 

Corollary 3.22. Let R be a commutative ring, and let fix) € /?[x] be a monic polyno¬ 
mial. Ifgix) e then there exist qix), rix) e /?[x] with 

gix) = qix) fix) + rix), 

where either rix) = 0 or deg(r) < deg(/). 

Sketch of Proof. The proof of the division algorithm can be repeated here, once we ob¬ 
serve that LT(r)/LT(/) e R because fix) is monic. • 

We now turn our attention to roots of polynomials. 

Definition. If fix) e k[x ], where A: is a field, then a root of fix) in k is an element 
a e k with fia) — 0. 


Remark. The polynomial fix) = x 2 — 2 has its coefficients in Q, but we usually say 
that y/2 is a root of fix) even though \[2 is irrational; that is, f Q. We shall see later, 
in Theorem 3.123, that for every polynomial fix) e k[x\, where k is any field, there is 
a larger field E that contains k as a subfield and that contains all the roots of fix). For 
example, x 2 — 2 e I 3 [jr] has no root in I 3 , but we shall see that a version of *J2 does exist 
in some (finite) field containing I 3 . -4 

We will use the following elementary exercise in the proof of the next lemma. If 
fix), gix) e R[x), where R is a commutative ring, write 

aix) = fix) + gix) and mix) = /(x)g(x); 
evaluating at u e R gives aiu) — fiu) + giu) and miu) — fiu)giu). 
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Lemma 3.23. Let fix) e k[x], where k is a field, and let u e k. Then there is qix) e k[x] 
with 

fix) = q(x)(x -u) + fiu). 

Proof. The division algorithm gives 

fix) — qix)ix — u) + r; 

the remainder r is a constant because x — u has degree 1. Now evaluate: 

fiu) = qiu)iu — u) + r. 


and so r = fiu). • 

There is a connection between roots and factoring. 

Proposition 3.24. If fix) e k[x\ where k is afield, then a is a root of fix) in k if and 
only ifx — a divides fix) in k[x\ 

Proof. If a is aroot of fix) in k , then /(a) = 0 and the lemma gives fix) — qix)ix—a). 
Conversely, if fix) — g(x)(x — a), then evaluating at a gives fia) — gia)ia—a) = 0. • 


Theorem 3.25. Let k be afield and let fix) e k[x\ If fix) has degree n, then fix) has 
at most n roots in k. 

Proof. We prove the statement by induction on n > 0. If n — 0, then fix) is a nonzero 
constant, and so the number of its roots in k is zero. Now let n > 0. If fix) has no roots 
in k , then we are done, for 0 < n. Otherwise, we may assume that there is a e k with a a 
root of fix); hence, by Proposition 3.24, 

fix) = qix)ix - a); 

moreover, qix) e k[x] has degree n — 1. If there is a root b e k with b f a , then 

0 = fib) = qib)ib - a). 

Since b — a ^ 0, we have qib) — 0 (because A; is a field, hence is a domain), so that b is a 
root of qix). Now deg )q) = n — 1, so that the inductive hypothesis says that qix) has at 
most n — 1 roots in k. Therefore, fix) has at most n roots in C • 


Example 3.26. 

Theorem 3.25 is not true for polynomials with coefficients in an arbitrary commutative 
ring R. For example, if R = Ig, then the quadratic polynomial r - 1 e Jig[v ] has 4 roots: 


[1], [3], [5], and [7]. « 
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Corollary 3.27. Every nth wot of unity in C is equal to 


flnik/n = CQ& 


(“) 


— I + i sin 


/ 2 nkf 


where k = 0, 1 , 2, ..., n — 1 . 


Proof. We have seen, in Corollary 1.35, that each of the n different complex numbers 
finikin j s an root G f un ity; that is, each is a root of x" — 1. By Theorem 3.25, there can 
be no other complex roots. • 

Recall that every polynomial fix) e k[x] determines the polynomial function k -> k 
that sends a into f(a) for all a e k. In Exercise 3.22 on page 130, however, we saw that a 
nonzero polynomial in I p [x] (e.g., x p — x) can determine the constant function zero. This 
pathology vanishes when the field k is infinite. 


Corollary 3.28. Let k be an infinite field and let fix) and g(x) be polynomials in k[x\ If 
f(x) and g(x) determine the same polynomial function [i.e., if f (a) — g{a) for all a e k], 
then f(x) = g(x). 

Proof If fix) g(x), then the polynomial hix) = fix) — gix) is nonzero, so that it 
has some degree, say, n. Now every element of k is a root of h (x); since k is infinite, h(x) 
has more than n roots, and this contradicts the theorem. • 

This proof yields a more general result. 

Corollary 3.29. Let k be any field, perhaps finite. If fix), gix) e k\x\ if deg(/) < 
deg(g) < n, and if fia) — gia) forn + 1 elements a e k, then fix) — gix). 

Sketch of Proof. If f g, then deg (/ — g) is defined and deg(/ — g) < n. • 

Here is another nice application of Theorem 3.25. 

Theorem 3.30. Ifk is a field and G is a finite subgroup of the multiplicative group k x , 
then G is cyclic. In particular, ifk itself is finite (e.g., k — l p ), then k x is cyclic. 

Proof. Let d be a divisor of | G \. If there are two subgroups of G of order d, say, S and 
T, then S U 7’ > d. But each a e S U 7’ satisfies a d = 1, by Lagrange’s theorem, and 
hence it is a root of x d — 1. This contradicts Theorem 3.25, for this polynomial now has 
too many roots in k. Thus, G is cyclic, by Theorem 2.86. • 


Definition. If & is a finite field, a generator of the cyclic group k x is called a primitive 
element of k. 

Although the multiplicative groups I* are cyclic, no explicit formula giving a primitive 
element of each of them is known. For example, finding a primitive element of F 257 essen¬ 
tially involves checking the powers of each [i], where 1 < i < 257, until one is found for 
which i m f 1 mod 257 for all positive integers m < 256. 

The definition of a greatest common divisor of polynomials is essentially the same as 
the corresponding definition for integers. 
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Definition. If fix) and g(x) are polynomials in k[x], where k is a field, then a common 
divisor is a polynomial c(x) e £[*] with c(x) \ fix) and c(x) | gix). If fix) and gix) in 
k[x] are not both 0, define their greatest common divisor , abbreviated gcd, to be the monic 
common divisor having largest degree. If fix) = 0 = gix), define their gcd = 0. The 
gcd of fix) and gix) [which is uniquely determined by fix) and gix)] is often denoted 

by if, g)- 

Theorem 3.31. If k is a field and fix), gix) e k[x], then their gcd d(x) is a linear 
combination of fix) and g(x); that is, there are six), t ix) e A;[x] with 

dix) = six) fix) +tix)gix). 

Sketch of Proof. This proof is very similar to the corresponding result in Z; indeed, once 
we introduce principal ideal domains, we will prove this theorem and its analog in Z si¬ 
multaneously (see Theorem 3.57). • 

Corollary 3.32. Let k be a field and let fix), gix) e k[x], A monic common divisor 
dix) is the gcd if and only if dix) is divisible by every common divisor, that is, ifcix) is a 
common divisor, then cix) \ dix). 

Moreover, fix) and g(x) have a unique gcd. 

Sketch of Proof. Analogous to the proof of Proposition 1.8. • 

Every polynomial fix) is divisible by u and by af ix), where u is a unit. The analog of 
a prime number is a polynomial having only divisors of these trivial sorts. 

Definition. An element p in a domain R is irreducible if p is neither 0 nor a unit and, in 
any factorization p — u v in R, either u or v is a unit. Elements a, b e R are associates if 
there is a unit u e R with b — ua. 

For example, a prime p e Z is an irreducible element, as is —p. We now describe 
irreducible polynomials pix) e k[x], when A: is a field. 

Proposition 3.33. Ifk is afield, then a polynomial pix) e k[x] is irreducible if and only 
ifdcgip) — n > 1 and there is no factorization in £[x] of the form pix) — gix)hfx) in 
which both factors have degree smaller than n. 

Proof. We show first that hix) e £[x] is a unit if and only if deg(/z) = 0. If hix)ufx) = 1, 
then deg(/z) + deg(n) = deg(l) = 0; since degrees are nonnegative, we have deg(/z) = 0. 
Conversely, if deg(/z) = 0, then hix) is a nonzero constant; that is, h e k\ since A: is a field, 
h has an inverse. 

If pix) is irreducible, then its only factorizations are of the form pix) = g(x)h(x), 
where gix) or hix) is a unit; that is, where either deg(g) = 0 or deg(7z) = 0. Therefore, 
pix) has no factorization in which both factors have smaller degree. 

Conversely, if pix) is not irreducible, then it has a factorization pix) = gix)hix) in 
which neither gix) nor h(x) is a unit; that is, neither g(x) nor h(x) has degree 0. Therefore, 
pix) has a factorization as a product of polynomials of smaller degree. • 
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If k is not a field, however, then this characterization of irreducible polynomials no 
longer holds. For example, 2x + 2 = 2(x + 1) is not irreducible in Z[x], even though, 
in any factorization, one factor has degree 0 and the other degree 1 (when k is a field, the 
units are the nonzero constants, but this is no longer true for more general coefficients). 

As the definition of divisibility depends on the ambient ring, so irreducibility of a poly¬ 
nomial p(x ) e k\x] also depends on the commutative ring k[x ] and hence on the field 
k. For example, p(x ) = x 2 + 1 is irreducible in R[x], but it factors as (x + i)(x — i) in 
C[x], On the other hand, a linear polynomial /(x) is always irreducible [if / = gh. then 
1 = deg(/) = deg(g) + deg(/z), and so one of g or h must have degree 0 while the other 
has degree 1 = deg(/)]. 

Corollary 3.34. Let k be afield and let f(x) e A;[x] be a quadratic or cubic polynomial. 
Then fix) is irreducible in &[x] if and only if f(x) does not have a root in k. 

Sketch of Proof If fix) = g(x)h(x) and neither g nor h is constant, then deg(/) = 
deg(g) + deg(/z) implies that at least one of the factors has degree 1. • 

It is easy to see that Corollary 3.34 can be false if deg(/) > 4. For example, consider 
f{x) — x 4 + 2x 2 + 1 = (x 2 + l) 2 in R[x]. 

Example 3.35. 

(i) We determine the irreducible polynomials in l 2 [x] of small degree. 

As always, the linear polynomials x and x + 1 are irreducible. 

There are four quadratics: x 2 ; x 2 + x; x 2 + 1; x 2 + x + 1 (more generally, there 
are p n monic polynomials of degree n in I p [x], for there are p choices for each of the n 
coefficients ao,..., a„-i). Since each of the first three has a root in I 2 , there is only one 
irreducible quadratic. 

There are eight cubics, of which four are reducible because their constant term is 0. The 
remaining polynomials are 

x 3 + 1; x 3 + x + 1; x 3 + x 2 + 1; x 3 +x 2 + x+l. 

Since 1 is a root of the first and fourth, the middle two are the only irreducible cubics. 

There are 16 quartics, of which eight are reducible because their constant term is 0. Of 
the eight with nonzero constant term, those having an even number of nonzero coefficients 
have 1 as a root. There are now only four surviving polynomials fix), and each of them 
has no roots in I 2 ; i.e., they have no linear factors. If fix) — gix)hix), then both g(x) and 
/z(x) must be irreducible quadratics. But there is only one irreducible quadratic, namely, 
x 2 + x + 1, and so (x 2 + x + l) 2 = x 4 + x 2 + 1 is reducible while the other three quartics 
are irreducible. The following list summarizes these observations. 

Irreducible Polynomials of Low Degree over I 2 

degree 2: x 2 + x+l. 

degree 3: x 3 +x+l; x 3 +x 2 +l. 

degree 4: x 4 + x 3 + l; x 4 +x + l; x 4 + x 3 + x 2 + x + 1. 
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(ii) Here is a list of the monic irreducible quadratics and cubics in II 3 [a']. The reader can 
verify that the list is correct by first enumerating all such polynomials; there are 6 monic 
quadratics having nonzero constant term, and there are 18 monic cubics having nonzero 
constant term. It must then be checked which of these have 1 or — 1 as a root (it is more 
convenient to write —1 instead of 2 ). 


degree 2 : 
degree 3; 


Monic Irreducible Quadratics and Cubics over I 3 


v 2 + 1; 


x 2 + x 


1 ; 


1 . 


t 3 

x 3 — x 2 + x + 1; 

X 3 + X 2 + X - 1; 


x 3 +x 2 - x + 1; 
x 3 — x — 1; 
x 3 — x 2 — x — 1. 


c 3 — x 2 
x 3 + x 2 — 1; 


It is easy to see that if p(x) and q(x) are irreducible polynomials, then pix) \ q(x) if 
and only if there is a unit u with q(x) = up(x). If, in addition, both p(x) and q(x) are 
monic, then p(x) \ q(x) implies p(x) — q(x). 

Lemma 3.36. Let k be afield, let p(x), fix) e k[x], and let d(x) — ip, f) be their gcd. 
If pix) is a monic irreducible polynomial, then 

dU)= ( 1 

l pix) if pix) | fix). 

Sketch of Proof. Since d{x) \ pix), we have d(x) — 1 or d(x) — pix). • 


Theorem 3.37 (Euclid’s Lemma). Let k be afield and let fix), g(x) e k[x\ If pix) 
is an irreducible polynomial in k[x\ and pix) \ f ix)glx), then either 

Pix) I fix) or pix) | gix). 

More generally, if pix) \ /i(x) • • • f n (x), then pix) | fix) for some i. 

Sketch of Proof. Assume that p \ fg but that p \ f. Since p is irreducible, ip, f) — 1, 
and so 1 = sp + tf for some polynomials ,v and t. Therefore, 

g = spg + tfg. 

But p | fg, by hypothesis, and so p \ g. • 


Definition. Two polynomials fix), gix) e k[x ], where A: is a field, are called relatively 
prime if their gcd is 1 . 
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Corollary 3.38. Let fix), gix), h(x) e k\x\ where k is a field, and let h(x ) and fix) 
be relatively prime. Ifh(x) | f(x)g(x), then h(x) | gix). 

Sketch of Proof. The proof of Euclid’s lemma also works here: Since (h, f) = 1, we 
have 1 — sh + tf, and so g = shg + if g- • 


Definition. If k is a field, then a rational function fix) /gix) e k(x) is in lowest terms if 
fix) and g(x) are relatively prime. 


Proposition 3.39. If k is afield, every nonzero f (x ) / g (x ) e k(x) can be put in lowest 
terms. 

Sketch of Proof. If f — df and g — dg', where d — if, g), then /' and g' are relatively 
prime, and so f'/g' is in lowest terms. • 

The next result allows us to compute gcds. 

Theorem 3.40 (Euclidean Algorithm). Ifk is afield and fix), gix) e k[x], then there 
are algorithms for computing the gcd if, g), as well as for finding a pair of polynomials 
six) and tix) with 

if, g) = six) fix) + tix)gix). 

Proof. The proof is essentially a repetition of the proof of the euclidean algorithm in Z; 
just iterate the division algorithm: 

§ = qif + n 
f = qm + r 2 
r\ = qV 2 + r 3 

Fpi—4 — qn—2^n—3 ^n—2 

rn —3 ~ qn— \Tn—2 “1“ ^n— 1 
f n—2 — qnrn— 1 ~ ^n 
r n — 1 — d n I I rn. 

Since the degrees of the remainders are strictly decreasing, this procedure must stop after 
a finite number of steps. The claim is that d — r n is the gcd, once it is made monic. We 
see that d is a common divisor of / and g by back substitution: work from the bottom up. 
To see that d is the gcd, work from the top down to show that if c is any common divisor 
of / and g, then c \ r, for every i. Finally, to find .v and t with d — sf + tg, again work 
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from the bottom up. 


F n —2 n F n — 1 

—2 QniTn— 3 
(1 “i” Qn—\)fn—2 
(1 “t“ tfn — 1) (j"n —4 
(1 “1“ tfn—\)^n— 4 

Qn—\^n—2) 

tfnfn —3 

tfri—2^ n— 3 ) n— 3 

[(1 “h q n —\)q n —2 “1” Qn\ 

sf + tg • 



Here is an unexpected bonus from the euclidean algorithm. 

Corollary 3.41. Let k be a subfield of a field K, so that k[x] is a subring of K[x]. If 
fix), g(x) e k[x\ then their gcd in k[x] is equal to their gcd in AT [a:]. 

Proof. The division algorithm in K[x) gives 

g(x) = Q(x)f(x) + R(x), 

where Q(x), R(x ) e K[x\, since fix), g(x ) e k[x ], the division algorithm in k[x\ gives 

g(x ) = q(x)fix) + r(x), 

where q(x), r(x) e k[x]. But the equation g(x) = q(x)f(x) + r(x) also holds in K[x) 
because k[x~\ C K [x], so that the uniqueness of quotient and remainder in the division 
algorithm in Ai[x] gives Q{x) = q(x) e k[x ] and R(x) — r(x) e k\x\. Therefore, the list 
of equations occurring in the euclidean algorithm in K[x] is exactly the same list occurring 
in the euclidean algorithm in the smaller ring k[x\, and so the same gcd is obtained in both 
polynomial rings. • 

For example, the gcd of x 3 — x 2 + x — 1 and x 4 — 1 is x 2 + 1, whether computed in 
R[x] or in C[x], in spite of the fact that there are more divisors with complex coefficients. 

Here is the analog for polynomials of the fundamental theorem of arithmetic; it shows 
that irreducible polynomials are “building blocks” of arbitrary polynomials in the same 
sense that primes are building blocks of arbitrary integers. To avoid long sentences, let us 
agree that a “product” may have only one factor. Thus, when we say that a polynomial 
fix) is a product of irreducibles, we allow the possibility that the product has only one 
factor, that is, that fix) is itself irreducible. 

Theorem 3.42 (Unique Factorization). If k is a field, then every polynomial fix) e 
k[x) of degree > 1 is a product of a nonzero constant and monic irreducibles. Moreover, if 
fix) has two such factorizations 


fix) = apiix) ■ ■ ■ p m ix) and fix) = bq\ (x) ■ ■ ■ q n (x), 
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that is, a and b are nonzero constants and the p 's and q's are monic irreducibles, then 
a — b, m = n, and the q’s may be reindexed so that q ,■ = p, for all i. 

Proof. We prove the existence of a factorization for a polynomial fix) by (the second 
form of) induction on deg (/) > 1. If deg if) = 1, then fix) = ax + c = a(x + a _1 c). As 
every linear polynomial, x + a~ l c is irreducible, and so it is a product of irreducibles in 
our present usage of "product.” Assume now that deg(/) > 1. If fix) is irreducible and 
its leading coefficient is a, write fix) — a(a~ l fix))’, we are done, for a~ l fix) is monic. 
If f(x) is not irreducible, then fix) — gixfhix ), where deg(g) < deg(/) and deg(/z) < 
deg(/). By the inductive hypothesis, there are factorizations g(x) = bp\ ix) ■ ■ ■ p m (x) and 
hix) — cqiix) ■ ■ ■ q„ix), where the p’s and q’s are monic irreducibles. It follows that 

fix) = ( bc)p\ix) ■■■ p m ix)qi ix) ■ ■ ■ q n ix), 


as desired. 

We now prove, by induction on M = max{m, n} > 1, that if there is an equation 
api ix) ■ ■ ■ p m ix) = bqi ix) ■ ■ ■ q n ix) 

in which a and b are nonzero constants and the p’s and q’s are monic irreducibles, then 
a — b, m — n, and the q’s may be reindexed so that q, = /;, for all i. For the base step 
M — 1, the hypothesis gives a polynomial, call it g(v), with gix) = ap\ (x) = bqiix). 
Now a is the leading coefficient of g(x), because p \ ix ) is monic; similarly, b is the 
leading coefficient of gix) because q\ ix ) is monic. Therefore, a = b. and canceling 
gives p j(x) = qiix). For the inductive step, the given equation shows that p m ix) \ 
qiix) ■ ■ ■ q n ix). By Euclid’s lemma for polynomials, there is some i with p„,ix) \ qiix). 
But qiix), being monic irreducible, has no monic divisors other than 1 and itself, so that 
qiix) — pmix). Reindexing, we may assume that q,fx) = p m ix). Canceling this factor, 
we have ap\ix) ■ ■ ■ p m -tix) = bqiix) ■ ■ ■ q n -iix). By the inductive hypothesis, a = b, 
in — 1 = n — 1 (hence m — n), and after possible reindexing, q t — p, for all i. • 

Let k be a field, and assume that there are a, r\,..., r„ e k with 

n 

fix) = a ]~[(x - n). 

1=1 

If r\,..., r s , where ,v < n, are the distinct roots of fix), then collecting terms gives 
fix) = aix - ri) ei ix - r 2 ) ei ■ ■ ■ ix - r s ) es , 

where the rj are distinct and ej > 1 for all We call e j the multiplicity of the root rj. As 
linear polynomials are always irreducible, unique factorization shows that multiplicities of 
roots are well-defined. 

Although there are some techniques to help decide whether an integer is prime, the 
general problem is a very difficult one. It is also very difficult to determine whether a 
polynomial is irreducible, but we now present some useful techniques that frequently work. 
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We know that if fix) e k[x] and r is a root of fix) in a field k. then there is a 
factorization fix) — (x-r)g(x) in k[x], so that fix) is not irreducible. In Corollary 3.34, 
we saw that this decides the matter for quadratic and cubic polynomials in k[x]: such 
polynomials are irreducible in k[x] if and only if they have no roots in k. This is no longer 
true for polynomials of degree > 4. 

Theorem 3.43. Let fix) — ciq + a\x + • • • + a„x n e Z[x] C Q[x], Every rational root 
r of fix) has the form b/c, where b \ ao and c \ a„. 

Proof. We may assume that r = b/c is in lowest terms, that is, ib, c) — 1. Substituting r 
into f(x) gives 

0 — fib/c) — «o + a\b/c + - Ya n b n /c n , 

and multiplying through by c" gives 

0 = aoc n + a\bc n ~ { H-+ a„b n . 

Hence, aoc n — b(—a\c n ~ { — • • • — a n b n ~ l ), that is, b \ aoc n . Since b and c are relatively 
prime, it follows that b and c n are relatively prime, and so Euclid’s lemma in Z gives b \ ao. 
Similarly, a n b n — ci—a„-ib n ~ l — ■ ■ ■ — aoc”~ l ), c \ a n b'\ and c \ a„. • 

Definition. A complex number a is called an algebraic integer if a is a root of a monic 
fix) e Z[x], 

We note that it is crucial, in the definition of algebraic integer, that fix) e Z[x] be 
monic. Every algebraic number z, that is, every complex number z that is a root of some 
polynomial g(x) e Q[x], is necessarily a root of some polynomial hfx) e Z[x\, just clear 
the denominators of the coefficients of gix). 

Of course, every ordinary integer is an algebraic integer. To contrast ordinary integers 
with more general algebraic integers, elements of Z may be called rational integers. 

Corollary 3.44. A rational number z that is an algebraic integer must lie in Z. More 
precisely, if fix) e Z[x] C Q[jc] is a monic polynomial, then every rational root of fix) 
is an integer that divides the constant term. 

Proof. If fix) — ao + a\x H-b a„x n is monic, then a n = 1, and Theorem 3.43 applies 

at once. • 

For example, consider fix) — x 3 + 4x 2 — 2x — 1 e Q[x], By Corollary 3.34, this 
cubic is irreducible if and only if it has no rational root. As fix) is monic, the candidates 
for rational roots are ±1, for these are the only divisors of —1 in Z. But /(1) = 2 and 
/(—l) = 4, so that neither 1 nor —1 is a root. Thus, fix) has no roots in Q, and hence 
fix) is irreducible in Q[x], 

This corollary gives a new solution of Exercise 1.15(i) on page 12. If m is an integer 
that is not a perfect square, then the polynomial x 2 — m has no integer roots, and so *fm is 
irrational. Indeed, the reader can now generalize to nth roots: If m is not an nth power of 
an integer, then ffm is irrational, for any rational root of x n — m must be an integer. 
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Exercises 

3.28 Find the gcd of x~ — x — 2 and x 3 — lx + 6 in I 5 U ], and express it as a linear combination of 
them. 

Hint. The answer is x — 2. 

3.29 Let R be a domain. If fix) e 7?[x] has degree n, prove that f{x) has at most n roots in R. 
Hint. Use Frac(ff). 

3.30 Show that the following pseudocode implements the euclidean algorithm finding the gcd fix) 
and g(x) in I 3 f.v ], where fix) = x~ + 1 and g(x) = x 3 + x + 1. 

Input: g, f 
Output: d 
d := /; s := g 
WHILE s^ODO 

rem := remainder!/!, s) 
h := s 
s := rem 
END WHILE 

3.31 Prove the converse of Euclid's lemma. Let k be a field and let fix) e fc[x] be a polynomial 
of degree > 1 ; if, whenever fix) divides a product of two polynomials, it necessarily divides 
one of the factors, then fix) is irreducible. 

3.32 Let fix), gix) e f?[x], where R is a domain. If the leading coefficient of fix) is a unit in R, 
then the division algorithm gives a quotient qix) and a remainder rfx) after dividing gix) by 
fix). Prove that qix) and r(x) are uniquely determined by gix) and fix). 

Hint. Use Fraci/Z). 

3.33 Let k be a field, and let fix), gix) e L[x] be relatively prime. If /i(x) e /cfx], prove that 
fix) \ Hx) and gix) \ hfx) imply fix) gix) | hfx). 

Hint. See Exercise 1.19 on page 13. 

3.34 If k is a field, prove that y/\ — x 2 f kfx), where kfx) is the field of rational functions. 

Hint. Mimic a proof that \[2 is irrational. 

3.35 (i) In /? [x], where R is a field, let / = p j 1 ■ ■ • p e ™ and g — p j 1 ■ ■ • p^f , where the pf s 

are distinct monic irreducibles and e;, s; > 0 for all i (as with integers, the device 
of allowing zero exponents allows us to have the same irreducible factors in the two 
factorizations). Prove that / | g if and only if e; < Sj for all i. 

(ii) Use the (unique) factorization into irreducibles to give formulas for the gcd and 1cm of 
two polynomials analogous to the formulas in Proposition 1.17. 

3.36 If p is a prime, prove that there are exactly j (p 3 — p) monic irreducible cubic polynomials 
in I p [x]. (A formula for the number of monic irreducible polynomials of degree n in Ip[x] is 
given on page 194.) 

3.37 (i) Let fix) = (x — aj) • • • (x — a n ) e fc[x], where k is a field. Show that fix) has 

no repeated roots (that is, all the a, are distinct elements of k) if and only if the gcd 
if, f') = 1 , where f'(x) is the derivative of /. 

Hint. Use Exercise 3.24 on page 130. 

(ii) Prove that if p(.x) e Q[x] is an irreducible polynomial, then p(x) has no repeated roots 
in C. 



Sec. 3.5 Homomorphisms 


143 


Hint. Corollary 3.41. 

3.38 Let f = e 271 '!". 

(i) Prove that 

x n - 1 = (* - 1)(jc - ?)(* - f 2 ) • • ■ (x - f"- 1 ) 
and, if n is odd, that 

x n + 1 = (x + l)(x + ?)(* + f 2 ) • • ■ (-V + 

Hint. Use Corollary 3.29. 

(ii) For numbers a and b, prove that 

a n - b" ={a- b)(a - $b)(a - t; 2 b) •■■(«- 
and, if n is odd, that 

a" + b n = (a + b)(a + t;b)(a + f 2 fo) •••(« + 

Hint. Set x = a/b if b ^ 0. 


3.5 Homomorphisms 

Just as homomorphisms are used to compare groups, so are homomorphisms used to com¬ 
pare commutative rings. 

Definition. If A and R are (commutative) rings, a (ring) homomorphism is a function 
/: A —>■ R such that 

(i) /(I) = 1; 

(ii) f(a + a') — f(a) + f(a') for all a, a' e A\ 

(iii) f(aa') — f(a)f(a') for all a, a' e A. 

A homomorphism that is also a bijection is called an isomorphism. Commutative rings A 
and R are called isomorphic , denoted by A = R, if there is an isomorphism /: A —>• R. 

Example 3.45. 

(i) Let R be a domain and let F — Frac(/() denote its fraction field. In Theorem 3.13 
we said that R is a subring of F, but that is not the truth; R is not even a subset of F. 
We did find a subring R' of F, however, that has a very strong resemblance to R, namely, 
R' — {[a, 1] : a € R} c F. The function /: R -> R', given by f (a) = [a, 1], is easily 
seen to be an isomorphism. 

(ii) When an element in a commutative ring R was “identified” with a constant polynomial 
[in the proof of Lemma 3.16(iii)], that is, r was identified with (r, 0, 0,...), we implied 
that R is a subring of /?[x]. The subset R' — {(r, 0, 0, ...) : r e R} is a subring of /?[x], 
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and it is easy to see that the function /:/?—»■ R', defined by f(r) = (r. 0, 0,...), is an 
isomorphism. 

(iii) If S is a subring of a commutative ring R , then the inclusion i: S —> R is a ring homo¬ 
morphism because we have insisted that the identity 1 of R lies in S. [See Exercise 3.4(iii) 
on page 124.] ◄ 


Example 3.46. 

(i) Complex conjugation z = a + ib i —z = a — ib is an isomorphism C —*■ C because 
1 = 1 , z + w = z + w, and zw — zw 

(ii) Here is an example of a homomorphism of rings that is not an isomorphism. Choose 
m > 2 and define /: Z —* I m by /(n) = [n]. Notice that / is suijective (but not injective). 

(iii) The preceding example can be generalized. If R is a commutative ring with its “one” 
denoted by e, then the function / : Z —> R, defined by x(n) = ns, is a ring homomor¬ 
phism . 8 

(iv) Let I? be a commutative ring, and let a e R. Define the evaluation homomorphism 

e a : RM Rbye a (f(x )) = /(a); that is, if/(x) = £>,x ! , then f(a) = We 

let the reader check that e a is a ring homomorphism. < 

Certain properties of a ring homomorphism /: A —> R follow from its being a ho¬ 
momorphism between the additive groups A and R. For example, /(0) = 0, /(—a) = 
—/(a), and f(na) — nf(a ) for all n e Z. 

Lemma 3.47. If f : A —> R is a ring homomorphism, then, for all a e A, 

(i) f(a n ) = f (a)" for all n > 0 ; 

(ii) if a is a unit, then f(a) is a unit and /(a -1 ) = /(a) -1 ; in fact, if a is a unit, then 
f (a~ n ) — f (a)~ n for all n > 1; 

(iii) if f : A —> R is a ring homomorphism, then 

f (U(A)) < UiR), 

where U (A) is the group of units of A: if f is an isomorphism, then 

U (A) = U (R). 

Sketch of Proof (i) Induction on n > 0. 

(ii) If ab = 1, then 1 = f(ab) — f(a)f(b). The last statement follows by induction on 
n > 1. 

(iii) Immediate, from part (ii). • 


Recall that if a £ R and n is a positive integer, then na is the additive version of the multiplicative notation 
a"; that is, na is the sum of a with itself n times. 
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Proposition 3.48. If R and S are commutative rings and cp : R —>■ S is a ring homomor¬ 
phism, then there is a ring homomorphism (p* : 7? [jc] —> S[jc] given by 

<p *: ro + r\x + r 2 x 2 H-i-> (pirf) + <p(r\)x + (p{r 2 )x 2 H-. 

Sketch of Proof. It is clear that (p* is well-defined, and a routine calculation shows that it 
is a ring homomorphism. • 


Definition. If /: A -> R is a ring homomorphism, then its kernel is 

ker/ = {a e A with f(a) = 0}, 


and its image is 

im/= {r e R : r — f(a) for some a e R}. 

Notice that if we forget their multiplications, then the rings A and R are additive abelian 
groups and these definitions coincide with the group-theoretic ones. 

Let k be a commutative ring, let a e k, and, as in Example 3.46(iv), consider the 
evaluation homomorphism e a : k[. r] -> k sending fix) i-»- f(a). Now e a is always 
suijective, for if b e k, then b — e a {f ), where fix) = x — a + b. By definition, kere a 
consists of all those polynomials g(x) for which g(a) — 0 ; that is, kere„ consists of all the 
polynomials in k[v] having a as a root. 

The kernel of a group homomorphism is not merely a subgroup; it is a normal subgroup; 
that is, it is also closed under conjugation by any element in the ambient group. Similarly, 
if R is not the zero ring, the kernel of a ring homomorphism /: A -> R is almost a subring 
[ker / is not a subring because it never contains 1 : /( 1 ) = 1 f 0 ], and we shall see that it 
is closed under multiplication by any element in the ambient ring. 

Definition. An ideal in a commutative ring R is a subset I of R such that 

(i) 0 e /; 

(ii) if a, b e /, then a + b e /; 9 

(iii) if a e I and r e R, then ra e I . 

The ring R itself and the subset consisting of 0 alone, which we denote by {0}, are 
always ideals in a commutative ring R. An ideal / f R\s called a proper ideal. 

Example 3.49. 

If bi , b 2 , ..., b n lie in R, then the set of all linear combinations 

I — [r\b\ + r 2 bi H-h r n b n : r, e R for all i } 

9 In contrast to the definition of subring, it suffices to assume that a f bet instead of a — b e I. If I is an 
ideal and b e /, then (—1 )b e /, and so a — b = a + (—1 )b e I. 
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is an ideal in R. We write I = (h\. bi, ■ ■ ■ , b n ) in this case, and we call I the ideal 
generated by b\, Z? 2 , • • •, b n . In particular, if n = 1, then 

I — ( b) — {rb : r e R} 

is an ideal in R\ (b) consists of all the multiples of b , and it is called the principal ideal 
generated by b. Notice that R and {0} are always principal ideals: R = (1) and {0} = (0). 
In Z, the even integers form the principal ideal (2). ■* 

Proposition 3.50. If f : A —> R is a ring homomorphism, then ker f is an ideal in A 
and im / is a subring of R. Moreover, if A and R are not zero rings, then ker / is a proper 
ideal. 

Sketch of Proof ker / is an additive subgroup of A; moreover, if u e ker / and a e A, 
then f(au ) = f(a)f(u) — f(a) ■ 0 = 0. Hence, ker / is an ideal. If R is not the zero 
ring, then 1 f 0; hence, the identity 1 e A does not lie in ker /, because /Cl) = 1 f 0 in 
R, and so ker / is a proper ideal. It is routine to check that im / is a subring of R. • 

Example 3.51. 

(i) If an ideal I in a commutative ring R contains 1, then I — R, for now I contains r = r 1 
for every r e R. Indeed, if I contains a unit u, then I = R. for then I contains m _1 m = 1. 

(ii) It follows from (i) that if R is a field, then the only ideals I in R are {0} and R itself: if 
I f {0}, it contains some nonzero element, and every nonzero element in a field is a unit. 

Conversely, assume that R is a nonzero commutative ring whose only ideals are R itself 
and {0}. If a € R and a f 0, then the principal ideal (a) — R, for (a) f 0, and so 
1 e R = (a). There is thus r e R with 1 = ra\ that is, a has an inverse in R, and so R is a 
field. ◄ 

Proposition 3.52. A ring homomorphism f : A —> R is an injection if and only if ker / = 
{0}. 

Sketch of Proof This follows from the corresponding result for group homomorphisms, 
because / is a homomorphism from the additive group of A to the additive group of R. • 

Corollary 3.53. If f : k —> R is a ring homomorphism, where k is a field and R is not 
the zero ring, then f is an injection. 

Proof. The only proper ideal in k is {0}. • 

Theorem 3.54. Ifk is a field, then every ideal I in k\x\ is a principal ideal. Moreover, if 
I zfi {0}, there is a monic polynomial that generates I. 

Sketch of Proof. If k is a field, then k[xj is an example of a euclidean ring. In Theo¬ 
rem 3.60, we will prove that every ideal in a euclidean ring is a principal ideal. • 
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Definition. A domain R is a principal ideal domain if every ideal in R is a principal 
ideal. This name is often abbreviated to PID. 

Example 3.55. 

(i) The ring of integers is a PID. 

(ii) Every field is a PID, by Example 3.5 l(ii). 

(iii) If A; is a field, then the polynomial ring Ac[x] is a PID, by Theorem 3.54. 

(iv) There are rings other than Z and k[x~\, where A: is a field, that have a division algorithm; 
they are called euclidean rings , and they, too, are PIDs. We shall consider them in the next 
section. < 

It is not true that ideals in arbitrary commutative rings are always principal ideals. 

Example 3.56. 

Let R — Z[x], the commutative ring of all polynomials over Z. It is easy to see that the set 
I of all polynomials with even constant term is an ideal in Z[x], We show that / is not a 
principal ideal. 

Suppose there is d(x) e Z[x ] with I — ( d{x )). The constant 2 e /, so that there 
is f(x) e Z[x] with 2 = d(x)f(x). Since the degree of a product is the sum of the 
degrees of the factors, 0 = deg(2) = deg ft/) + deg(/). Since degrees are nonnegative, it 
follows that deg(t/) = 0 [i.e., d(x ) is a nonzero constant]. As constants here are integers, 
the candidates for d(x) are ±1 and ±2. Suppose d(x) — ±2; since x e /, there is 
g(x) e Z[x] with x = d(x)g(x) = ±2 g(x). But every coefficient on the right side is even, 
while the coefficient of x on the left side is 1. This contradiction gives d(x) — ±1. By 
Example 3.5l(ii), I = Z[x], another contradiction. Therefore, no such d(x) exists, that is, 
the ideal I is not a principal ideal. ■* 

Certain theorems holding in Z carry over to PIDs once the standard definitions are 
generalized; the notion of divisor has already been generalized. 

Definition. An element S in a commutative ring R is a greatest common divisor , gcd, of 
elements a. ft e R if 

(i) S is a common divisor of a and /l; 

(ii) if y is any common divisor of a and ft, then y \ S. 


Greatest common divisors, when they exist, need not be unique; for example, it is easy 
to see that if c is a greatest common divisor of / and g. then so is uc for any unit u e R. 
In the special case R = Z, we force uniqueness of the gcd by requiring it to be positive; if 
R — k[x], where A: is a field, then we force uniqueness of the gcd by further requiring it to 
be monic. 
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Remark. Let R be a PID and let n,a e R with n irreducible. A gcd <5 of tc and a is, in 
particular, a divisor of n . Hence, n — Se, and irreducibility of n forces either <5 or e to be 
a unit. Now a — 8ft. If S is not a unit, then e is a unit, and so 

a = 8/3 = ne~ l ft; 

that is, t r | a. We conclude that if n \ a, then 8 is a unit; that is, 1 is a gcd of it and a. ◄ 

For an example of a domain in which a pair of elements does not have a gcd, see 
Exercise 3.60 on page 158. 

Theorem 3.57. Let R be a PID. 

(i) Every a, ft e R has a gcd, 8, which is a linear combination of a and f>: 

8 = aa + rft, 

where a, x e R. 

(ii) If an irreducible element n e R divides a product a/3, then either n \ a or n \ ft. 

Proof (i) We may assume that at least one of a and /3 is not zero (otherwise, the gcd is 0 
and the result is obvious). Consider the set I of all the linear combinations: 

I = {era + t/ 3 : a, r in R}. 

Now a and j3 are in I (take a — 1 and r = 0 or vice versa). It is easy to check that I is an 
ideal in R , and so there is 8 e I with I — ( 8 ), because R is a PID; we claim that 8 is a gcd 
of a and ft. 

Since a e I — (S), we have a — p8 for some p e R; that is, 8 is a divisor of a; 
similarly, 8 is a divisor of ft, and so 8 is a common divisor of a and ft. 

Since 8 e I, it is a linear combination of a and ft: There are a. r e R with 

8 = aa + tft. 

Finally, if y is any common divisor of a and ft, then a = ya' and ft = yft', so that y 
divides 8, for 8 — a a + rft — y(aa' + rft'). We conclude that 8 is a gcd. 

(ii) If jt Icr, we are done. If n | a, then the remark says that 1 is a gcd of it and a. There 
are thus elements a, r e R with 1 = an + ta, and so 

ft = an ft + Taft. 

Since n \ aft, it follows that n \ ft, as desired. • 


Example 3.58. 

If I and J are ideals in a commutative ring R, we now show that I (T J is also an ideal in R. 
Since 0 el and 0 e /, we have 0 e I fl J. If a, b e I IT /, then a — b e I and a — b e J, 
for each is an ideal, and so a — b e IHJ. If a e I DJ and r e R, then ra e I and ra e J, 
hence ra e I fl J. Therefore, I IT / is an ideal. With minor alterations, this argument also 
proves that the intersection of any family of ideals in R is also an ideal in R. a 
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Definition. If / and g are elements in a commutative ring R, then a common multiple 
is an element m e R with / | m and g \ m. If / and g in R are not both 0, define their 
least common multiple , abbreviated 1cm, to be a common multiple c of them with c \ m 
for every common multiple m. If / = 0 = g, define their 1cm = 0. The 1cm of / and g is 
often denoted by [f,g]. 

Least common multiples, when they exist, need not be unique; for example, it is easy 
to see that if c is a least common multiple of / and g, then so is uc for any unit u € R. In 
the special case R — Z, we force uniqueness of the 1cm by requiring it to be positive; if 
R — k[x], where A' is a field, then we force uniqueness of the 1cm by further requiring it to 
be monic. 

Exercises 

3.39 (i) Let (p: A —> R be an isomorphism, and let i/r: R —> A be its inverse. Show that f is 

an isomorphism. 

(ii) Show that the composite of two homomorphisms (isomorphisms) is again a homomor¬ 
phism (isomorphism). 

(iii) Show that A = R defines an equivalence relation on the class of all commutative rings. 

3.40 Let R be a commutative ring and let ^(R) be the commutative ring of all functions /: R —»■ R 
with pointwise operations. 

(i) Show that R is isomorphic to the subring of tFlR) consisting of all the constant func¬ 
tions. 

(ii) If fix) e A*|a], let (pf : R -*■ R be defined by r fir) [thus, <pj is the polynomial 
function associated to fix)]. Show that the function <p: /?[*] -*■ J-(R), defined by 
(pi fix)) = (pf, is a ring homomorphism. 

(iii) Show that ip is injective if R is an infinite field. 

3.41 Let I and J be nonzero ideals in a commutative ring R. If R is a domain, prove that 1 R\ J f 
10 ). 

3.42 Let R be a commutative ring. Show that the function e: f?[x] -*■ R, defined by 

S’. Cl2X + • • • + OfiX I—A flQ, 

is a homomorphism. Describe ker e in terms of roots of polynomials. 

3.43 If R is a commutative ring and c e R, prove that the function 1 p: /?[jc] —»■ /?[*], defined by 
f{x) f(x + c), is an isomorphism. In more detail, (pCf^j s,x l ) = J]/ s ii x + C Y ■ 

Hint. This is a routine but long calculation. 

3.44 (i) Prove that F , the field with four elements (see Exercise 3.14 on page 125), and I 4 are 

not isomorphic commutative rings. 

(ii) Prove that any two fields having exactly four elements are isomorphic. 

Hint. First prove that 1 + 1 = 0, and then show that the nonzero elements form a cyclic 
group of order 3 under multiplication. 

3.45 (i) Show that every element a e I p has a pth root (i.e., there is b e l p with a — b p ). 

(ii) Let k be a field that contains l p as a subfield [e.g., k = Ip(.v)]. For every positive integer 
n, show that the function <p n : k -*■ k, given by (p{a) = a p , is a ring homomorphism. 
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3.46 If R is a field, show that R = Frac(R). More precisely, show that the homomorphism 
/:!?—>■ Fractal in Example 3.45(i), namely, r [r, 1], is an isomorphism. 

3.47 (i) If A and R are domains and <p: A —»■ R is a ring isomorphism, prove that 

[a, b ] [i p{a), (p(b)] 

is a ring isomorphism Frac(A) —> Frac(R). 

(ii) Prove that if a field k contains an isomorphic copy of Z as a subring, then k must contain 
an isomorphic copy of Q. 

(iii) Let R be a domain and let < p: R —*■ k be an injective ring homomorphism, where k is a 

field. Prove that there exists a unique ring homomorphism <t>: Frac (R) -*■ k extending 
i p; that is, <t>| = <p. 

3.48 Let R be a domain with fraction field F — Frac(R). 

(i) Prove that Frac(/?[x]) = Fix). 

(ii) Prove that Frac(R[xj, xi, ■ ■ ■, x n ]) = F(x j, X 2 , ..., x n ) (see page 129). 

3.49 (i) If R and 5 are commutative rings, show that their direct product R x S is also a com¬ 

mutative ring, where addition and multiplication in R x 5 are defined “coordinatewise”: 

(r, s) + (/, s') = (r + r', s + s') and (r, s)(r f , s') — {rr , ss'). 

(ii) Show that if m and n are relatively prime, then I mn = I m x I„ as rings. 

Hint. See Theorem 2.81. 

(iii) Show that if neither R nor S is the zero ring, then R x 5 is not a domain. 

(iv) Show that R x (0) is an ideal in R x 5. 

(v) Show that R x (0) is a ring isomorphic to R, but it is not a subring of R x 5. 

3.50 (i) If R and S are nonzero commutative rings, prove that 

U(R xS) = U(R) x U(S), 
where U(R) is the group of units of R. 

Hint. Show that (r, s) is a unit in R x S if and only if r is a unit in R and s is a unit in 
5. 

(ii) Redo Exercise 2.65 on page 94 using part (i). 

(iii) Use part (i) to give another proof of Corollary 2.83. 

3.51 Let F be the set of all 2 x 2 real matrices of the form 

a b 

A = , 

—b a 

Prove that F is a field (with operations matrix addition and matrix multiplication), and prove 
that there is an isomorphism tp : F —> C with det(A) = ip(A)<p(A). 

Hint. Define <p: F C by <p(A) = a + ib. 

3.52 If k is a field and [/, g] denotes the 1cm of monic polynomials fix), g(x) e k[x ], show that 

[/.*](/.*) = /*• 

Hint. See Exercise 1.26 on page 13. By definition, lcm’s are monic. 
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3.53 If R is a PID and a.b e R, prove that their 1cm exists. 

3.54 (i) If k is a field, prove that the ring of formal power series k[[x]] is a PID. 

Hint. If I is a nonzero ideal, choose r e / of smallest order. Use Exercise 3.27 on 
page 130 to prove that I = (r). 

(ii) Prove that every nonzero ideal in k[[x]] is equal to (x") for some n > 0. 

3.55 If k is a field, show that the ideal (x, y) in k[x, y] is not a principal ideal (see page 129). 

3.56 For every m > 1, prove that every ideal in I m is a principal ideal. (If m is composite, then I m 
is not a PID because it is not a domain.) 

3.6 Euclidean Rings 

There are rings other than Z and k[x], where A: is a field, that have a division algorithm. In 
particular, we present an example of such a ring in which the quotient and remainder are 
not unique. We begin by generalizing a property shared by both Z and k[x~\. 

Definition. A euclidean ring is a domain R that is equipped with a function 

9 : R - {0} -» N, 


called a degree function, such that 

(i) 3(/) < Hfg) for all /, geR with /, g # 0; 

(ii) for all /, g e R with f f 0, there exist q, r e R with 

g = qf + r, 

where either r — 0 or 9(r) < 9(/). 

Note that if R has a degree function 9 that is identically 0, then condition (ii) forces 
r — 0 always; taking g — 1 shows that R is a field in this case. 

Example 3.59. 

(i) The integers Z is a euclidean ring with degree function 9(m) = \m\. In Z, we have 

9(mn) = \mn\ — |m||n| = 9(m)9(n). 

(ii) When k is a field, the domain A[x] is a euclidean ring with degree function the usual 
degree of a nonzero polynomial. In k[x], we have 

d(fg) = deg (fg) 

= deg (/) + deg(g) 

= 3 (/) + 9(g). 
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Since 3 (inn) = 3(m)3(n) in Z, the behavior of the degree of aproduct is not determined 
by the axioms in the definition of a degree function. If a degree function 3 is multiplicative, 
that is, if 

3 ifg) = d(f)dig), 

then 3 is called a norm. 

(iii) The Gaussian 10 integers Z[i] form a euclidean ring whose degree function 

3 (a + bi) — a 2 + b 2 

is a norm. One reason for showing that Z[i] is a euclidean ring is that it is then a PID, and 
hence it has unique factorization of its elements into products of irreducibles; Gauss used 
this fact in his proof that if an odd prime p is sum of two squares, say p — a 2 + b 2 , where 
a and b are natural numbers, then the pair a, b is unique (see Theorem 3.66). 

To see that 3 is a multiplicative degree function, note first that if a = a + bi. then 

d(a) = ctff, 

where a — a — bi is the complex conjugate of a. It follows that 3 (a/3) — d(a)d(/B) for all 
a, p e Z[/], because 


3 (aft) — 0//30//3 — aPaP = aaPP — d(a)d(P)\ 

indeed, this is even true for all a, p e Q[i] = {x + yi : x, y e Q}, by Corollary 1.31. 

We now show that 3 satisfies the first property of a degree function. If p — c+id e Z[i] 
and /3 ^ 0 , then 

1 < 3(/3), 

for d(/3) — c 2 + d 2 is a positive integer; it follows that if a, ft e Z[i] and /I/O, then 

3(a) < 3(a)3(/3) = 3(a/0). 

Let us show that 3 also satisfies the second desired property. Given a, /J e Z[i] with 
P ^ 0, regard a/ft as an element of C. Rationalizing the denominator gives a/ft = 
afi/PP — aP/d(P), so that 

a/p = x + yi, 

where x, y e Q. Write x — a + u and y = b + v, where a, b e Z are integers closest 
to x and y, respectively; thus, \u\, |v| < (If x or y has the form m + \, where m is an 
integer, then there is a choice of nearest integer: x = m + \ or x — (m + 1) — a similar 
choice arises if x or y has the form m — i,.) It follows that 

a = P(a + bi) + P(u + vi). 

10 The Gaussian integers are so called because Gauss tacitly used Z[i] and its norm 9 to investigate biquadratic 
residues. 
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Notice that f3(u + vi) e Z[i], for it is equal to a — f(a + bi). Finally, we have 

3 (/3(m + vif) = 3 (yS) 9 (w + vi), 

and so 3 will be a degree function if 3 (u + vi) < 1. And this is so, for the inequalities 
\u\ < j and |v| < 4 give u < ^ and v < and hence 3 (m + vi) — u + v < | + | = 
^ < 1. Therefore, 3( y 6(11 + vi)) < 3 {f)), and so Z[i] is a euclidean ring whose degree 
function is a norm. 

We now show that quotients and remainders may not be unique (because of the choices 
noted previously). For example, let a = 3 + 5/ and f = 2. Then a/f J > = % + f;; the 
choices are 

a — 1 and u = \ or a — 2 and u = — ^; 

b — 2 and v — \ or b = 3 and v = — ^. 

There are four quotients and remainders after dividing 3 + 5/ by 2 in Z[i], for each of the 
remainders (e.g., 1 + i) has degree 2 < 4 = 3(2): 

3 + 5; =2(1 + 2i) + (l + t); 

= 2(1 + 3z) + (1 - 0; 

= 2(2+ 20+ (-i + 0; 

= 2(2+ 30+ (-1-0- * 


Theorem 3.60. Every euclidean ring R is a PID. 

Proof. Let I be an ideal in R. If I = {0}, then I = (0) is principal; therefore, we may 
assume that I f (0). By the least integer axiom, the set of all degrees of nonzero elements 
in I has a smallest element, say, n; choose del with d(d) = n. Clearly, id) C /, and so 
it suffices to prove the reverse inclusion. If a e /, then there are q , r e R with a = qd + r, 
where either r = 0 or 3(r) < did). But r — a — qd e I , and so d having least degree 
implies that r — 0. Hence, a — qd e ( d ), and I — (d). • 

Corollary 3.61. The ring of Gaussian integers Z[i ] is a principal ideal domain. 

The converse of Theorem 3.60 is false: There are PIDs that are not euclidean rings, as 
we see in the next example. 

Example 3.62. 

It is shown in algebraic number theory that the ring 

R — [a + ba : a, b e Z], 

where a — j(l + V—19), is a PID | R is the ring of algebraic integers in the quadratic 
number field Q(V—19)]. In 1949, T. S. Motzkin showed that R is not a euclidean ring by 
showing that it does not have a certain property of euclidean rings that does not mention 
its degree function. 
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Definition. An element u in a domain R is a universal side divisor if u is not a unit and, 
for every x e R, either u \ x or there is a unit z € R with u \ (x + z). 


Proposition 3.63. If R is a euclidean ring but not afield, then R has a universal side 
divisor. 

Proof. Define 

S = {d(v) : t / 0 and v is not a unit}, 

where 3 is the degree function on R. Since R is not a field, by hypothesis, S is a nonempty 
subset of the natural numbers. By the least integer axiom, S has a smallest element, say, 
3 (u). We claim that u is a universal side divisor. If x e R, there are elements q and r with 
x — qu + r, where either r — 0 or 3(r) < 3(n). If r — 0, then u \ x; if r 0, then r must 
be a unit, otherwise its existence contradicts 3 (u) being the smallest number in S. We have 
shown that u is a universal side divisor. • 

Motzkin then showed that the ring {a + ba : a, b e Z}, where a — j(l + V - 19), 
has no universal side divisors, concluding that this PID is not a euclidean ring. For details, 
we refer the reader to K. S. Williams, “Note on Non-euclidean Principal Ideal Domains,” 
Math. Mag. 48 (1975), 176-177. a 

What are the units in the Gaussian integers? 

Proposition 3.64. 

(i) Let R be a euclidean ring R that is not a field. If the degree function 3 is a norm, 
then a is a unit if and only ifd(a) — 1. 

(ii) Let R be a euclidean ring R that is not a field. If the degree function 3 is a norm and 
ifd(a) — p, where p is a prime number, then a is irreducible. 

(iii) The only units in the ring Z[i] of Gaussian integers are ±1 and ±/. 

Proof, (i) Since l 2 = 1, we have 3(1 ) 2 = 3(1), so that 3(1) = 0 or 3(1) = 1. If 
3 ( 1 ) = 0, then 3(a) = 3(la) = 3(l)3(a) = 0 for all a e R. But R is not a field, and so 3 
is not identically zero. We conclude that 3(1) = 1. 

If a e R is a unit, then there is f> e R with a (1 = 1. Therefore, d(a)d(f) = 1. Since 
the values of 3 are nonnegative integers, 3 (a) — 1 . 

For the converse, we begin by showing that there is no element f e R with <Hf ) — 0. 
If such an element existed, the division algorithm would give 1 — qf + r, where q, r e R 
and either r — 0 or 3 (r) < 3 (fi) = 0. The inequality cannot occur, and so r = 0; that is, 
f is a unit. But if is a unit, then d(fi) — 1, as we have just proved, and this contradicts 
d(f) — 0 . 

Assume now that 3(a) = 1. The division algorithm gives q, r e R with 

a = qa 2 + r, 
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where r = 0 or 3 (r) < 3(or). As 3(a 2 ) — 3(a ) 2 = 1, either r — 0 or 3 (r) = 0. But 
we have just seen that 3 (r) =0 cannot occur, so that r = 0 and a — qa 2 . It follows that 
1 = qa , and so a is a unit. 

(ii) If, on the contrary, a — fiy, where neither f nor y is a unit, then p — 3(a) = 
d(/3)d(y). As p is a prime, either d(fi) = 1 or 3 (y) — 1. By part (i), either f or y is a 
unit; that is, a is irreducible. 

(iii) If a — a + bi e Z[i] is a unit, then 1 = 3(a) — a 2 + b 2 . This can happen if and only 

if a 2 = 1 and b 2 = 0 or a 2 = 0 and b 2 = 1 ; that is, a — ±1 or a = ±i. • 

If n is an odd number, then either n = 1 mod 4 or n = 3 mod 4; consequently, the odd 
prime numbers are divided into two classes. For example, 5, 13, 17 are congruent to 1 mod 
4, while 3,7, 11 are congruent to 3 mod 4. 

Lemma 3.65. If p is a prime and p = 1 mod 4, then there is an integer m with 

9 

m = — 1 mod p. 

Proof. If G — (I / ,) x is the multiplicative group of nonzero elements in E ;) , then |G| = 
p — 1=0 mod 4; that is, 4 is a divisor of |G|. By Proposition 2.78, G contains a subgroup 
S of order 4. By Exercise 2.36 on page 72, either S is cyclic ora 2 = 1 for all a e S. 
Since Ip is a field, however, it cannot contain four roots of the quadratic x 2 — 1. Therefore, 
S is cyclic , 11 say, S = ([/«]), where [m\ is the congruence class of m mod p. Since [m] 
has order 4, we have [m 4 ] = [1], Moreover, [m 2 ] [1] (lest [m] have order < 2 < 4), 

and so [m 2 ] = [—1], for [—1] is the unique element in S of order 2. Therefore, mr = 
— 1 mod p. • 


Theorem 3.66 (Fermat’s 12 Two-Squares Theorem). An odd prime p is a sum of two 

squares, 

p — a 2 + b 2 , 

where a and b are integers, if and only if p = 1 mod 4. 

Proof. Assume that p — a 2 + b 2 . Since p is odd, a and b have different parity; say, a is 
even and b is odd. Hence, a = 2m and b = 2n + 1, and 

p = a 2 + b 2 — 4m 2 + 4n 2 + 4n + 1 = 1 mod 4. 

Conversely, assume that p = 1 mod 4. By the lemma, there is an integer m such that 

P I (m 2 + 1). 

^Theorem 3.30 says that G is a cyclic group, which implies that S is cyclic, for every subgroup of a cyclic 
group is itself cyclic. We choose to avoid this theorem here, for the proof just given is more elementary. 

12 Fermat was the first to state this theorem, but the first published proof is due to Euler. Gauss proved that 
there is only one pair of natural numbers a and b with p = + Z? 2 . 
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In Z[i], there is a factorization m 2 + 1 = (m + i)(m — i), and so 

p | (m + i)(m — i) in Z[i], 

If p | (m ± i) in Z[i], then there are integers u and v with m ± i — p(u + iv). Comparing 
the imaginary parts gives pv — 1, a contradiction. We conclude that p does not satisfy 
the analog of Euclid’s lemma in Theorem 3.57 (recall that Z[i\ is a PID); it follows from 
Exercise 3.62 on page 158 that p is not an irreducible element in Z[i], Hence, there is a 
factorization 

p = a/3 in Z[i] 

in which neither a — a + ib nor f — c + id is a unit. Therefore, taking norms gives an 
equation in Z: 


P 2 = 3 (p) 

= 9 (a/3) 

= 3(a)3(/J) 

= (a 2 + b 2 )(c 2 + d 2 ). 

By Proposition 3.64, the only units in Z[i] are ±1 and ±i, so that any nonzero Gaussian 
integer that is not a unit has norm > 1; therefore, a 2 + b 2 1 and c 2 + d 2 f 1. Euclid’s 
lemma now gives p | (a 2 + b 2 ) or p | (c 2 + d 2 )\ the fundamental theorem of arithmetic 
gives p = a 2 + b 2 (and p = c 2 + d 2 ), as desired. • 

We are going to determine all the irreducible elements in Z[i\, but we first prove a 
lemma. 

Lemma 3.67. If a e Z[i] is irreducible, then there is a unique prime number p with 
a | p in Z[i], 

Proof. Note that if a e Z[i], then a e Z[i]; since 3(a) = aa, we have a | 3(a). Now 
3(a) = pi ■ ■ ■ p n , where the p, are prime numbers. As Z[i] is a PID, Exercise 3.62 on 
page 158 gives a | p, for some i (for a is irreducible). If a | q for some prime q f pj, 
then a | (q. p ,) = 1, forcing a to be a unit. This contradiction shows that p t is the unique 
prime number divisible by a. • 


Proposition 3.68. Let a = a + bi e Z[i] be neither 0 nor a unit. Then a is irreducible if 
and only if 

(i) a is an associate of a prime p in Z of the form p — 4 m + 3; or 

(ii) a is an associate of 1 + i or its conjugate 1 — i; or 

(iii) 3(a) = a 2 + b 2 is a prime in Z of the form 4m + 1. 
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Proof. By Lemma 3.67, there is a unique prime number p divisible by a in Z[i], Since 
a | p, we have 3(a) | 3 (p) — p 2 in Z, so that 3(a) — p or 3(a) = p 2 \ that is, 

a 2 + b 2 — p or a 2 + b 2 — p 2 , 

Looking at p mod 4, we see that there are three possibilities (for p = 0 mod 4 cannot 
occur). 

(i) p = 3 mod 4. 

In this case, a 2 + b 2 — p cannot occur, by (the easy direction of) Theorem 3.66, so 
that 3(a) — a 2 + b 2 — p 2 . Now p is divisible by a, so there is ft with aft — p. Hence, 
3(a)3 (f) — 3 (p). Since p e Z, we have 3 (p) — p 2 , so that p 2 d(f) = p 2 . Thus, 
3 if) — 1, f is a unit, by Proposition 3.64(i), and p is irreducible in Z [(]. 

(ii) p = 2 mod 4. 

In this case, p — 2, and so a 2 + b 2 = 2 or a 2 + b 2 — 4. The latter case cannot occur 
(because a and b are integers), and the first case gives a = 1 ± i (up to multiplication 
by units). The reader should check that both 1 + i and 1 — i are, indeed, irreducible 
elements. 

(iii) p = 1 mod 4. 

If 3(a) is a prime p (with p = 1 mod 4), then a is irreducible, by Proposition 3.64(h). 
Conversely, suppose a is irreducible. As 3(a) = p or 3(a) = p 2 , it suffices to eliminate 
the latter possibility. Since a | p, we have p = af for some f e Z[/]; hence, as in 
case (i), 3(a) = p 2 implies that f is a unit. Now aa = p 2 — {af) 2 , so that a = af 2 . But 
f 2 — ±1, by Proposition 3.64(iii), contradicting a f ±a. Therefore, 3(a) = p. • 

For example, 3 is an irreducible element of the first type, and 2 + i is an irreducible 
element of the third type. We should remember that there are interesting connections be¬ 
tween prime numbers and irreducible Gaussian integers, that knowing the Gaussian units 
is valuable, and that the norm is a useful tool in proving results. The ring of Gaussian 
integers is an instance of a ring of algebraic integers, and these comments remain true for 
these rings as well. 

Exercises 


Definition. Let k be a field. A common divisor of ci\ (x), a 2 (x), .... a n (x) in k\x ] is a polynomial 
e(x) e k\x) with c(x) | ctj (x) for all i ; the greatest common divisor is the monic common divisor of 
largest degree. We write c{x) = {a\, a 2 , ■ ■ ■, a n ). 

3.57 Let k be a field, and let polynomials a\ (x), < 22 (x), • • •, a n (x) in k[x) be given. 

(i) Show that the greatest common divisor d(x) of these polynomials has the form 

tj {x)ai (x), where t\ (x) e k[x ] for 1 < i < n. 

Hint. Example 3.49. 

(ii) Prove that c(x) | d{x) for every monic common divisor c(x) of the a, (x). 
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3.58 (i) Show that x, y e k[x, y] are relatively prime, but that 1 is not a linear combination of 

them [i.e., there do not exist s(x, y), t(x, y) e k[x, y] with 1 = ,y.s(jc, y) + yt(x, y)]. 
Hint. Use a degree argument. 

(ii) Show that 2 and x are relatively prime in Z[x], but that 1 is not a linear combination of 
them; that is, there do not exist sfx), t(x) e Z[y] with 1 = 2s{x) + xt(x). 

3.59 A student claims that x — 1 is not irreducible because x — 1 = ( *Jx + 1 ){*/x — 1) is a 

factorization. Explain the error of his ways. 

Hint. Show that + 1 is not a polynomial. 

3.60 Prove that there are domains R containing a pair of elements having no gcd. (See the definition 
on page 147.) 

Hint. Let k be a field and let R be the subring of k[x ] consisting of all polynomials having 
no linear term; that is, f{x) e R if and only if 

f(x) = s 0 + s 2 x 2 + s 3 x 3 4-. 

Show that x 5 and y 6 have no gcd in R. 

3.61 Prove that R = Z[->/2] = (a + b-j2 : a, b e Z) is a euclidean ring with 3(a + b-J 2) = 
\a 1 -2b 1 \. 

3.62 If R is a euclidean ring and n e R is irreducible, prove that n \ a/3 implies n \ a or n \ fi. 

3.63 Let 3 be the degree function of a euclidean ring R. If m, n e N and m > 1, prove that d' is 
also a degree function on R, where 

d'(x) = md(x) + n 

for all x e R. Conclude that a euclidean ring may have no elements of degree 0 or degree 1. 

3.64 Let R be a euclidean ring with degree function 3. 

(i) Prove that 3(1) < 3(a) for all nonzero a e R. 

(ii) Prove that a nonzero u € R is a unit if and only if 3(a) = 3(1). 

Hint. A proof can be generalized from the special case of polynomials. 

3.65 Let R be a euclidean ring, and assume that b e R is neither zero nor a unit. Prove, for every 
i > 0, that 3 (b' ) < 3 (b i+1 ). 

Hint. There are q,r e R with b' — qb i+i + r. 

3.66 If p is a prime and p = 3 mod 4, prove that one of the congruences a 2 = 2 mod p or 
a 2 = —2 mod p is solvable. 

Hint. Show that Ip = (—1) x H, where H is a group of odd order m, say, and observe that 
either 2 or —2 lies in H because 

h xlm = ({1} X H) U ({—1} X H). 

Finally, use Exercise 2.54 on page 81. 


3.7 Linear Algebra 

We interrupt the exposition to discuss some linear algebra, for it is a necessary tool in 
further investigation of commutative rings. 
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Vector Spaces 

Linear algebra is the study of vector spaces and their homomorphisms, with applications 
to systems of linear equations. From now on, we are going to assume that most readers 
have had some course involving matrices, perhaps only with real entries or with complex 
entries. Such courses often deal mainly with computational aspects of the subject, such as 
Gaussian elimination, and finding inverses, determinants, eigenvalues, and characteristic 
polynomials of matrices, but here we do not emphasize this important aspect of linear 
algebra. Instead, we discuss more theoretical properties of vector spaces (with scalars in 
any field) and linear transformations (which are homomorphisms between vector spaces). 

Dimension is a rather subtle idea. We think of a curve in the plane, that is, the image 
of a continuous function /: R —>• R 2 , as a one-dimensional subset of a two-dimensional 
ambient space. Imagine the confusion at the end of the nineteenth century when a “space¬ 
filling curve’’ was discovered: There exists a continuous function /: R -» R 2 with image 
the whole plane! We are going to describe a way of defining dimension that works for 
analogs of euclidean space, called vector spaces (there are topological ways of defining 
dimension of more general spaces). 

Definition. If k is a field, then a vector space over k is an (additive) abelian group V 
equipped with a scalar multiplication ; that is, there is a function k x V —> V , denoted by 
(a, v) h* av, such that, for all a.b, 1 e k and all u, v e V, 

(i) a(u + v) — au + av; 

(ii) (a + b)v = av + bv, 

(iii) ( ab)v = a{bv)\ 

(iv) lv = v. 

The elements of V are called vectors and the elements of k are called scalars} 3 

Example 3.69. 

(i) Euclidean space V — R" is a vector space over R. Vectors are u-tuples (ai ,,.., a n ), 
where a,- e R for all;. Picture a vector v as an arrow from the origin to the point having 
coordinates (ai,..., a n ). Addition is given by 

(at, , a„) + (b i,..., b„) — (at + b\, - a„ + b n )\ 

geometrically, the sum of two vectors is described by the parallelogram law. 

Scalar multiplication is given by 

av — a(a i,..., a n ) — (aa i,..., aa n ). 

1 'The word vector comes from the Latin word meaning “to carry”; vectors in euclidean space carry the data of 
length and direction. The word scalar comes from regarding v i—» av as a change of scale. The terms scale and 
scalar come from the Latin word meaning “ladder,” for the rungs of a ladder are evenly spaced. 
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Scalar multiplication u h> at “stretches” v by a factor |a|, reversing its direction when a 
is negative (we put quotes around stretches because an is shorter than v when W < l). 

(ii) The example in part (i) can be generalized. If k is any field, define V = k", the set of 
all -tuples v — (a i,..., a„ ), where a,- e k for all i. Addition is given by 

(at,- a n ) + ib i,- b n ) = (cq + b \,..., a n + b n ), 

and scalar multiplication is given by 

av = a(a i,..., a n ) = (aa i,..., aa n ). 

(iii) If R is a commutative ring and k is a subring that is a field, then R is a vector space 
over k. Regard the elements of R as vectors and the elements of k as scalars; define scalar 
multiplication av , where a e k and v e R, to be the given product of two elements in R. 
Notice that the axioms in the definition of vector space are just particular cases of some of 
the axioms holding in the commutative ring R. 

For example, if k is a field, then the polynomial ring R — k[x] is a vector space over k. 
Vectors are polynomials fix), scalars are elements a e k, and scalar multiplication gives 
the polynomial af(x); that is, if 

fix) = b„x n H-1- b\x + b 0 , 

then 

afix) = ab n x n + ■ ■ ■ + ab\x + abo- 

In particular, if a field k is a subfield of a larger field E, then £ is a vector space 
over k. a 

A subspace of a vector space V is a subset of V that is a vector space under the addition 
and scalar multiplication in V. 

Definition. If V is a vector space over a field k, then a subspace of V is a subset U of V 
such that 

(i) 0 eU\ 

(ii) u,u' e U imply u + u' e U ; 

(iii) u e U and a e k imply au e U. 


Example 3.70. 

(i) The extreme cases U — V and U — {0} (where {0} denotes the subset consisting of 
the zero vector alone) are always subspaces of a vector space. A subspace U c V with 
U V is called a proper subspace of V; we may write U C V to denote U being a proper 
subspace of V. 
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(ii) If v = (ai,..., a n ) is a nonzero vector in R", then the line through the origin 

l = {an : a e R} 

is a subspace of R". 

Similarly, a plane through the origin consists of all vectors of the form avi +bv 2 , where 
v\, ih is a fixed pair of noncollinear vectors, and a, b vary over R. It is easy to check that 
planes through the origin are subspaces of R". 

(iii) If m < n and R'” is regarded as the set of all those vectors in R" whose last n — m 
coordinates are 0, then R'" is a subspace of R". For example, we may regard the plane R 2 
as all points (x, y, 0) in R 3 . 

(iv) If k is a field, then a homogeneous linear system over k of m equations in 11 unknowns 
is a set of equations 

<211*1 H-h CllnXn = 0 

<221*1 H-h <22«*n = 0 

< 2 mi 1 *i d - * * * T a mn x n — 0, 

where aji e k. A solution of this system is a vector (c\,... ,c n ) e k n , where JT ajici = 0 
for all j ; a solution (ci,..., c„) is nontrivial if some c,- ^ 0. The set of all solutions forms 
a subspace of k’\ called the solution space (or nullspace) of the system. 

In particular, we can solve systems of linear equations over I p , where p is a prime. This 
says that we can treat a system of congruences mod p just as one treats an ordinary system 
of equations. 

For example, the system of congruences 

3* — 2y + z = 1 mod 7 
x + y — 2z = 0 mod 7 
—* + 2y + z = 4 mod 7 

can be regarded as a system of equations over the field I 7 . This system can be solved 
just as in high school, for inverses mod 7 are now known: [2][4] = [1]; [3][5] = [1]; 
[6] [6] = [1], The solution is 

(*, y, z) — ([5], [4], [1]). « 


Definition. A list in a vector space V is an ordered set u 1 ...., v n of vectors in V. 

More precisely, we are saying that there is some n > 1 and some function 

(p: {1,2 V, 

with (p(i ) = Vj for all i. Thus, X = im <p\ note that X is ordered in the sense that there is 
a first vector iq, a second vector 1 ) 2 , and so forth. A vector may appear several times on a 
list; that is, (p need not be injective. 
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Definition. Let V be a vector space over a field k. A k-linecir combination of a list 
v\,..., v n in V is a vector v of the form 


v — a n>i H-h a n v n , 

where a,- e k for all i. 

Definition. If X = v \,..* v m is a list in a vector space V, then 

(ifi ? • ■ • . t } m ) > 

the set of all the ^-linear combinations of v\,..., v m , is called the subspace spanned by 
X. We also say that v\, ..., v„ spans (m,..., v m ). 

Lemma 3.71. Let V be a vector space over afield k. 

(i) Every intersection of subspaces ofV is itself a subspace. 

(ii) If X — vi, , v m is a list in V, then the intersection of all the subspaces ofV con¬ 
taining X is (v i,..., v m ), the subspace spanned by v i,..., v m , and so (v i,..., v m ) 
is the smallest subspace of V containing X. 

Sketch of Proof. Part (i) is routine. Let X = {i>i,..., v m |, and let S denote the family of 
all the subspaces of V containing X ; we claim that 

P| S — (vu .... v m ). 

SeS 

The inclusion C is clear, because (i>i,..., v m ) e S. For the reverse inclusion, note that if 
SeS , then S contains v\,... ,v m , and so it contains the set of all linear combination of 
vi -- v m , namely, (t>i, v m ). • 

It follows from the second part of the lemma that the subspace spanned by a list X — 
v\,..., v m does not depend on the ordering of the vectors, but only on the set of vectors 
themselves. Were all terminology in algebra consistent, we would call (i>i,..., v m ) the 
subspace generated by X. The reason for the different terms is that the theories of groups, 
rings, and vector spaces developed independently of each other. 

If X — 0 , then ^' where S is the family of all the subspaces of \' 

containing X. As every subspace contains X — 0 , {0} itself is one of the subspaces 
occurring in the intersection of all the subspaces of V. and so ( 0 ) = Plscy $ — { 0 }- 

Example 3.72. 

(i) Let V = R 2 , let e\ = (1, 0), and let ei — (0, 1). Now V — {e\, ef), for if v = (a. b) e 
V. then 


v = (a, 0) + ( 0 , b) 

= a(l, 0 ) + b(0, 1 ) 

= ae i + be 2 e (e\, ef) ■ 
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(ii) If k is a field and V = k", define e, as the n -tuple having 1 in the ;th coordinate and 0’s 
elsewhere. The reader may adapt the argument in part (i) to show that e\, ..., e n spans k". 

(iii) A vector space V need not be spanned by a finite list. For example, let V = k[x], and 

suppose that X — f\(x),, f m (x) is a finite list in V. If cl is the largest degree of any 
of the fi (x), then every (nonzero) ^-linear combination of f\ (x),..., f m ( x ) has degree at 
most d. Thus, x' /+l is not a ^-linear combination of vectors in X. and so X does not span 
k[x], ◄ 

The following definition makes sense even though we have not yet defined dimension. 

Definition. A vector space V is called finite-dimensional if it is spanned by a finite list; 
otherwise, V is called infinite-dimensional. 

Example 3.72(ii) shows that k" is finite-dimensional, while part (iii) of this Example 
shows that k[x] is infinite-dimensional. By Example 3.69(iii), both R and C are vector 
spaces over Q, and they are both infinite-dimensional. 

Notation. If t>i,..., v m is a list, then v\...., vj..., v m is the shorter list with u,- deleted. 


Proposition 3.73. If V is a vector space, then the following conditions on a list X = 
Vi, ..., i> m spanning V are equivalent: 

(i) X is not a shortest spanning list; 

(ii) some Vj is in the subspace spanned by the others; that is, 

Vi (ft, ..., Vi ,..., v m ), 

(iii) there are scalars ai, ..., a m , not all zero, with 

m 

T, aeve = 0 . 
i= l 


Sketch of Proof, (i) =► (ii). If X is not a shortest spanning list, then one of the vectors in 
X can be thrown out, and the shorter list still spans. 

(ii) =£> (iii). If V; — Ylj+j cjVj, then define a,- = — 1 ^ 0 and aj = cj for all j f i. 

(iii) => (i). The given equation implies that one of the vectors, say, u;, is a linear combi¬ 
nation of the others. Deleting t> ( - gives a shorter list, which still spans: If v e T is a linear 
combination of all the vj (including V/), just substitute the expression for v; as a linear 
combination of the other vj and collect terms. • 
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Definition. A list X = v\,..., v m in a vector space V is linearly dependent if there 
are scalars a \,..., a m , not all zero, with Y1T=i a l v l — 0; otherwise, X is called linearly 
independent. 

The empty set 0 is defined to be linearly independent (we may interpret 0 as a list of 
length 0 ). 

Example 3.74. 

(i) Any list X = v\,..., v m containing the zero vector is linearly dependent. 

(ii) A list i>i of length 1 is linearly dependent if and only if r>i = 0; hence, a list v\ of 
length 1 is linearly independent if and only if i>i f 0 . 

(iii) A list r ], V 2 is linearly dependent if and only if one of the vectors is a scalar multiple 
of the other. 

(iv) If there is a repetition in the list t>i,..., v m (that is, if ty = vi for some i f j). then 
v\,..., v m is linearly dependent: Define c, = 1 ,cj — — 1, and all other c — 0. Therefore, 
if v\, ..., v m is linearly independent, then all the vectors v, are distinct. < 

The contrapositive of Proposition 3.73 is worth stating. 

Corollary 3.75. If X — v\,,v m is a list spanning a vector space V, then X is a 
shortest spanning list if and only ifX is linearly independent. 

Linear independence has been defined indirectly, as not being linearly dependent. Be¬ 
cause of the importance of linear independence, let us define it directly. A list X = 
v\,..., v m is linearly independent if, whenever a ^-linear combination fft'= i a t v t — 0 . 
then every a; = 0. It follows that every sub list of a linearly independent list is itself linearly 
independent (this is one reason for decreeing that 0 be linearly independent). 

We have arrived at the notion we have been seeking. 

Definition. A basis of a vector space V is a linearly independent list that spans V. 

Thus, bases are shortest spanning lists. Of course, all the vectors in a linearly indepen¬ 
dent list ui,..., v n are distinct, by Example 3.74(iv). 

Example 3.76. 

In Example 3.72(h), we saw that X = e \,..., e n spans k", where e, is the /;-tuple having 1 
in the ;th coordinate and 0’s elsewhere. We can easily prove that X is linearly independent, 
and hence it is a basis; it is called the standard basis of k". a 


Proposition 3.77. Let X — vi, ... ,v n be a list in a vector space V over a field k. Then X 
is a basis if and only if each vector in V has a unique expression as a k-linear combination 
of vectors in X. 
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Sketch of Proof. If a vector v = ^ a,u,■ = ^ bj Vj , then — ft,)iq = 0, and so 

independence gives a,- = ft, for all i; that is, the expression is unique. 

Conversely, existence of an expression shows that the list of r>, spans. Moreover, if 
0 — Vj with not all c,- = 0, then the vector 0 does not have a unique expression as a 
linear combination of the v,. • 


Definition. If X = iq,..., v n is a basis of a vector space V and if v € V, then there 
are unique scalars a\, ..., a n with v = ^" =1 aiVi. The n -tuple (ai,..., a n ) is called the 
coordinate set of a vector v e V relative to the basis X. 

Observe that if v\,... ,v n is the standard basis of V = ft", then this coordinate set 
coincides with the usual coordinate set. 

If tq,..., v n is a basis of a vector space V over a field ft, then each vector v e V has a 
unique expression 

v = a\V\+ Q2V2 H-h a n v„, 

where a, e ft for all i. Since there is a first vector iq, a second vector in, and so forth, the 
coefficients in this ft-linear combination determine a unique n -tuple (a\ .cn.... ■ «„)• Were 
a basis merely a subset of V and not a list (i.e., an ordered subset), then there would be n\ 
coordinate sets for every vector. 

We are going to define the dimension of a vector space V to be the number of vectors 
in a basis. Two questions arise at once. 

(i) Does every vector space have a basis? 

(ii) Do all bases of a vector space have the same number of elements? 

The first question is easy to answer; the second needs some thought. 

Theorem 3.78. Every finite-dimensional vector space V has a basis. 

Sketch of Proof. A finite spanning list X exists, since V is finite-dimensional. If it is 
linearly independent, it is a basis; if not, X can be shortened to a spanning sublist X', by 
Proposition 3.73. If X' is linearly independent, it is a basis; if not, X' can be shortened 
to a spanning sublist X". Eventually, we arrive at a shortest spanning sublist, which is 
independent and hence is a basis. • 

The definitions of spanning and linear independence can be extended to infinite lists in a 
vector space, and we can then prove that infinite-dimensional vector spaces also have bases 

(see Theorem 6.48). For example, it turns out that a basis of ft[x] is 1, x, x ,..., x n , _ 

We can now prove invariance of dimension, one of the most important results about 
vector spaces. 

Lemma 3.79. Let u\,... ,u n be elements in a vector space V, and let v\,..., v m e 
(mi, ..., m„). Ifm > n, then tq, ..., v m is a linearly dependent list. 
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Proof. The proof is by induction on n > 1. 

Base Step. If n = 1 , then there are at least two vectors v i . vj and v\ = a\it \ and V2 = 02112. 
If mi = 0, then r>i = 0 and the list of v’s is linearly dependent. Suppose u 1 f 0. We may 
assume that v\ f 0, or we are done; hence, a\ f 0. Therefore, v \, ih is linearly dependent, 
for i >2 — 02a f 1 vi = 0, and hence the larger list vi,..., v m is linearly dependent. 

Inductive Step. There are equations, for i — 1,..., m, 


Vj = OiiU l -\ -h Oj n U n . 


We may assume that some an 0, otherwise tq,..., v m e (u 2 , ..., m„), and the inductive 
hypothesis applies. Changing notation if necessary (that is, by re-ordering the v’s). we may 
assume that an ^ 0. For each i > 2, define 

v'j = Vj - onafjVi e (u 2 ,.... u n ) 

(writing v[ as a linear combination of the u’s, the coefficient of u\ isa,i — (anafj)an = 0 ). 
Since m — 1 > n — 1, the inductive hypothesis gives scalars 1)2, ■■■ ■ b m , not all 0, with 

^ 2^2 + • • • + b m v' m — 0 . 

Rewrite this equation using the definition of : 

(- y^bianaf\)vi + b 2 v 2 H-b b m v m = 0 . 

1 >2 

Not all the coefficients are 0, and so iq, ..., v m is linearly dependent. • 

The following familiar fact illustrates the intimate relation between linear algebra and 
systems of linear equations. 

Corollary 3.80. A homogeneous system of linear equations, over a field k, with more 
unknowns than equations has a nontrivial solution. 

Proof. An n-tuple {fi [,..., f n ) is a solution of a system 

<*11*1 H-b OllnXn = 0 


T * * * T OlmnXn — 0 

if anfi + • • • + cti n l3 n — 0 for all i. In other words, if ci, ..., c n are the columns of the 
m x n coefficient matrix A = [or,y], then 

fi\C\ + ■ ■ ■ + fi n c n — 0 . 

Note that c,- e k m . Now k m can be spanned by m vectors (the standard basis, for example). 
Since n > m, by hypothesis. Lemma 3.79 shows that the list ci,..., c n is linearly depen¬ 
dent; there are scalars y\,.... y n , not all zero, with yici + • • • + y n c n — 0. Therefore, 
(yi ,..., y n ) is a nontrivial solution of the system. • 
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Theorem 3.81 (Invariance of Dimension). IfX = x\, ... , x n and Y = y {,..., y m are 
bases of a vector space V, then m = n. 

Proof. If m f n, then either n < m or m < n. In the first case, y\,...,y m e 
(xi,..., x n }, because X spans V. and Lemma 3.79 gives Y linearly dependent, a con¬ 
tradiction. A similar contradiction arises if m < n, and so we must have m — n. • 

It is now permissible to make the following definition. 

Definition. If V is a finite-dimensional vector space over a field k, then its dimension. 
denoted by dim^fV) or dim(V), is the number of elements in a basis of V. 


Example 3.82. 

(i) Example 3.76 shows that k n has dimension n. which agrees with our intuition when 
k — R. Thus, the plane R x 1 is two-dimensional! 

(ii) If V — {0}, then dim(V) = 0, for there are no elements in its basis 0. (This is a good 
reason for defining 0 to be linearly independent.) 

(iii) Let X = {xi,,.., x „} be a finite set. Define 

k x — {functions /: X —» k}. 

Now k x is a vector space if we define addition / + /' to be 

f + f':x»f(x) + f'(x) 

and scalar multiplication af, for a e k and /: X -» k, by 

af: x af lx). 


It is easy to check that the set of n functions of the form f x , where x e X. defined by 


fxiy) = 


if y = x; 
if y / x, 


form a basis, and so dim(k' Y ) = n = |X|. 

The reader should note that this is not a new example: An /;-tuple (a\,..., a n ) is really 
a function /: {1,...,«} -» k with f{i) = a, for all i. Thus, the functions f x comprise 
the standard basis. ◄ 


Here is a second proof of invariance of dimension; it will be used, in Chapter 6 , to 
generalize the notion of dimension to the notion of transcendence degree. We begin with 
a modification of the proof of Proposition 3.73. 


Lemma 3.83. If X — v\,..., v n is a linearly dependent list of vectors in a vector space 
V, then there exists v r with r > 1 with v r e (i>i, i> 2 ,..., v r -\) [when r — 1, we interpret 
(i>l, ..., IV- 1 ) to mean {0}]. 
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Remark. Let us compare Proposition 3.73 with this one. The earlier result says that if 
v\, i> 2 , t>3 is linearly dependent, then either V] e ( 112 , V 3 ) , V 2 e (i>i, U3}, or U3 e (ui, vf). 
This lemma says that either r>i e {0}, V 2 e (i>i), or U 3 e (i>i, U 2 }. ◄ 


Proof. Let r be the largest integer for which ui,..., v r -1 is linearly independent. If 
i>i = 0, then ui e {0}, and we are done. If v\ 0, then r > 2; since uj, in, ■ ■ ■ i v n is 
linearly dependent, we have r — 1 < n. As r — 1 is largest, the list v\, t> 2 , ..., v r is linearly 
dependent. There are thus scalars a\,..., a r , not all zero, with a\vi + • • • + a r v r — 0. 
In this expression, we must have a r f 0, for otherwise v \,..., u,_i would be linearly 
dependent. Therefore, 


r— 1 

Vr — ^2 (-af l )ciiVj e (Vi, - v r ~\). • 

i=t 

Lemma 3.84 (Exchange Lemma). If X = x\,..., x m is a basis of a vector space V 
and y\,... ,y n is a linearly independent subset ofV, then n < m. 

Proof. We begin by showing that one of the x’s in X can be replaced by y n so that the 
new list still spans V. Now y n e (X), since X spans V , so that the list 

y n , a 1 ,.,., x m 

is linearly dependent, by Proposition 3.73. Since the list yi, ..,, y„ is linearly independent, 
y„ f {0}. By Lemma 3.83, there is some i with Xj — ay n + a j x j- Throwing out a,- 
and replacing it by y„ gives a spanning list 

X' = y n ,x\,... Si, ■ ■ -,x m : 

If v = 1 bj x j> then (as in the proof of Proposition 3.73), replace x/ by its expression 

as a £-1 inear combination of the other x’s and y„, and then collect terms. 

Now repeat this argument for the spanning list 1 , y„, X|,.... x,-,..., x m . The options 
offered by Lemma 3.83 for this linearly dependent list are y n e {y n ~\), x\ e {y n -l, yn ), 
X 2 e ( 33 ,- 1 , y„, xi), and so forth. Since Y is linearly independent, so is its sublist y„ 1 , y n , 
and the first option y n e (y„~ 1 ) is not feasible. It follows that the disposable vector 
(provided by Lemma 3.83) must be one of the remaining x’s, say xt. After throwing out 
we have a new spanning list X" . Repeat this construction of spanning lists; each time a 

new y is adjoined as the first vector, an x is thrown out, for the option 33 e ( 33+1 __ y n ) 

is not feasible. If n > 111 . that is, if there are more y’s than x’s, then this procedure ends 
with a spanning list consisting of m y’s (one for each of the m x’s thrown out) and 
no x’s. Thus a proper sublist of Y = 33 ...., y„ spans V, and this contradicts the linear 
independence of Y. Therefore, n < m. • 
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Theorem 3.85 (Invariance of Dimension). IfX = x\,, x m and Y = y \...., y n are 

bases of a vector space V, then m = n. 

Proof. By Lemma 3.84, viewing X as a basis with m elements and and Y as a linearly 
independent list with n elements gives the inequality n < m; viewing Y a basis and X 
as a linearly independent list gives the reverse inequality m < n. Therefore, m — n, as 
desired. • 


Definition. A longest (or a maximal ) linearly independent list u\ . u m is a linearly 

independent list for which there is no vector v e V such that u\,... ,u m ,v is linearly 
independent. 


Lemma 3.86. If V is a finite-dimensional vector space, then a longest linearly indepen¬ 
dent list vi,. .., v n is a basis ofV. 

Sketch of Proof. If the list is not a basis, then it does not span: There is w <= V with 
w £ (i’ i,.... v n }. But the longer list with w adjoined is linearly independent, by Proposi¬ 
tion 3.73. • 

It is not obvious that there are any longest linearly independent lists; that they do exist 
follows from the next result, which is quite useful in its own right. 

Proposition 3.87. Let Z = u \...., u m be a linearly independent list in an n-dimensional 
vector space V. Then Z can be extended to a basis ; i.e., there are vectors v m + 1 ,... ,v n so 
that mi, ..., u m , ..., v„ is a basis ofV. 

Sketch of Proof. If the linearly independent list Z does not span V. there is wj e V 
with w i (Z), and the longer list Z, w\ is linearly independent, by Proposition 3.73. If 
Z, wi does not span V, there is u >2 e V with u >2 f (Z, w\). Since dim(V) = n, the 
length of these lists can never exceed n. Otherwise, compare a linearly independent list 
with n + 1 elements with a basis, and reach a contradiction using the exchange lemma. 
Lemma 3.84. • 

Corollary 3.88. If d\m(V ) = n, then any list of n + 1 or more vectors is linearly depen¬ 
dent. 

Sketch of Proof. Otherwise, such a list could be extended to a basis having too many 
elements. • 

Corollary 3.89. Let V be a vector space with dim(V) = n. 

(i) A list of n vectors that spans V must be linearly independent. 

(ii) Any linearly independent list of n vectors must span V. 
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Sketch of Proof, (i) Were it linearly dependent, then the list could be shortened to give a 
basis, and this basis is too small. 

(ii) If the list does not span, the it could be lengthened to give a basis, and this basis is too 
large. • 


Corollary 3.90. Let U be a subspace of a vector space V of dimension n. 

(i) U is finite-dimensional and dim(f/) < dim(V). 

(ii) //dim(f/) = dim(V), then U — V. 

Sketch of Proof, (i) Take u\ e U . If U — (mi), then U is finite-dimensional. Otherwise, 
there is u 2 f (mi). By Proposition 3.73, u \, 112 is linearly independent. If U — (mi, M 2 ), 
we are done. This process cannot be repeated n + 1 times, for then mi,..., m„ + 1 would be 
a linearly independent list in U C V, contradicting Corollary 3.88. 

A basis of U is linearly independent, and so it can be extended to a basis of V. 

(ii) If dim(f/) = dim(V), then a basis of U is already a basis of V (otherwise it could be 
extended to a basis of V that would be too large). • 


Exercises 

3.67 If the only subspaces of a vector space V are (0) and V itself, prove that dim(V) < 1. 

3.68 Prove, in the presence of all the other axioms in the definition of vector space, that the com¬ 
mutative law for vector addition is redundant; that is, if V satisfies all the other axioms, then 
u + v = v + u for all u , v e V. 

Hint. If u. v e V, evaluate — [(— v) + (— u)] in two ways. 

3.69 If V is a vector space over I 2 and if tq f tq are nonzero vectors in V, prove that v\, vj is 
linearly independent. Is this true for vector spaces over any other field? 

3.70 Prove that the columns of an m x n matrix A over a field k are linearly dependent in k m if and 
only if the homogeneous system Ax = 0 has a nontrivial solution. 

3.71 If U is a subspace of a vector space V over a field k. define a scalar multiplication on the 
quotient group V/U by 

a(v + U) = av + U, 

where a e k and v e V. Prove that this is a well-defined function that makes V/U into a 
vector space over k (V/U is called a quotient space). 

3.72 If V is a finite-dimensional vector space and U is a subspace, prove that 

dim(I/) + dim( V/U) = dim(V). 

Hint. Prove that if iq + U . v r + U is a basis of V/U, then the list tq,..., v r is linearly 

independent. 
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Definition. If U and W are subspaces of a vector space V. define 

U + W = {u + w : u e U and w e W}. 

3.73 (i) Prove that U + W is a subspace of V. 

(ii) If U and U' are subspaces of a finite-dimensional vector space V, prove that 

dim(t/) + dim(H') = dim(t/ n U') + dim (U + U'). 

Hint. Take a basis of U IT U' and extend it to bases of U and of U'. 


Definition. If U and W are vector spaces over a field k, then their direct sum is the set of all ordered 
pairs, 

U © W = {(u, w) : u e U and w e W}, 


with addition 


( u , w) + (u , u>') = {u + u , w + w) 


and scalar multiplication 


o/(u, w ) = (au, aw). 

3.74 If U and W are finite-dimensional vector spaces over a field k, prove that 

dim(t7 © W) = dim((7) + dim(lV). 


Linear Transformations 

Homomorphisms between vector spaces are called linear transformations. 

Definition. If V and W are vector spaces over a field k, then a function T : V -» W is a 
linear transformation if, for all vectors it, v e V and all scalars a e k. 

(i) T(u + v) = T(u) + T(v); 

(ii) T ( av ) = aT(v). 

We say that a linear transformation T is nonsingular (or is an isomorphism) if 7’ is a 
bijection. Two vector spaces V and W over k are isomorphic , denoted by V = W. if there 
is a nonsingular linear transformation T: V -* W. 

If we forget the scalar multiplication, then a vector space is an (additive) abelian group 
and a linear transformation T is a group homomorphism. It is easy to see that T preserves 
all ^-linear combinations: 


T{a\v\ H-b a m v m ) = a\T{v\) H-b a m T(v m ). 



172 


Commutative Rings I Ch. 3 


Example 3.91. 

(i) The identity function 1 y: V —>■ V on any vector space V is a nonsingular linear trans¬ 
formation. 

(ii) If 6 is an angle, then rotation about the origin by 6 is a linear transformation Rg : IK 2 — > 
R 2 . The function Rg preserves addition because it takes parallelograms to parallelograms, 
and it preserves scalar multiplication because it preserves the lengths of arrows. 

(iii) If V and W are vector spaces over a field k, write Homd V, W ) for the set of all linear 
transformations V -* W. Define addition S + 7’ by v i->- S(v) + T(v) for all v e V , 
and define scalar multiplication aT: V —*■ W, where a e k. by v i-> aT(v) for all 
tel. Both S + T and aT axe linear transformations, and Hom/TT, W ) is a vector space 
over k. a 


Definition. If V is a vector space over a field k, then the general linear group , denoted 
by GL(V), is the set of all nonsingular linear transformations V —> V. 

A composite ST of linear transformations S and T is again a linear transformation, 
and ST is nonsingular if both S and T are; moreover, the inverse of a nonsingular linear 
transformation is again nonsingular. It follows that GL(V ) is a group with composition as 
operation, for composition of functions is always associative. 

We now show how to construct linear transformations T: V -* W, where V and W are 
vector spaces over a field k. The next theorem says that there is a linear transformation that 
does anything to a basis. 

Theorem 3.92. Let v\,..., v n be a basis of a vector space V over a field k. If W is 
a vector space over k and u\,... ,u n is a list in W, then there exists a unique linear 
transformation T : V —»■ W with T (i>,) = u, for all i. 

Proof By Theorem 3.77, each v e V has a unique expression of the form v = ; ai Vj. 

and so T: V —>• W. given by T(v) = «/«/, is a (well-defined!) function. It is now a 

routine verification to check that T is a linear transformation. 

To prove uniqueness of T, assume that S: V W i s a linear transformation with 


S(Vi) = m = T(vj ) 


for all i. If v e V, then v = a i v i an d 

S(v) = S(J2 atvt) 

— ^2 S(a,Vi) 

= ^ aiS(vi) 

= J2ai T ( v i) = T (v)- 




Since v is arbitrary, S = T. 
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Corollary 3.93. If two linear transformations S, T: V —> W agree on a basis, then 
S — T. 

Proof This follows at once from the uniqueness of the defined linear transformation. • 
Linear transformations defined on k" are easy to describe. 

Proposition 3.94. If T : k n -»■ k m is a linear transformation, then there exists an m x n 
matrix A such that 

T (y) = Ay 

for all y e k n (here, y is an n x 1 column matrix and Ay is matrix multiplication ). 

Sketch of Proof If e\, ..., e n is the standard basis of k n and e\, ... , e' m is the standard 
basis of k m , define A = [atj] to be the matrix whose jth column is the coordinate set of 
T(ej). If S: k n —»■ k m is defined by S(y) = Ay, then S — T because both agree on a 
basis: T(ef) = a ij e i = Ae j- • 

Theorem 3.92 establishes the connection between linear transformations and matrices, 
and the definition of matrix multiplication arises from applying this construction to the 
composite of two linear transformations. 

Definition. Let X — v \,..., v n be a basis of V and let Y — w i,..., w m be a basis of 
W. If T : V —> W is a linear transformation, then the matrix of T is the m x n matrix 
A = [a,j] whose jth column a\j, a 2 j, ..., a m j is the coordinate set of T(vj) determined 
by the id’s: T(vj) = YllLi a ij w i■ The matrix A does depend on the choice of bases X 
and Y ; we will write 

a = y [T] x 

when it is necessary to display them. 

In case V = W, we often let the bases X = v\,... ,v„ and w \,..., w m coincide. If 
ly: V -» V, given by v v, is the identity linear transformation, then x[L']x is the 
n x n identity matrix /„ (usually, the subscript n is omitted), defined by 

I - lSij ], 

where S,j is the Kronecker delta. Thus, I has l’s on the diagonal and 0’s elsewhere. On 
the other hand, if X and Y are different bases, then y[ly]x is not the identity matrix; its 
columns are the coordinate sets of the x’s with respect to the basis Y. 

Example 3.95. 

Let T: V -> W be a linear transformation, and let X = vi ,..., v n and Y = w\. ..., w m 
be bases of V and W, respectively. The matrix for T is set up from the equation 


T(vj) — aijwi + a 2 jW 2 H-b a m jw m . 
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Why are the indices reversed? Why not write 


T{Vj) = Clj\W\ +dj2V02 H-b UjmWml 

Consider the following example. Let A be an m x n matrix over a field k. The function 
T: k n —> k m , defined by T(X ) = AX, where X is an n x 1 column vector, is a linear 
transformation. If e \, ..., e„ and e[,, e' m are the standard bases of k" and k m , respec¬ 
tively, then the definition of matrix multiplication says that T (e j) = Aej is the /'th column 
of A. But 

Aej = Qije\ + a 2 je'^ "T ''' + a mje' m . 

Therefore, the matrix associated to T is the original matrix A. 

In Proposition 3.98, we shall prove that matrix multiplication arises from composition 
of linear transformations. If T : V —> W has matrix A and S: W -> U has matrix B, then 
the linear transformation ST: V -> U has matrix BA. Had we defined matrices of linear 
transformations by making coordinate sets rows instead of columns, then the matrix of ST 
would have been AB. < 

Example 3.96. 

(i) Let T: R 2 -» R 2 be rotation by 90° 

X = ( 1 , 0 ), ( 0 , 1 ) is 

x[T] X 

However, if Y = (0, 1), (1, 0), then 


(ii) Let k be a field, let T: V —> V be a linear transformation on a two-dimensional 
vector space, and assume that there is some vector v e V with T(v) not a scalar multiple 
of v. The assumption on v says that the list X = v,T(v) is linearly independent, by 
Example 3.74(iii), and hence it is a basis of V [because dim(V) = 2], Write vi = v and 
V 2 — Tv. 

We compute x[T]x- 


. The matrix of T relative to the standard basis 

_ 0 - 1 " 

1 O' 


T (i>i) = i >2 and T ( V 2 ) — a m + bv 2 


for some a, b e k. We conclude that 


x[T] x 


0 a 
1 b 


The following proposition is a paraphrase of Theorem 3.92. 


Proposition 3.97. Let V and W be vector spaces over afield k, and let X — v\,.... v„ 
and Y = u>i,..., w m be bases ofV and W, respectively. If Hom^fV, W) denotes the set 
of all linear transformations T : V —> W, and Mat mx «(^) denotes the set of all m x n 
matrices with entries in k, then the function T 1 —> y[T]x is a bijection Hom/AL, W) —> 
Mat mx „ (k). 
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Proof. Given a matrix A. its columns define vectors in W ; in more detail, if the /th 
column of A is (a\j, ..., a m j). define Zj = Y1'IL\ a ij w i ■ By Theorem 3.92, there exists 
a linear transformation T: V —> W with T(vj) = Zj and y[T]x = A. Therefore, p. is 
suijective. 

To see that // is injective, suppose that Y [T] X — A — y[S]x■ Since the columns of A 
determine T (vj) and S(Vj) for all j, Corollary 3.93 gives S — T. • 

The next proposition shows where the definition of matrix multiplication comes from: 
the product of two matrices is the matrix of a composite. 


Proposition 3.98. Let T : V -> W and S : W —> U be linear transformations. Choose 
bases X = x\ ,...,x„ofV, Y = y m of W, and Z = zi ,... ,zi of U. Then 

z[S°T} x = ( z [S} y )( y [T} x ). 

Proof. Let y[T] x = [a^], so that T(xj) = J2 p a p jy p . and let Z [S] Y — [b qp ], so that 
S(y p ) = Y.q bqpZq. Then 

ST(xj) = S(T(xj)) - S(Y^a p jy p ) 

p 

= ^2 a PjS(yp} — ^2^2 a PjbqpZq = y~! C qj Z q’ 
p p q q 

where c q j — ^Z p b qp a p j. Therefore, 

z[ST] x = [Cqj] = z [S] yy [T] x . . 


Corollary 3.99. Matrix multiplication is associative. 

Proof. Let A be an m x n matrix, let B be an n x p matrix, and let C he a p x q matrix. 
By Theorem 3.92, there are linear transformations 

l^q pp Jf. k n _ 5 . k m 


with C = [T], B = [5], and A — [/?]. 

Then 

[Bo (So T)] = L/?][S o T] = [/?](mm) = A(BC). 
On the other hand, 

[(* oS)oT] = [Ro S][r] = ([/?][S])[r] = (AB)C. 
Since composition of functions is associative, 

Ro(SoT) = (R oS)oT. 


and so 


A(BC) = [Ro (S o T)] = [(R o S) o T] = ( AB)C. • 


We can prove Corollary 3.99 directly, although it is rather tedious, but the connection 
with composition of linear transformations is the real reason why matrix multiplication is 
associative. 
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Corollary 3.100. Let T : V —> W be a linear transformation of vector spaces V over a 
field k, and let X and Y be bases of V and W, respectively. If T is nonsingular, then the 
matrix ofT~ l is the inverse of the matrix ofT: 

x[T- 1 ]y = (y[T]x)- 1 - 

Proof I — y [1\v]y — y[T]xx[T~ 1 ]y and / = x[^v]x — x[T~ X ]yy[T]x- • 


The next corollary determines all the matrices arising from the same linear transforma¬ 
tion. 

Corollary 3.101. Let T : V —> V be a linear transformation on a vector space V over a 
field k. If X and Y are bases ofV, then there is a nonsingular matrix P with entries in k 
so that 

y[T] y = P(x[T] x )P~ l - 

Conversely, if B — PAP -1 , where B, A, and P are n x n matrices with entries in k and 
P is nonsingular, then there is a linear transformation T : k n —> k n and bases X and Y of 
k n such that B = y[T]y and A — x[T]x- 

Proof The first statement follows from Proposition 3.98 and associativity: 

y[T] y = y[1 V T1 v ]y = (y[lvk)(x[rk)Or[l v ]y). 

Set P — y[ ly]x- an d n °te that Corollary 3.100 gives P~ l — ^[1 y]y- 

For the converse, let £ — e\,... ,e n be the standard basis of k", and define T: k n k" 
by T(ej) = Aej (remember that vectors in k" are column vectors, so that Aej is matrix 
multiplication; indeed, Aej is the /th column of A). It follows that A — e[T]e- Now 
define a basis Y = y n by yj = P~ x ef, that is, the vectors in Y are the columns 

of P . Note that Y is a basis because P -1 is nonsingular. It suffices to prove that B — 
y[T]y\ that is, T(yj) = bijyt, where B = [b,j]. 

T(yj ) - A Yj 

= AP~ l ej 
= P~ i Be j 

= p ~ l J2 bi J e ‘ 

i 

— ^ bjj P Ci 
i 

= J2 bi j yi • 

i 
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Definition. Two n x n matrices B and A with entries in a field k are similar if there is a 
nonsingular matrix P with entries in k with B = PAP~ l . 

Corollary 3.101 says that two matrices arise from the same linear transformation on a 
vector space V (from different choices of basis) if and only if they are similar. In Chapter 
9, we will see how to determine whether two given matrices are similar. 

Just as for group homomorphisms and ring homomorphisms, we can define the kernel 
and image of linear transformations. 


Definition. 

of T is 


If T : V -» W is a linear transformation, then the kernel (or the null space ) 
ker T = {v e V : T(v) = 0}, 


and the image of T is 


im T — [w e W : w = T ( v ) for some v e V}. 

As in Proposition 3.94, an m x n matrix A with entries in a field k determines a linear 
transformation k n —> k m , namely, y Ay, where y is an n x 1 column vector. The 
kernel of this linear transformation is usually called the solution space of A [see Exam¬ 
ple 3.70(iv)]. 

The proof of the next proposition is routine. 

Proposition 3.102. Let 7 : V —> W be a linear transformation. 

(i) ker T is a subspace of V and im T is a subspace of W. 

(ii) T is injective if and only if ker T = {0}. 

We can now interpret the fact that a homogeneous system over a field k with r equations 
in n unknowns has a nontrivial solution if r < n. If A is the r x n coefficient matrix of 
the system, then <p : x i-»- Ax is a linear transformation <p\ k" -> k r . If there is only the 
trivial solution, then ker <p = { 0 }, so that k n is isomorphic to a subspace of k r , contradicting 
Corollary 3.90(i). 

Lemma 3.103. Let T : V —>■ W be a linear transformation. 

(i) IfT is nonsingular, then for every basis X = v\,V 2 , ■ ■ ■ ,v n ofV, we have T (X) = 

T T (v 2), ...,T ( v n ) a basis ofW. 

(ii) Conversely, if there exists some basis X — v\, V 2 , ,v n of V for which T (X) = 
T[v\), T (V 2 ), ..., T (v„) is a basis ofW, then T is nonsingular. 

Proof, (i) If '}fciT(vi) = 0, then T (Cj 1 , 7 ) = 0, and so c; i>j e ker 7’ = {0}. Hence 
each c,- = 0, because X is linearly independent. If w e W, then the surjectivity of T 
provides v e V with w — T(v). But v — ’‘ffajVi, and so w — T(v) — TCffaiVi) = 
aj T(Vi). Therefore, T (X) is a basis of W. 
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(ii) Let w e W. Since 7\iq), ..., T(v n ) is a basis of W, we have w = CjT(vj ) = 
T(J2dVi), and so T is surjective. If e ker 7’, then c,7’(u,) = 0, and so linear 

independence gives all q = 0; hence, = 0 and ker 7’ = {0}. Therefore, T is 

nonsingular. • 

Theorem 3.104. If V is an n-dimensional vector space over afield k, then V is isomor¬ 
phic to k n . 

Proof Choose a basis iq, ..., v n of V. If e\,..., e„ is the standard basis of k n , then 
Theorem 3.92 says that there is a linear transformation 7’: V —» k" with T (u,-) = e, for 
all i ; by Lemma 3.103, T is nonsingular. • 

Theorem 3.104 does more than say that every finite-dimensional vector space is es¬ 
sentially the familiar vector space of all n -tuples. It says that a choice of basis in V is 
tantamount to a choice of coordinate set for each vector in V. We want the freedom to 
change coordinates because the usual coordinates may not be the most convenient ones for 
a given problem, as the reader has probably seen (in a calculus course) when rotating axes 
to simplify the equation of a conic section. 

Corollary 3.105. Two finite-dimensional vector spaces V and W over a field k are iso¬ 
morphic if and only if dim(V) = dim(W). 


Remark. In Theorem 6.51, we will see that this corollary remains true for infinite¬ 
dimensional vector spaces. -* 

Proof Assume that there is a nonsingular T : V -> W. If X = iq, ..., v„ is a basis of 
V. then Lemma 3.103 says that T (iq),..., T ( v n ) is a basis of W. Therefore, dim(W) = 
|X| = dim(V). 

If n — dim(V) = dim(W), then there are isomorphisms T: V —> k" and S: W k", 
by Theorem 3.104. It follows that the composite S~ l T: V W is nonsingular. • 


Proposition 3.106. Let V be a finite-dimensional vector space with d i m ( V) — n, and let 
T : V —> V be a linear transformation. The following statements are equivalent: 

(i) T is an isomorphism; 

(ii) T is surjective; 

(iii) T is injective. 

Proof, (i) =q> (ii) This implication is obvious. 

(ii) => (iii) Let iq,..., v n be a basis of V. Since T is surjective, there are vectors u\, ... ,u n 
with Tuj — Vi for all i. We claim that u\,..., u n is linearly independent. If there are 
scalars c i,..., c „, not all zero, with 'ffcjUj = 0, then we obtain a dependency relation 
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0 = J2 c iT(ui) = Ci i>i , a contradiction. By Corollary 3.89(ii), u \...., u„ is a basis 
of V. To show that T is injective, it suffices to show that ker 7’ = {0}. Suppose that 
T(v) — 0. Now v — CjUi , and so 0 = T c,w,• = hence, linear independence 

of v\,..., v n gives all c, = 0, and so v = 0. Therefore, T is injective. 

(iii) => (i) Let vi ,..., v n be a basis of V. If ci,..., c„ are scalars, not all 0, then ^ c,- Vi ^ 
0, for a basis is linearly independent. Since T is injective, it follows that c, T V/ ^ 0, 
and so Tv\,..., Tv n is linearly independent. Therefore, Lemma 3.103(ii) shows that T is 
an isomorphism. • 

Recall that an n x n matrix A with entries in a field k is nonsingular if there is a matrix 
B with entries in k (its inverse ), with AB = I — BA. The next corollary shows that 
“one-sided inverses” are enough. 

Corollary 3.107. If A and B are n x n matrices with AB — I, then BA = I. Therefore, 
A is nonsingular with inverse B. 

Proof. There are linear transformations T , S : k n —> k n with |T] = A and [5] = B, and 
AB — I gives 

[TS] = [T][S] = [1 *-]. 

Since T ho* [7’] is a bijection, by Proposition 3.97, it follows that TS — 1 k n ■ By Propo¬ 
sition 1.47, T is a surjection and S is an injection. But Proposition 3.106 says that 
both T and S are isomorphisms, so that S — T~ l and TS — 1 r n = ST. Therefore, 
I = [5T] = [5][T] = BA, as desired. • 

Definition. The set of all nonsingular n x n matrices with entries in k is denoted by 
GL (n,k). 

Now that we have proven associativity, it is easy to prove that GL(n , k) is a group under 
matrix multiplication. 

A choice of basis gives an isomorphism between the general linear group and the group 
of nonsingular matrices. 

Proposition 3.108. Let V be an n -dimensional vector space over a field k, and let X = 
V],..,, v n be a basis ofV. Then p. : GL(V) —»■ GL(n, k), defined by T i-^ [T] = x[T]x> 
is an isomorphism. 

Proof. By Proposition 3.97, the function p! \ T i-> [T] = x\ L]x is a bijection 

HomGV, V) -> Mat n (k), 

where Hom^ (V,V) denotes the set of all linear transformations on V and Mat,, (k) denotes 
the set of all n x n matrices with entries in k. Moreover, Proposition 3.98 says that | 7\S - ] = 
[T][S] for all T, S e Hom k (V, V). 

If T e GL(V), then | T] is a nonsingular matrix, by Corollary 3.100; that is, if p is the 
restriction of p ', then p ; GL( V) -> GL(n. k ) is an injective homomorphism. 
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It remains to prove that p is surjective. If A e GL {n,k). then A = |T] for some 
T: V -» V. It suffices to show that T is an isomorphism; that is, T e GL(V). Since 
[T] is a nonsingular matrix, there is a matrix B with [T]B — I. Now B = [5] for some 
S : V -> V, and 

[TSi = [T][Si = I = [ l v l 

Therefore, TS — 1 \/, since // is a bijection, and so T e GL(V), by Corollary 3.107. • 

The center of the general linear group is easily identified; we now generalize Exer¬ 
cise 2.56 on page 81. 

Definition. A linear transformation T: V —> V is a scalar transformation if there is 
c e k with T (v) = cv for all v e V ; that is, T = c 1 v • A scalar matrix is a matrix of the 
form cl, where c e k and I is the identity matrix. 

A scalar transformation T — c I y i s nonsingular if and only if c f 0 (its inverse is 

C - 1 ly). 

Corollary 3.109. 

(i) The center of the group GL(V) consists of all the nonsingular scalar transforma¬ 
tions. 

(ii) The center of the group GL (n , k) consists of all the nonsingular scalar matrices. 

Proof, (i) If T e GL(V) is not scalar, then Example 3.96(ii) shows that there exists v e V 
with v, T(v) linearly independent. By Proposition 3.87, there is a basis v, T(v), u 3 ,..., u„ 

of V. It is easy to see that v,v + T ( v ), u 3 . u„ is also a basis of V, and so there is a 

nonsingular linear transformation S with S(v) = v, S(T (i>)) = v + T (v), and S(ui) = m; 
for all i. Now S and T do not commute, for ST(v ) = v + T(v) while TS(v ) = i (v). 
Therefore, T is not in the center of GL(V). 

(ii) If /: G —> H is any group isomorphism between groups G and H , then /(Z(G)) = 
Z(H). In particular, if 7’ = c I v- is a nonsingular scalar transformation, then | T] is in the 
center of GL(n, k). But it is easily checked that [T] = cl is a scalar matrix. • 


Exercises 

3.75 Let V and W be vector spaces over a field k, and let S, T: V —> W be linear transformations. 

(i) If V and W are finite-dimensional, prove that 

dim(Hom^(T, IV)) = dim(V) dim(Vk). 

(ii) The dual space V* of a vector space V over k is defined by 

P* =Hom k (V,k). 

If dim(V) = n. prove that dim(P*) = n. and hence that V* = V. 
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(iii) If X — tq,..., v n is a basis of V, define 8\, ..., 8 n e V* by 


Si(vj) 


|0 if j i 
|l if j = i. 


Prove that , .... 8 n is a basis of V* (it is called the dual basis arising from tq, ..., v n ). 


3.76 If A = 


a 

c 


b 

d 


, define det(A) = ad — be. If V is a vector space with basis X = tq, iq, define 


T: V V by r(iq) = atq + bvi and T(vL) = ciq + dv 2 - Prove that T is a nonsingular 
linear transformation if and only if det(x[T]x) 5^ 0- 

Hint. You may assume the following (easily proved) fact of linear algebra: Given a system 
of linear equations with coefficients in a field, 


ax + by = p 
cx + dy = q , 


then there exists a unique solution if and only if ad — be ^ 0. 

3.77 Let \J be a subspace of a vector space V. 

(i) Prove that the natural map n : V —> V/U, given by u i —v + U, is a linear transfor¬ 
mation with kernel U. (Quotient spaces were defined in Exercise 3.71 on page 170.) 

(ii) State and prove the first isomorphism theorem for vector spaces. 

Hint. Here is the statement. If /: V —>■ IV is a linear transformation with ker / = U, 
then U is a subspace of V and there is an isomorphism <p: V/U = im f , namely, 
<p(v + U) = f(v). 

3.78 Let V be a finite-dimensional vector space over a field k. and let B denote the family of all the 
bases of V. Prove that B is a transitive GL(L)-set. 

Hint. Use Theorem 3.92. 

3.79 (i) If U and W are subspaces of a vector space V such that L/ D VU = {0} and U + W = V, 

prove that V = U © W (see the definition of direct sunt on page 171). 

(ii) A subspace U of a vector space V is a direct summand if there is a subspace W of V 
with U IT W = {0) and U + W = V. If V is a finite-dimensional vector space over a 
field k, prove that every subspace U is a direct summand. 

Hint. Take a basis X of U, extend it to a basis X' of V, and define W — [X' — X). 

3.80 If T : V —>■ W is a linear transformation between vector spaces over a field k, define 

rank(T) = dimfimT). 


(i) Regard the columns of an m x n matrix A as m -tuples, and define the column space of A 
to be the subspace of k m spanned by the columns; define rank(A) to be the dimension of 
the column space. If T: k n —> k m is the linear transformation defined by T(X) — AX, 
where X is an n x 1 vector, prove that 

rank(A) = rank(r). 

(ii) If A is an m x n matrix and B is an p x m matrix, prove that 

rank(BA) < rank(A). 

(iii) Prove that similar n x n matrices have the same rank. 
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3.8 Quotient Rings and Finite Fields 

Let us return to commutative rings. The fundamental theorem of algebra (Theorem 4.49) 
states that every nonconstant polynomial in C[x] is a product of linear polynomials in 
C[x], that is, C contains all the roots of every polynomial in C[x], We are going to prove 
a “local” analog of the fundamental theorem of algebra for polynomials over an arbitrary 
field k: Given a polynomial fix) e k[x], then there is some field K containing k that 
also contains all the roots of f(x) (we call this a local analog for even though the larger 
field K contains all the roots of the polynomial /(x), it may not contain roots of other 
polynomials in k[x]). The main idea behind the construction of K involves quotient rings, 
a construction akin to quotient groups. 

Let I be an ideal in a commutative ring R. If we forget the multiplication, then I is a 
subgroup of the additive group R \ since R is an abelian group, the subgroup I is necessarily 
normal, and so the quotient group R/I is defined, as is the natural map it: R R/I 
given by jz(a) — a + I. Recall Lemma 2.40(i), which we now write in additive notation: 
a + I — b + I in R/1 if and only if a — b e I. 

Theorem 3.110. If I is an ideal in a commutative ring R, then the additive abelian group 
R/I can be made into a commutative ring in such a way that the natural map it : R —> R/I 
is a surjective ring homomorphism. 

Sketch of Proof. Define multiplication on the additive abelian group R/I by 

(a + I)(b + I) = ab + I. 

To see that this is a well-defined function R/I x R/I —> R/I, assume that a + I — a' + I 
and b + I — b' +1, that is, a—a' e I and b — b' e I. We must show that (a' + I)(b' + /) = 
a'b’ + I — ab + I , that is, ab — a'b' e I. But 

ab — a'b 1 = ab — a'b + a'b — a'b' 

= (a — a')b + a'(b — b') e /, 

as desired. 

To verify that R/I is a commutative ring, it now suffices to show associativity and com¬ 
mutativity of multiplication, distributivity, and that one is 1 + I. Proofs of these properties 
are routine, for they are inherited from the corresponding property in R. For example, 
multiplication in R/I is commutative because 

(a + I)(b + I) — ab + I — ba + I — (b + I)(a + I). 

Rewriting the equation (a + I){b + I) = ab + I using the definition of n, namely, 
a + I — 7T(a), gives Tt(a)n(b) = it(ab). Since 7T(1) = 1 + I, it follows that n is a 
ring homomorphism. Finally, n is surjective because a + I = n(a). • 

Definition. The commutative ring R/I constructed in Theorem 3.110 is called the 
quotient ring 14 of R modulo I (briefly, R mod I). 

'^Presumably, quotient rings are so called in analogy with quotient groups. 
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We saw in Example 2.68 that the additive abelian group Z /(in) is identical to I,„. They 
have the same elements: the coset a + (m) and the congruence class [a] are the same subset 
of Z; they have the same addition: 

a + (m) + b + (m) = a + b + (m) = [a + b] = [a] + [b]. 

We can now see that the quotient ring Z/(m) coincides with the commutative ring I m , for 
the two multiplications coincide as well: 

(a + ( m))(b + (, m )) = ab + ( m ) = [ ab] — [a] [£>]. 

We can now prove a converse to Proposition 3.50. 

Corollary 3.111. If I is an ideal in a commutative ring R, then there are a commutative 
ring A and a ring homomorphism jt : R —>■ A with I = ker7r. 

Proof. If we forget the multiplication, then the natural map tc : R —► R/I is a homomor¬ 
phism between additive groups and, by Corollary 2.69, 

I — kerjr — {r e R : Jt(a) = 0 + 1 — 1}. 

Now remember the multiplication: (a + I)(b + I) = ab + /; that is, jr(a)jr(b) = jr(ab). 
Therefore, jt is a ring homomorphism, and ker jt is equal to I whether the function jt is 
regarded as a ring homomorphism or as a homomorphism of additive groups. • 

Theorem 3.112 (First Isomorphism Theorem). Iff - R -> A is a homomorphism of 
rings, then ker / is an ideal in R, im / is a subring of A, and 

R / ker / = im /. 

Proof. Let I = ker/. We have already seen, in Proposition 3.50, that I is an ideal in R 
and that im / is a subring of A. 

If we forget the multiplication in the rings, then the proof of Theorem 2.70 shows that 
the function <p: R/I —> A , given by ip(r + I) = f(r), is an isomorphism of additive 
groups. Since q>(\+I) = /(1) = 1, it now suffices to prove that (p preserves multiplication. 
But (p(fr + I)(s + /)) = <p(rs + I) — firs) — fir) f (s) = <p(r + I)<p(s + I). Therefore, 
<p is a ring isomorphism. • 

For rings as for groups, the first isomorphism theorem creates an isomorphism from a 
homomorphism once we know its kernel and image. It also says that there is no signif¬ 
icant difference between a quotient ring and the image of a homomorphism. There are 
analogs for commutative rings of the second and third isomorphism theorems for groups 
(see Exercise 3.82 on page 196 for the third isomorphism theorem; the second isomor¬ 
phism theorem is better stated in the context of modules; see Theorem 7.9), but they are 
less useful for rings than are their group analogs. However, there is a useful analog of the 
correspondence theorem, which we will prove later (see Proposition 6.1). 
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Definition. If k is a field, the intersection of all the subfields of k is called the prime field 
of k. 

Every subfield of C contains Q, and so the prime field of C and of R is Q. The prime 
field of a finite field is just the integers mod p, as we show next. 

Notation. From now on, we will denote I p by ¥ p when we are regarding it as a field. 

Borrowing terminology from group theory, call the intersection of all the subfields of 
a field containing a subset X the subfield generated by X\ it is the smallest subfield con¬ 
taining X in the sense that if F is any subfield containing X, then F contains the subfield 
generated by X. The prime field is the subfield generated by 1, and the prime field ofFpCr) 
is F p . 

Proposition 3.113. Ifk is afield, then its prime field is isomorphic to Q or to F p for some 
prime p. 

Proof. Consider the ring homomorphism x : Z —*■ k, defined by y in) = ns, where we 
denote the one in k by e. Since every ideal in Z is principal, there is an integer m with 
ker / = (m). If m — 0, then x is an injection, and so there is an isomorphic copy of Z that 
is a subring of k. By Exercise 3.47(ii) on page 150, there is a field Q = Frac(Z) = Q with 
i tn x ^ Q ^ k. Now Q is the prime field of k , for every sub field of k contains 1, hence 
contains imx. and hence it contains Q, for Q = Q has no proper subfields. If m f 0, 
the first isomorphism theorem gives = Z/(m) = imx ^ k. Since k is a field, imx 
is a domain, and so Proposition 3.6 gives m prime. If we now write p instead of m, then 
im x = {0, e, 2e, ..., (p — l)e} is a subfield of k isomorphic to Fp. Clearly, im x is the 
prime field of k , for every subfield contains e, hence contains im x- • 

This last result is the first step in classifying different types of fields. 

Definition. A field k has characteristic 0 if its prime field is isomorphic to Q; a field k 
has characteristic p if its prime field is isomorphic to F ; , for some prime p. 

The fields Q, R, C have characteristic 0, as does any subfield of them; every finite field 
has characteristic p for some prime p, as does F p (x), the ring of all rational functions 
over F p . 

Proposition 3.114. Ifk is a field of characteristic p > 0, then pa — 0 for all a € k. 

Proof Since k has characteristic p, we have p ■ 1 = 0, where 1 is the one in k. The result 
now follows from Proposition 3.2(v). • 

Proposition 3.115. Ifk is a finite field, then \k\ — p n for some prime p and some n > 1. 

Proof The prime field P of k cannot be the infinite field Q, and so P = ¥ p for some 
prime p. Now k is a vector space over P, and so it is a vector space over Fp. Clearly, k is 
finite-dimensional, and if dimir ;) (A:) = n, then \k\ — p n . • 
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Remark. Here is a proof of the last proposition using group theory. Assume that k is a 
finite field whose order |&| is divisible by distinct primes p and q. By Proposition 2.78, 
Cauchy’s theorem for abelian groups, there are elements a and b in k having orders p and 
q , respectively. If s denotes one in k , then the elements ps (the sum of s with itself p 
times) and qs satisfy (ps)a = 0 and ( qs)b — 0. Since A; is a field, it is a domain, and so 


pe — 0 — qs. 


But (p,q) — 1, so there are integers ,v and t with sp + tq — 1. Hence, e — s(pe) + t(qs) — 
0, and this is a contradiction. Therefore, \k\ has only one prime divisor, say, p, and so A 
is a power of p. ■* 


Proposition 3.116. Ifk is afield and I = (p(x)), where p(x) is a nonzero polynomial in 
k[x\ then the following are equivalent: p(x) is irreducible; k[x]/I is a field; k[x]/I is a 
domain. 

Proof. Assume that p(x) is irreducible. Note that I — ( p(x)) is a proper ideal, so that the 
one in k[x]/I, namely, 1 + /, is not zero. If f(x ) + I e k[x]/I is nonzero, then f(x ) f /, 
that is, fix) is not a multiple of p(x) or, to say it another way, p \ f. By Lemma 3.36, p 
and / are relatively prime, and so there are polynomials ,v and t with sf + tp — 1. Thus, 
sf —Is/, and so 1 + I — sf + I — (s + I)(f + !)■ Therefore, every nonzero element 
of k[x]/I has an inverse, and so k[x]/I is a field. 

Of course, every field is a domain. 

If k[x]/I is a domain. If p(x) is not an irreducible polynomial in k[x\, there is a 
factorization p(x) = g(x)h(x ) in A[x] with deg(g) < deg(p) and deg(7z) < deg(p). It 
follows that neither g(x) + I nor h(x) + I is zero in k[x]/I. After all, the zero in k[x\/I 
is 0 + I — /, and g(x ) + / = / if and only if g(x) el — (p(x))\ but if this were so, then 
p(x) | g(x), giving the contradiction deg(p) < deg(g). The product 

(g(x) + I)(h(x) + /) = p(x) + 1 = 1 

is zero in the quotient ring, and this contradicts k[x]/I being a domain. Therefore, p(x) 
must be an irreducible polynomial. • 

The structure of R/I can be rather complicated, but for special choices of R and /, 
the commutative ring R/I can be easily described. For example, when p(x) is an irre¬ 
ducible polynomial, the following proposition gives a complete description of the field 
k[x]/(p(x)). 

Proposition 3.117. Let k be a field, let p(x) e k[x] be a monic irreducible polynomial 
of degree d, let K — k[x]/1, where I — (p(x)), and let f = x + I e K. 

(i) K is a field and k' — [a + I : a e k] is a subfield of K isomorphic to k. Therefore, 
ifk' is identified with k, then k is a subfield of K. 

(ii) f is a root of p(x) in K. 



186 


Commutative Rings I Ch. 3 


(iii) If g(x) e k[x] and j3 is a root of g(x), then p(x) \ g(x) in k[x~\. 

(iv) p(x) is the unique monic irreducible polynomial in £[x] having f as a root. 

(v) The list 1, f, f 2 ,..., f d ~ l is a basis of K as a vector space over k, and so 
dim^l/T) = d. 

Proof (i) The quotient ring K = k[x]/I is a field, by Proposition 3.116, because p(x) is 
irreducible. It is easy to see, using Corollary 3.53, that the restriction of the natural map, 
<p — n \k : k —*■ K , defined by <p(a) = a + /, is an isomorphism from k —*■ k!. 

(ii) Let p(x) — ao + a\x + • • • + ad-ix d ~ 1 + x d , where a,- e k for all i. In K = k[x]/I, 

we have 

pif) — (ao + /) + («t + I)f + • • • + (1 + I)f d 

— (flo + I) + (ai + I)(x + /) + • • • + (1 + /)(x + I) d 

— {ao + /) + (a\x + /) + ••• + (\x d + I) 

— ao -{- a\x -H • • • T x d T I 
= p(x) + 1 = 1, 

because p(x) el — (p(x)). But I = 0 + I is the zero element of K = k[x]/I , and so ft 
is a root of p{x). 

(iii) If p(x) j g(x) in k[x~\, then their gcd is 1, because p(x) is irreducible. Therefore, there 
are s(x), t(x) e k[x ] with 1 = s(x)p(x) + t(x)g(x). Since k[x] c K\x ], we may regard 
this as an equation in K [x]. Evaluating at ft gives the contradiction 1=0. 

(iv) Let h{x) e k[ jr] be a monic irreducible polynomial having f as a root. By part (iii), 
we have p(x) \ h(x). Since h(x) is irreducible, we have h{x) — cpix ) for some constant 
c; since h(x ) and p(x) are monic, we have c = 1 and h(x) = p(x). 

(v) Every element of K has the form f(x) + /, where f(x) e k[x]. By the division 
algorithm, there are polynomials q(x),r(x ) e fc[x] with f(x) = q(x)p(x) + r{x) and 
either r(x) = 0 or deg(r) < d — deg(p). Since / — r = qp e /, it follows that 
f(x) + I — r{x) + I. If r(x) = bo + b\x + ■ ■ ■ + bd~ix d ~ l , where be k for all i, then 
we see, as in the proof of part (ii), that r(x) + I — bo + b[fi H— • + bj-if d ~ l - Therefore, 

...,f d ~ l spans K. 

To prove uniqueness, suppose that 

b 0 + b { f + • • • + bd-i/3"- 1 =c 0 + cif + --- + c d -iP d ~ l . 

Define g{x) e k[x] by g(x) = X^f=o^' — Q)x'; if g(x) — 0, we are done. If g(x) ^ 0, 
then deg(g) is defined, and deg(g) < d = dcg( p). On the other hand, f> is a root of g(x), 
and so part (iii) gives p(x) \ g(x); hence, deg(p) < deg(g), and this is a contradiction. 
It follows that 1, f, ft 2 , ■ ■ ■, /3 d is a basis of K as a vector space over k , and this gives 
dim^-(/T) = d. • 
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Definition. If A' is a field containing k as a subfield, then K is called a (field) extension 
of k , and we write “ K/k is a field extension.” 15 

An extension field K of a field k is a finite extension of k if K is a finite-dimensional 
vector space over k. The dimension of K , denoted by | K : k], is called the degree of K / k. 

Proposition 3.117(v) shows why [K : k] is called the degree of the extension K/k. 

Example 3.118. 

The polynomial x 2 +1 e R[x] is irreducible, and so K — R[x]/(x 2 +1) is a field extension 
K /E of degree 2. If /I is a root of x 2 + 1, then ( J f = —1; moreover, every element of K 
has a unique expression of the form a + bfi, where a. h e E. Clearly, this is another 
construction of C (which we have been viewing as the points in the plane equipped with a 
certain addition and multiplication). 

Here is a natural way to construct an isomorphism K -> C. Consider the evaluation 
map (p: M[x] -* C given by <p\ fix) i->- f(i). First, <p is surjective, for a + ib = 
( p(a + bx) e ixntp. Second, keri p — {fix) e R[x] : /(i) = 0}, the set of all polynomials 
in R[x] having i as a root. We know that x 2 + 1 € ker cp, so that (x 2 + 1) c ker q>. For the 
reverse inclusion, take g{x) e ker <p. Now i is a root of gix), and so gcd ig, x 2 + 1) ^ 1 
in C[x]; therefore, gcd (g,x 2 + 1) ^ 1 in R[x]. Irreducibility of x 2 + 1 in R[x] gives 
x 2 + 1 | g(x). and so g(x) e (x 2 + 1), Therefore, ker <p — (x 2 + 1). The first isomorphism 
theorem now gives E[x]/(x 2 + 1) = C. ◄ 

The easiest way to multiply in C is to first treat i as a variable and then to impose the 
condition ; 2 = — 1. To compute {a + bi)ic + di), first write ac + {ad + bc)i + bdi 2 , and 
then observe that i 2 — —1. More generally, if f is a root of an irreducible p(x) e k[x ], 
then the proper way to multiply 


{bo + b\fi + • • • + b n -ifi n *)(co + cif + ■ ■ ■ + c„-if3 n \) 

in the quotient ring k[x)/{p{x)) is to regard the factors as polynomials in f, multiply them, 
and then impose the condition that pifi) — 0. 

A first step in classifying fields involves their characteristic; that is, describing prime 
fields. A next step considers whether the elements are algebraic over the prime field. 

Definition. Let K/ k be a field extension. An element a e K is algebraic over k if there is 
some nonzero polynomial /(x) e k [x | having a as a root; otherwise, a is transcendental 
over k. An extension K/k is algebraic if every a e K is algebraic over k. 

When a real number is called transcendental, it usually means that it is transcendental 
over Q. 

Proposition 3.119. IfK/k is a finite field extension, then K/k is an algebraic extension. 

15 This notation should not be confused with the notation for a quotient ring, for a field K has no interesting 
ideals; in particular, if k C K, then k is not an ideal in K. 
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Proof. By definition, K/k finite means that [K : k] — n < oo; that is, K has dimension 
n as a vector space over k. By Corollary 3.88, the list of n + 1 vectors 1, a, a 2 ,..., a n 
is dependent. Thus, there are cq, ci, ..., c n e k, not all 0, with ff q«' = 0. Thus, the 
polynomial f(x) = CjX 1 is not the zero polynomial, and a is a root of fix). Therefore, 
a is algebraic over k. • 

The converse of this last proposition is not true. We shall see, in Example 6.55, that the 
set A of all complex numbers algebraic over Q is an algebraic extension of Q that is not a 
finite extension. 

Definition. If K/k is an extension and a e K, then k(a) is the intersection of all those 
subfields of K that contain k and a; we call k(a) the subfield of K obtained by adjoining 
a to k. 

More generally, if A is a (possibly infinite) subset of K , define k(A) to be the intersec¬ 
tion of all the subfields of K that contain k U A; we call k(A) the subfield of K obtained by 
adjoining A to k. In particular, if A = [zi,..., z n } is a finite subset, then we may denote 
k(A) by k(z \,..., Zn)- 

It is clear that k(A) is the smallest subfield of K containing k and A; that is, if B is any 
subfield of K containing k and A, then k(A) C B. 

We now show that the field k[x]/(p(x)), where p(x) e k[x~\ is irreducible, is intimately 
related to adjunction. 

Theorem 3.120. 

(i) If KIk is an extension and a e K is algebraic over k, then there is a unique monic 
irreducible polynomial p{x) € k\x ] having a as a root. Moreover, if I — (p(x)), 
then k[x]/I = k(a)\ indeed, there exists an isomorphism 

(p : k[x]/I —> k(a) 

with (p{x + /) = a and (fi(c + /) = c for all c € k. 

(ii) If a’ e K is another root of p(x), then there is an isomorphism 

6 : k(a) -> k(aj 

with 9(a) = a 1 and 6(c) — cfor all c e k. 

Proof, (i) Consider evaluation, the ring homomorphism cp: k[x\ K defined by 

<P ■ fix) f(ot). 

Now mup is the subring of K consisting of all polynomials in a; that is, all elements of 
the form f(a) with f(x) e k[x\. Now ker<p is the ideal in k[xj consisting of all those 
fix) e k[xj having a as a root. Since every ideal in k[x~\ is a principal ideal, we have 
ker <p — (p(xj) for some monic polynomial p(x) e k[x~\. But k[x\/(p(x)) = im q>, which 
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is a domain, and so p(x) is irreducible, by Proposition 3.116. This same proposition says 
that k[x]/(p{x)) is a field, and so the first isomorphism theorem gives k[x]/(p(x)) = im <p; 
that is, imq> is a subfield of K containing k and a. Since every subfield of K that contains 
k and a must contain im <p, we have im (p = k(a). We have proved everything in the 
statement except the uniqueness of p(x); but this now follows from Proposition 3.117(iv). 

(ii) As in part (i), there are isomorphisms (p: k[x]/I -> k(a) and i//: k[x\/I —>• k(a') with 
<p(c + I) = c and flc) — c + I for all c e k; moreover, (p: x + I i-> a and i//: x +1 (->• a'. 
The composite 9 = \fi(p~ 1 is the desired isomorphism. • 


Definition. If K / k is a field extension and a e K is algebraic over k. then the unique 
monic irreducible polynomial p (x ) e k [x | having a as a root is called the minimal poly¬ 
nomial of a over k, and it is denoted by 

irr(a, k) = p(x). 

The minimal polynomial irr(a, k) does depend on k. For example, irr(;\ R) = x 2 + 1, 
while irr(i, C) = x — i. 

The following formula is quite useful, especially when proving a theorem by induction 
on degrees. 

Theorem 3.121. Let k C E C K be fields, with E a finite extension of k and K a finite 
extension of E. Then K is a finite extension ofk, and 

[K : k] — [K : E][E : k]. 

Proof. If A = a\,... ,a n is a basis of E over k and if B — b\, ... , b m is a basis of K 
over E, then it suffices to prove that a list X of all a t b j is a basis of K over k. 

To see that X spans K, take u e K. Since B is a basis of K over E, there are scalars 
A .j e E with u = k jbj- Since A is a basis of E over k, there are scalars p.ji e k with 
Xj = pLjiaj. Therefore, u — iijjUibj, and X spans K over k. 

To prove that X is linearly independent over k, assume that there are scalars // /; e k 
with i pijjaibj — 0. If we define Xj = JT pijiai, then Xj e E and ^ f Xpbj = 0. 
Since B is linearly independent over E, it follows that 

0 — Xj = ^ ^ ItjiCli 
i 

for all j. Since A is linearly independent over k, it follows that /r /( = 0 for all j and i, as 
desired. • 

There are several classical problems in euclidean geometry: trisecting an angle; dupli¬ 
cating the cube (given a cube with side length 1, construct a cube whose volume is 2); 
squaring the circle (given a circle of radius 1, construct a square whose area is equal to the 
area of the circle). In short, the problems ask whether geometric constructions can be made 
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using only a straightedge (ruler) and compass according to certain rules. Theorem 3.121 
has a beautiful application in proving the unsolvability of these classical problems. For a 
discussion of these results, the reader may see my book, A First Course in Abstract Alge¬ 
bra , pages 332-344. 

Example 3.122. 

Let fix) — x 4 — I0x 2 + 1 e Q[x]. If (3 is a root of fix), then the quadratic formula gives 
fr — 5 ± 2V6. But the identity a + 2 sfab + b = a + s/b)" gives /3 = ±(V2 + s/3). 
Similarly, 5 — 2^6 = (s/2 — V3)~, so that the roots of fix) are 

V2 + V3, -V2-V3, V2-V3, -V2 + V3. 

By Theorem 3.43, the only possible rational roots of fix) are ±1, and so we have just 
proved that these roots are irrational. 

We claim that fix) is irreducible in Q[x]. If gix) is a quadratic factor of fix) in Q[x], 
then 

gix) — (x — a\/ 2 — bs/3)(x — csfl — ds! 3), 
where a, b,c,d e {1, —1}. Multiplying, 

gix) — x 2 — + c)sfl + ib + d)s/ 3^x + lac + 3 bd + iad + bc)s/ 6. 

We check easily that ia + c)sfl + ib + d)s/ 3 is rational if and only if a + c = 0 = b + d\ 
but these equations force ad + be 0, and so the constant term of gix) is not rational. 
Therefore, gix) f Q[x], and so fix) is irreducible in Q[jc]. If ft — -Jl + then 
fix) = irr if, Q). 

Consider the field E — Q(/S) = Q(V2 + V3). There is a tower of fields Q C E C F, 
where F — Q(\/2, V3), and so 

IF : Q] = [F : E][E : Q], 

by Theorem 3.121. Since E = Q(ft) and f is a root of an irreducible polynomial of degree 
4, namely, fix), we have [E : Q] = 4. On the other hand, 

[F : Q] = [F : Q(V2)][Q(V2) : Q]. 

Now [Q(\/2) : Q] = 2, because s/2 is a root of the irreducible quadratic x 2 — 2 in Q[x], 
We claim that [F : Q(V2)] < 2. The field F arises by adjoining s/3 to Q(\/2); either 
s/3 e Q(\/2), in which case the degree is 1, or x 2 — 3 is irreducible in Q(\/2)[x], in 
which case the degree is 2 (in fact, the degree is 2). It follows that [F : Q] < 4, and so the 
equation [F : Q] = [F : E][E : Q] gives [F : E] = 1; that is, F = E. 

Let us note that F arises from Q by adjoining all the roots of fix), and it also arises 
from Q by adjoining all the roots of gix) — (x 2 — 2)(x 2 — 3). ◄ 

We now prove two important results: The first, due to L. Kronecker, says that if fix) e 
k[x), where k is any field, then there is some larger field E that contains k and all the roots 
of fix); the second, due to E. Galois, constructs finite fields other than ¥ p . 
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Theorem 3.123 ( Kronecker ). Ifk is afield and f(x) e k[x], then there exists afield K 

containing k as a subfield and with fix) a product of linear polynomials in K\x\ 

Proof. The proof is by induction on deg(/). If deg if) = 1, then f(x ) is linear and we 
can choose K — k. If deg(/) > 1, write fix) = p(x)g(x), where p(x) is irreducible. 
Now Proposition 3.117(i) provides a field F containing k and a root z of p(x). Hence, in 
F[x], we have p{x) = (x — z)h(x) and /(x) = (x — z)h(x)g(x). By induction, there 
is a field K containing F (and hence k) so that h(x)g(x), and hence fix), is a product of 
linear factors in K[x\. • 

For the familiar fields Q, R, and C, Kronecker’s theorem offers nothing new. The 
fundamental theorem of algebra, first proved by Gauss in 1799 (completing earlier attempts 
of Euler and of Lagrange), says that every nonconstant fix) e C[x] has a root in C; it 
follows, by induction on the degree of /(x), that all the roots of fix) lie in C; that is, 
f(x) — a(x — r i)... (x — r n ), where a e C and rj e C for all j. On the other hand, 
if k — F p or k = C(x) = Frac(C[x]), then the fundamental theorem does not apply; 
but Kronecker’s theorem does apply to tell us, for any given fix), that there is always 
some larger field E that contains all the roots of fix). For example, there is some field 
containing C(x) and ~J1k . There is a general version of the fundamental theorem that we 
give in Chapter 6 : Every field k is a subfield of an algebraically closed field K , that is, K 
is a field containing k such that every fix) e /f[x] is a product of linear polynomials in 
K [x]. In contrast, Kronecker’s theorem gives roots of just one polynomial at a time. 

The definition of k(A), the field obtained by adjoining a set A to k, assumes that A is 
a subset of a field extension K of k. In light of Kronecker’s theorem, we may now speak 
of a field extension kiz i,..., z n ) obtained by adjoining all the roots of some fix) e k[x] 
without having to wonder whether such an extension K/k exists. 

Definition. Let k he a subfield of a field K , and let fix) e k[x]. We say that fix) splits 
over K if 

fix) = aix - Zi) • • • (x - Zn), 

where zi,...,z n are in K and a e k is nonzero. 

If fix) e k[x] is a polynomial, then a field extension E/k is called a splitting field of 
fix) over k if fix) splits over E, but fix) does not split over any proper subfield of E. 

For example, consider fix) = x 2 + 1 e Q[x], The roots of fix) are ±i, and so fix) 
splits over C; that is, fix) = (x — i) (x + i) is a product of linear polynomials in C[x], 
However, C is not a splitting field over Q, for C is not the smallest field containing Q and 
all the roots of fix). The splitting field of fix) e k[x] depends on k as well as on fix): 
Here, the splitting field over Q is Q(i); the splitting field over R is R(i) = C. 

In Example 3.122, we proved that E — Q(V2 + V3) is a splitting field of fix) — 
x 4 — 10x 2 + 1, as well as a splitting field of g(x) = (x 2 — 2)(x 2 — 3). 

The existence of splitting fields is an easy consequence of Kronecker’s theorem. 

Corollary 3.124. Let k be a field, and let fix) € k[x]. Then a splitting field of fix) over 
k exists. 
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Proof. By Kronecker’s theorem, there is a field extension K/k such that fix) splits in 
K[x\, say, fix) = a(x — u\) ■ ■ ■ (x — a„). The subfield E — k(a i,..., a„) of K is a 
splitting field of fix) over k. • 

Thus, a splitting field of fix) e k [x ] is the smallest subfield E of K containing k and 
all the roots of fix). The reason we say “a” splitting field instead of "the” splitting field is 
that the definition involves not only fix) and k, but the larger field K as well. Analysis of 
this technical point will enable us to prove Corollary 3.132: Any two finite fields with the 
same number of elements are isomorphic. 

Example 3.125. 

Let k be a field and let £ = k(y\,... ,y n ) be the rational function field in n variables 
y\,... ,y n over k; that is, E = Frac(k[yi,..., y„]), the fraction field of the ring of poly¬ 
nomials in n variables. The general polynomial of degree n over k is defined to be 

fix) = Y\ (x ~ Vi) e Frac <M;yi, • • • - y n ])W- 

i 

The coefficients of fix) — (x — yi)(x — yf) ■ ■ ■ (x — y n ), which we denote by a,-, can be 
given explicitly [see Eqs. (1) on page 198] in terms of the y’s. Notice that £ is a splitting 
field of fix) over the field K = kiao, ..., a n - 1 ), for it arises from K by adjoining to it all 
the roots of fix), namely, all the y’s. < 

Here is another application of Kronecker’s theorem. 

Proposition 3.126. Let p be a prime, and let k be a field. If fix) — x p — c € k[x] and 
a is a pth root of c iin some splitting field), then either fix) is irreducible in k[x) or c has 
a pth root in k. In either case, ifk contains the pth roots of unity, then k(a) is a splitting 
field of fix). 

Proof. By Kronecker’s theorem, there exists a field extension K/k that contains all the 
roots of fix): that is, K contains all the pth roots of c. If a p = c , then every such root has 
the form o>a, where a> is a pth root of unity; that is, <x> is a root of x p — 1. 

If fix) is not irreducible in k[x\, then there is a factorization fix) — gix)hfx) in k[x] 
with gix) a nonconstant polynomial with d — deg(g) < deg(/) = p. Now the constant 
term b of gix) is, up to sign, the product of some of the roots of fix): 

±b — a d a>, 

where <x>, which is a product of d pth roots of unity, is itself a pth root of unity. It follows 
that 

i±b) p = (a d a>) p = a dp = c d . 

But p being prime and d < p forces id. p) — 1; hence, there are integers ,v and t with 
1 = sd +tp. Therefore, 


c = c sd+,p = c sd c tp = i±b) ps c tp = [i±b) s c'] p . 
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Therefore, c has a pth root in k. 

If a e K is a pth root of c, then f(x) = ]~[ (x — coa), where a> ranges over the pth roots 
of unity. Since we are now assuming that all a> lie in k, it follows that k(a) is a splitting 
field of fix). • 

It follows, for every prime p, that x p — 2 is irreducible in Q[x], 

We are now going to construct the finite fields. My guess is that Galois knew that C can 
be constructed by adjoining a root of a polynomial, namely, x 2 + 1, to R, and so it was 
natural for him to adjoin a root of a polynomial to F p . Note, however, that Kronecker’s 
theorem was not proved until a half century after Galois’s death. 

Theorem 3.127 (Galois). If p is a prime and n is a positive integer, then there is afield 
having exactly p n elements. 

Proof. Write q — p n , and consider the polynomial 

g(x) = x q — x e F p [x], 

By Kronecker’s theorem, there is a field K containing ¥ p such that g(x) is a product of 
linear factors in /G [jr]. Define 


E — {a e K : g(a) — 0}; 

thus, E is the set of all the roots of g(x). Since the derivative g'(x) = qx q ~ l — 1 = 
p n x q ~ 1 — 1 = —1 (see Exercise 3.23 on page 130), it follows that the gcd(g, g') is 1. By 
Exercise 3.37 on page 142, all the roots of g(x) are distinct; that is, E has exactly q — p n 
elements. 

We claim that £ is a subfield of K, and this will complete the proof. If a, h e E, then 
a q — a and b q — b. Therefore, (ab) q = a q b q = ab, and ab e E. By Exercise 3.45 on 
page 149(iii), (a — b) q — a q — b q — a — b, so that a — b e E. Finally, if a / 0, then the 
cancellation law applied to a q = a gives a q ~ l — 1 , and so the inverse of a is a q ~ 2 (which 
lies in E because E is closed under multiplication). • 

We will soon see that any two finite fields with the same number of elements are iso¬ 
morphic. 

Recall Theorem 3.30: The multiplicative group of a finite field k is a cyclic group; a 
generator a of this group is called a primitive element, that is, every nonzero element of k 
is a power of a. 

Notation. Denote a finite field having q — p n elements (where p is a prime) by 

V 

Corollary 3.128. For every prime p and every integer n > 1, there exists an irreducible 
polynomial g(x) e F p [x] of degree n. In fact, if a is a primitive element of F p n, then its 
minimal polynomial g(x) = irr(a, F^,) has degree n. 
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Remark. An easy modification of the proof replaces F ; , by any finite field. < 


Proof. Let E /F ; , be a field extension with p" elements, and let a e E be a primitive ele¬ 
ment. Clearly, F ; ,(a) = E, for it contains every power of a , hence every nonzero element 
of E. By Theorem 3.120(i), g(x) = irr(a, F p ) e F p [x] is an irreducible polynomial hav¬ 
ing a as a root. If deg(g) = d, then Proposition 3.117(v) gives [F p [x]/(,g(x)) : F p ] = d; 
but F ; ,[x]/ (g (x)) = Fp(a) = E, by Theorem 3.120(i), so that [£ : F p ] = n. Therefore, 
n = d, and so g(x) is an irreducible polynomial of degree n. • 


This corollary can also be proved by counting. If m = p e { 1 ■ ■ ■ p e " define the Mobius 
function by 


dim) 


1 

0 


if m — 1; 
if any e, > 1; 


(-1)' 1 if l=e 1= e 2 = •••=«„. 


If N„ is the number of irreducible polynomials in F p [x] of degree n. then 


N„ 


-J2p(d)p n/d . 

n z —' 


An elementary proof can be found in G. J. Simmons, “The Number of Irreducible Polyno¬ 
mials of Degree n over GF(p),” American Mathematical Monthly 77 (1970), pages 743- 


745. 


Example 3.129. 

(i) In Exercise 3.14 on page 125, we constructed a field k with four elements: 


k — 


a b 
b a + b 


: a, b eh 


On the other hand, we may construct a field of order 4 as the quotient F = F 2 [x]/(<y (x)), 
where q(x) e F 2 [x] is the irreducible polynomial x 2 +x+l. By Proposition 3.117(v), F is 
a field consisting of all a + bz, where z = x + (g(x)) is a root of g(x) and a, b e I 2 . Since 
z 2 +z+ 1 =0, we have z 2 — — z— 1 = z+1; moreover, z 3 = zz 2 — z(z + 1) = z 2j rz = 1. 


It is now easy to see that there is a ring isomorphism <p : k —> F with q> 


a 

b 


b 

a + b 


a + bz. 


(ii) According to the table in Example 3.35(ii) on page 137, there are three monic irre¬ 
ducible quadratics in F 3 [x], namely. 


p(x) — x 2 + 1 , q(x ) = x 2 + x — 1 , and r(x) = x 2 — x — 1 ; 


each gives rise to a field with 9 = 3 2 elements. Let us look at the first two in more detail. 
Proposition 3.117(v) says that E = F 3 [x]/(p(x)) is given by 

9 

E — {a + ba : where a" + 1 = 0}. 
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Similarly, if F = F 3 [x]/(< 7 (x)), then 

F — {a + bf : where f 2 + f — 1 = 0}. 

These two fields are isomorphic, for the map <p: E -* F (found by trial and error), defined 
by 

<p(a + ba ) = a + b( 1 — ft), 

is an isomorphism. 

Now F 3 [x]/(x 2 — x — 1) is also a field with nine elements, and it can shown that it is 
isomorphic to both of the two fields E and F just given (see Corollary 3.132). 

(iii) In Example 3.35(h) on page 137, we exhibited eight monic irreducible cubics p (x) e 
F 3 [x]; each of them gives rise to a field F 3 [x]/(;?(x)) having 27 = 3 3 elements. ◄ 

We are now going to solve the isomorphism problem for finite fields. 

Lemma 3.130. Let f(x ) e k[x], where k is a field, and let E be a splitting field of 
f(x) over k. Let (p: k —> k' be an isomorphism of fields, let q>* : k[x] —> k'[x] be the 
isomorphism 

g{x) — ao + a\x H-h a„x n i-> g*(x) — <p(ao) + <p{a\)x H-h < p(a n )x n , 

and let E' be a splitting field of f *(x) over k'. Then there is an isomorphism 
<1>: £ —> extending tp. 



Proof. The proof is by induction on d — [E : k]. If d — 1, then f(x) is a product 
of linear polynomials in k[x], and it follows easily that f*(x) is also a product of linear 
polynomials in k'[x\. Therefore, E' — k', and we may set <t> = tp. 

For the inductive step, choose a root z of / (x ) in E that is not in k, and let pix) = 
irr(7, k) be the minimal polynomial of z over k (Proposition 3.117). Now deg( p) > 1, 
because z f k; moreover, [k(z) : k] — deg (p), by Theorem 3.117. Let z! be a root of 
p*(x) in E', and let p*(x) — irr (z!, k') be the corresponding monic irreducible polynomial 
in k'\x]. 

By a straightforward generalization 16 of Proposition 3.120(h), there is an isomorphism 
<p : k(z ) —> k'(z') extending cp with tp: z z' ■ We may regard /(x) as a polynomial with 

1 ^Proving the generalization earlier would have involved introducing all the notation in the present hypothe¬ 
sis. and so it would have made a simple result appear complicated. The isomorphism i p: k ^ k' induces an 
isomorphism <p* : k\. v] —»• k’\x\. which takes pix) to some polynomial p*(x), and ip* induces an isomorphism 
k[x]/(p(x)) k'[x]/(p*(x)). 
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coefficients in k(z) (for k C k(z) implies k[x] C k(z)Lr]). We claim that £ is a splitting 
field of f(pc) over k(z)\ that is. 


E = k(z)(zi, ■ ■. ,z n ), 
where zi, ■ ■ ■, Z n are the roots of f(x)/(x — z,)\ after all, 

E = k(z, Zi, . . . ,Z n ) = k(z)(zi, ■ ■ ■ ,Z n )• 

Similarly, E' is a splitting field of f*(x) over k'(z'). But [£ : k(z)] < [E : k], by 
Theorem 3.121, so that the inductive hypothesis gives an isomorphism <J>: E -> E' that 
extends ip, and hence <p. • 


Theorem 3.131. Ifk is afield and f(x ) e A:[a], then any two splitting fields of fix) over 
k are isomorphic via an isomorphism that fixes k pointwise. 

Proof Let E and E' be splitting fields of / (x) over k. If <p is the identity, then the 
theorem applies at once. • 

It is remarkable that the next theorem was not proved until the 1890s, 60 years after 
Galois discovered finite fields. 

Corollary 3.132 {E. H. Moore). Any two finite fields having exactly p" elements are 
isomorphic. 

Proof. If £ is a field with q — p" elements, then Lagrange’s theorem applied to the 
multiplicative group E x shows that a q ~ [ — 1 for every a e E x . It follows that every 
element of £ is a root of f (x) — x q — x e F p [x], and so £ is a splitting field of f(x) 
over Fp. • 

E. H. Moore (1862-1932) began his mathematical career as an algebraist, but he did 
important work in many other parts of mathematics as well; for example, Moore-Smith 
convergence is named in part after him. 

Finite fields are often called Galois fields in honor of their discoverer. In light of Corol¬ 
lary 3.132, we may speak of the field with q elements, where q — p" is a power of a 
prime p. 

Exercises 

3.81 Prove that if / = {0}, then R/I = R. 

3.82 (Third Isomorphism Theorem for Rings) If £ is a commutative ring having ideals / C J. 
then J/I is an ideal in R/I and there is a ring isomorphism (R/I)/(J //) = R/J. 

3.83 For every commutative ring R, prove that R[x\/(x) = R. 

3.84 Prove that IF 3 [jc]/(jc 3 — x 2 + 1) = F 3 [.r]/(jr 3 — x 2 + x + 1). 
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3.85 

3.86 

3.87 


3.88 

3.89 

3.90 

3.91 

3.92 

3.93 

3.94 

3.95 


If X is a subset of a commutative ring R. define 1(X) to be the intersection of all those ideals 
/ in R that contain X. Prove that T(X) is the set of all a e R for which there exist finitely 
many elements x\, ... ,x n e X and elements r ,• € R with a = r\x \ + • • • + r n x n . 

Let h(x). p{x) e k[x~\ be monic polynomials, where k is a field. If p(x) is irreducible and if 
every root of h(x) (in an appropriate splitting field) is also a root of p{x), prove that h(x) — 
p(x) m for some integer m > 1 . 

Hint. Use induction on deg(/t). 

Chinese Remainder Theorem. 

(i) Prove that if k is a field and f(x), f'{x) e k[x] are relatively prime, then given 
b(x), b'(x) e k[x], there exists c{x) e k[x] with 

c — b e (/) and c — b' e (f')\ 

moreover, if d(x) is another common solution, then c — d e iff'). 

Hint. Adapt the proof of Theorem 1.28. This exercise is generalized to commutative 
rings in Exercise 6.1 l(iii) on page 325. 

(ii) Prove that if k is a field and /Ox), g(x) e k[x] are relatively prime, then 

k[x]/(f(x)g(x)) = k[.x]/(f(x)) x k[x]/(g(x)). 

Hint. See the proof of Theorem 2.81. 

(i) Prove that a field K cannot have subfields k' and k" with k' = Q and k" = ¥ p for some 
prime p. 

(ii) Prove that a field K cannot have subfields k' and k" with k' = F p and k" = F ? , where 
p q are primes. 

Prove that the stochastic group E(2, F 4 ) = A 4 . 

Hint. See Exercise 3.19 on page 125. 

Let f(x) = + six + • • • + s n _ ix n ~ l + x 11 e 0[x], where k is a field, and suppose that 

f(x) — (x — a\){x — 012 ) ■ ■ ■ (x — a n ). Prove that 4-„_i = — (ffi + ai + • • • + a n ) and that 
j ’0 = (—1)”q'i 0'2 ■■•««. Conclude that the sum and product of all the roots of f(x) lie in k. 

Write addition and multiplication tables for the field Fg with eight elements. 

Hint. Use an irreducible cubic over Ft. 

Let k C K C E be fields. Prove that if £ is a finite extension of k, then £ is a finite extension 
of K and £ is a finite extension of k. 

Hint. Use Corollary 3.90(h). 

Let k C £ C K be a tower of fields, and let z € K. Prove that if k(z)/k is finite, then 
[E(z) : £] < [k(z) : k]. In particular, [£(z) : £] is finite. 

Hint. Use Proposition 3.117 to obtain an irreducible polynomial p(x) e k[x\, the polynomial 
p(x) may factor in K\x\. 

(i) Is F 4 a subfield of Fg? 

(ii) For any prime p, prove that if F^n is a subfield of F p m , then n \ m (the converse is also 
true, as we shall see later). 

Hint. View F p m as a vector space over F p n. 

Let K/k be a field extension. If A C K and u e k(A). prove that there are a\ . a n e A 

with u e k{a j, .... a n ). 
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Fields 


4.1 Insolvability of the Quintic 

This chapter will discuss what is nowadays called Galois theory (it was originally called 
theory of equations ), the interrelation between field extensions and certain groups asso¬ 
ciated to them, called Galois groups. This theory will enable us to prove the theorem of 
Abel-Ruffini as well as Galois’s theorem describing precisely when the quadratic formula 
can be generalized to polynomials of higher degree. Another corollary of this theory is a 
proof of the fundamental theorem of algebra. 

By Kronecker’s theorem. Theorem 3.123, for each monic f(x) e k[x], where k is a 
field, there is a field K containing k and (not necessarily distinct) roots z\,... ,z n with 

f(x) — x n + a n -\x"~ l H-h a\x + ao = (x — z t) • • • (x - z n )• 

By induction on n > 1, we can easily generalize 1 Exercise 3.90 on page 197: 

a „-1 = -Y'Z, 

i 

a „~2 = Y. z > z j 
i<j 

\ ( 1 ) 

a n -3 = - / , ZiZjZk 

i<j<k 

, ao — (-l)”ziZ 2 • • -Zn- 

^The coefficients d[ may be viewed as polynomials in z\, ■ ■ ■ ,Zn\ as such, they are called the elementary 
symmetric polynomials , for they are unchanged if the z ’s are permuted. 
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Notice that —a„-i is the sum of the roots and that ±c/q is the product of the roots. Given the 
coefficients of fix), can we find its roots; that is, given the a's, can we solve the system 
(1) of n equations in n unknowns? If n = 2, the answer is yes: The quadratic formula 
works. If n = 3 or 4, the answer is still yes, for the cubic and quartic formulas work. But 
if n > 5, we shall see that no analogous solution exists. 

We did not say that no solution of system (1) exists if n >5; we said that no solution 
analogous to the solutions of the classical formulas exists. It is quite possible that there 
is some way of finding the roots of a quintic polynomial if we do not limit ourselves to 
field operations and extraction of roots only. Indeed, we can find the roots by Newton’s 
method ; if r is a real root of a polynomial fix) and if xq is a “good” approximation to r, 
then r — lim^oo*,,, where x n is defined recursively by x n + \ — x„ — f(x n )/f'ix n ) for 
all n > 0. There is a method of Hermite finding roots of quintics using elliptic modular 
functions, and there are methods for finding the roots of many polynomials of higher degree 
using hypergeometric functions. 

We are going to show, if n > 5, that there is no solution “by radicals” (we will define 
this notion more carefully later). The key observation is that symmetry is present. Recall 
from Chapter 2 that if Q is a polygon in the plane R 2 , then its symmetry group E (Q) 
consists of all those motions tp: R 2 —»• R 2 of the plane for which q>i£t) = Moreover, 
motions tp e E(£2) are completely determined by their values on the vertices of A; indeed, 
if f2 has n vertices, then E ( Q) is isomorphic to a subgroup of S„. 

We are going to set up an analogy with symmetry groups in which polynomials play the 
role of polygons, a splitting field of a polynomial plays the role of the plane M 2 , and an 
automorphism fixing k plays the role of a motion. 

Definition. Let £ be a field containing a subfield k. An automorphism 2 of E is an 
isomorphism o : E —> E\ we say that a fixes k if oia) = a for every a e k. 

For example, consider f(x) — x 2 + 1 e Q[x]. A splitting field of fix) over Q is 
E = Q(t'), and complex conjugation a : a i-> a is an example of an automorphism of E 
fixing Q. 

Proposition 4.1. Let k be a subfield of a field K, let 

fix) = x n + a„-ix n ~ l + ■ ■ ■ + a\x +ao e k[x ], 

and let E = k(z \,..., z n ) C K be a splitting field. If o : E —» E is an automorphism 
fixing k, then a permutes the set of roots {zi, ..., Zn] of fix). 

Proof. If r is a root of fix), then 


0 = fir) = r n + a„-\r n *H - hair+ao- 

2 The word automorphism is made up of two Greek roots: auto, meaning “self," and morph, meaning "shape” 
or “form.” Just as an isomorphism carries one group onto an identical replica, an automorphism carries a group 
onto itself. 
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Applying a to this equation gives 

0 = a(r) n + a(a„-i)a(r)”~ l H-b er(ai)er(r) + a(a 0 ) 

= a(r )” + a„_ia(r)" _1 H-b ai<r(r) + a Q 

= f(o(r)), 

because a fixes k. Therefore, o(r) is a root of fix)', thus, if Z is the set of all the roots, 
then ojZ: Z -» Z, where er|Z is the restriction. But ct|Z is injective (because a is), so 
that Exercise 1.58 on page 36 says that er|Z is a permutation of Z. • 

Here is the analog of the symmetry group E(f2) of a polygon £2. 

Definition. Let k he a sub field of a field E. The Galois group of E over k, denoted by 
Gal (E/k), is the set of all those automorphisms of E that fix k. If f(x) e k[x ], and if 
E — k(z \. ..., Zn) is a splitting field, then the Galois group of fix) over k is defined to 
beGalf E/k). 

It is easy to check that Gal( E/k) is a group with operation composition of functions. 
This definition is due to E. Artin (1898-1962), in keeping with his and E. Noether’s em¬ 
phasis on “abstract” algebra. Galois’s original version (a group isomorphic to this one) was 
phrased, not in terms of automorphisms, but in terms of certain permutations of the roots of 
a polynomial (see Tignol, Galois’ Theory of Algebraic Equations, pages 306-331). Note 
that Gal (E/k) is independent of the choice of splitting field E, by Theorem 3.131. 

The following lemma will be used several times. 

Lemma 4.2. Let E — k(z i,..., z„). If : E -> E is an automorphism fixing k, that is, 
if a e Gal( E/k), and ifo(z,i) = Zi for all i, then o is the identity 1 e- 

Proof We prove the lemma by induction on n > 1. If n = 1, then each u e E has the 
form u — /(zt)/g(zt), where fix), g{x) e k[x] and g(zi) f 0. But a fixes zi as well 
as the coefficients of fix) and of g(x), so that a fixes all u e E. For the inductive step, 
write K — k{z\, ..., z„-i), and note that E — K(z n ) [for K{z n ) is the smallest subfield 
containing k and zi,..., z«-i, z„]. Having noted this, the inductive step is just a repetition 
of the base step with k replaced by K. • 

Theorem 4.3. If fix) € k[x) has degree n, then its Galois group GaKE / k) is isomorphic 
to a subgroup of S n . 

Proof Let X — {zi,..., z n }. If a e Gal iE/k), then Proposition 4.1 shows that its 
restriction o\X is a permutation of X: that is, <j\X e Sx■ Define tp\ Gal (E/k) -> Sx 
by q>\ a (->• o\X. To see that q> is a homomorphism, note that both (plot) and tpio)(pir) 
are functions X X, and hence they are equal if they agree on each Zi e X. But 
(picrr): Zi i— ((TT)izi), while tpio)(pir): Zi cr(r(z,)), and these are the same. 

The image of (p is a subgroup of Sx = S„. The kernel of tp is the set of all o e Gal iE/k) 
such that a is the identity permutation on X ; that is, o fixes each of the roots z,-. As a also 
fixes k, by definition of the Galois group. Lemma 4.2 gives ker tp — {1}. Therefore, tp is 
injective, giving the theorem. • 
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If fix) — x 2 + 1 e Q[x], then complex conjugation a is an automorphism of its split¬ 
ting field Q(i) which fixes Q (and interchanges the roots i and —i). Since Gal(Q(i)/Q) is 
a subgroup of the symmetric group S 2 , which has order 2, it follows that Gal(Q(/)/Q) = 
(er) = I 2 . We should regard the elements of any Galois group Gal (E/k) as generalizations 
of complex conjugation. 

We are going to compute the order of the Galois group, but we first obtain some infor¬ 
mation about field isomorphisms and automorphisms. 

Lemma 4.4. Ifk is afield of characteristic 0, then every irreducible polynomial p(x) e 
k[x) has no repeated roots. 

Proof. In Exercise 3.37 on page 142, we saw, for any (not necessarily irreducible) poly¬ 
nomial fix) with coefficients in any field, that fix) has no repeated roots if and only if 
the gcd (/, f) — 1, where fix) is the derivative of fix). 

Now consider p(x) e k[x). Either p'{x) = 0 or deg(p') < deg(p). Since p(x) is 
irreducible, it is not constant, and so it has some nonzero monomial UjX 1 , where i > 1. 
Therefore, iajx'~ l is a nonzero monomial in p'{x), because k has characteristic 0, and so 
p'{x) f 0. Finally, since p(x) is irreducible, its only divisors are constants and associates; 
as p'(x) has smaller degree, it is not an associate of pix), and so the gcd (p', p) — 1. • 

Recall Theorem 3.120(i): If E/k is an extension and a e E is algebraic over k , then 
there is a unique monic irreducible polynomial irr(a, k ) e k[x\, called its minimal polyno¬ 
mial, having a as a root. 

Definition. Let E/ k be an algebraic extension. An irreducible polynomial p (x) is sepa¬ 
rable if it has no repeated roots. An arbitrary polynomial fix) is separable if each of its 
irreducible factors has no repeated roots. 

An element a e E is called separable if either a is transcendental over k or if a is 
algebraic over k and its minimal polynomial irr(a, k ) has no repeated roots; that is, irr(«, k) 
is a separable polynomial. 

A field extension E/k is called a separable extension if each of its elements is separa¬ 
ble; E/k is inseparable if it is not separable. 

Lemma 4.4 shows that every extension of a field of characteristic 0 is a separable exten¬ 
sion. If E is a finite field with p n elements, then Lagrange’s theorem (for the multiplicative 
group E x ) shows that every element of £ is a root of x p — x. We saw, in the proof of 
Theorem 3.127 (the existence of finite fields with p" elements), that x p — x has no re¬ 
peated roots. It follows that if k C E, then E/k is a separable extension, for if a € E, then 
irr(a!, k) is a divisor of x p —x. 

Example 4.5. 

Here is an example of an inseparable extension. Let k = F p (t) = Frac(F p [f]), and let 
E — kia), where a is a root of fix) = x p — t; that is, a p — t. In E[x], we have 

fix) — x p — t = x p — a p — (x — a) p . 
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If we show that a f k, then fix) is irreducible, by Proposition 3.126, and so f(x ) = 
irr(«, k) is an inseparable polynomial. Therefore, E/k is an inseparable extension. 

It remains to show that a f k. Otherwise, there are git), h(t) e F p [t] with a = 
g(t)/ hit). Hence, g — ah and g p — a p h p = th p , so that 

deg(g p ) = deg ith p ) — 1 + deg(/z p )- 

But p | deg ig p ) and p | deg ih p ), and this gives a contradiction. ◄ 

We will study separability and inseparability more thoroughly in Chapter 6. 

Example 4.6. 

Let m be a positive integer, let A: be a field, and let fix) — x m — 1 e k[x). If the 
characteristic of k does not divide m, then mx' n ~ l ^ 0 and the gcd if, f) — 1; hence, 
fix) has no repeated roots. Therefore, any splitting field E/k of fix) contains m distinct 
with roots of unity. Moreover, the set of these roots of unity is a (multiplicative) subgroup 
of E x of order m that is cyclic, by Theorem 3.30. We have proved that if characteristic 
k \ m, then there exists a primitive /nth root of unity o> in some extension field of k, and a> 
is a separable element. 

On the other hand, if p e is a prime power and k has characteristic p, then x p — 1 = 
(x — \) p , and so there is only one //th root of unity, namely, 1. ■* 

Separability of E/k allows us to find the order of Gal( E/k). 

Theorem 4.7. 

(i) Let E / k be a splitting field of a separable polynomial fix) € k[x], let cp : k -> k' 
be a field isomorphism, and let E'/k' be a splitting field of f*ix) e k f [x\ [where 
f*ix) is obtained from fix) by applying q> to its coefficients ]. 



Then there are exactly [£ : k] isomorphisms <I>: E —> E' that extend ip. 

(ii) If E/k is a splitting field of a separable fix) e k[x\ then 

|Ga KE/k)\ = [E : k]. 

Proof (i) The proof, by induction on [E : k], modifies that of Lemma 3.130. If [E : k] = 
1, then E — k and there is only one extension <J> of tp, namely, qj itself. If [E : k] > 1, let 
fix) = pix)gix), where pfx) is an irreducible factor of largest degree, say, d. We may 
assume that d > 1, otherwise fix) splits over k and \E : k] = 1. Choose a root a of pix) 
(note that a e E because £ is a splitting field of fix) = pix)g(x)). If <p: k[a) —*■ E' 
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is any extension of cp, then cpia) is a root a' of p*(x), by Proposition 4.1; since f*(x) is 
separable, p*(x) has exactly d roots a' e E'\ by Lemma 4.2 and Theorem 3.120(ii), there 
are exactly d isomorphisms cp : k(a ) —>• kfa f ) extending cp, one for each of. Now E is 
also a splitting field of fix) over k{ot), because adjoining all the roots of fix ) to k{a) still 
produces E, and E' is a splitting field of f*(x) over k'ia'). Since [E : k(a )] = [£ : k\/d, 
induction shows that each of the d isomorphisms <p has exactly [E : k\/d extensions 
<t>: E —► E'. Thus, we have constructed [E : k] isomorphisms extending <p. But there are 
no others, because every r extending <p has r \k(a) = cp for some cp: k(a) -> k\ot'). 

(ii) In part (i), take k = k!, E = E', and cp = \p. • 

Example 4.8. 

The separability hypothesis in Theorem 4.7(ii) is necessary. In Example 4.5, we saw that 
if k = F pit) and a is a root of x p — 1, then E = k{a) is an inseparable extension. 
Moreover, x p — t = (x — a) p , so that a is the only root of this polynomial. Therefore, if 
o e Gal (E/k). then Proposition 4.1 shows that a(a) = a. Therefore, Gal(£/k) = {1}, 
by Lemma 4.2, and so | Gal(£ /k)\ < [E : k] — p in this case. ◄ 

Corollary 4.9. Let E/k be a splitting field of a separable polynomial f(x) e fc[x] of 
degree n. If f(x) is irreducible, then n \ \ Gal(£/k)|. 

Proof. By the theorem, | G&\(E/k)\ — [E : k]. Let a e E be a root of fix). Since fix) 
is irreducible, [k(a) : k] = n, and 

[E :k] = [E : k(a)][kia) : k] = n[E : k(a)]. • 

We shall see, in Proposition 4.38, that if E/k is a splitting field of a separable polyno¬ 
mial, then E/k is a separable extension. 

Here are some computations of Galois groups of specific polynomials in Q[x], 

Example 4.10. 

(i) Let fix) — x 3 — 1 e Q[x], Now fix) = (x — l)(x 2 + x + 1), where x 2 + x + 1 
is irreducible (the quadratic formula shows that its roots w and cb, do not lie in Q). The 
splitting field of fix) is Q(&>), for or — cb, and so [Q(ft>) : Q] = 2. Therefore, 

| Gal(Q(&>)/Q)| = 2, by Theorem 4.7(ii), and it is cyclic of order 2. Its nontrivial ele¬ 
ment is complex conjugation. 

(ii) Let fix) = x 2 — 2 e Q[x], Now fix) is irreducible with roots ±\/2, so that E = 
QiV2) is a splitting field. By Theorem 4.7(h), | Gal(E /Q) | = 2. Now every element of E 
has a unique expression of the form a + bV 2, where a, b e Q [Theorem 3.117(v)]. and it 
is easily seen that o : E -> E, defined by a: a + b\!2 i->- a -by/2, is an automorphism 
of E fixing Q. Therefore, GalfE/Q) = (cr), where a interchanges y/2 and —y/2. 

(iii) Let g(x) — x 3 — 2 e Q[x]. The roots of gix) are a, coa, and ora, where a — y/2, the 
real cube root of 2, and a> is a primitive cube root of unity. It is easy to see that the splitting 
field of g(x) is E — Q(a, u>). Note that 

[E:Q] = [E: Q(a)][Q(a) : Q] = 3 [E : Q(a)], 
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for g(x) is irreducible over Q (it is a cubic having no rational roots). Now E Q(a), 
for every element in Q(or) is real, while the complex number to is not real. Therefore, 
[E : Q] = | Gal( E /Q)| > 3. On the other hand, we know that Gal(£/Q) is isomorphic to 
a subgroup of S 3 , and so we must have GallE/Q) = S 3 . 

(iv) We examined fix) — x 4 — 10v 2 + 1 e Q[x] in Example 3.122, when we saw that 
f(x) is irreducible; in fact, fix) — irr(/l,Q), where ft = V2 + V3. If E = Q(/S), 
then [£ : Q] = 4; moreover, £ is a splitting field of fix), where the other roots of 
fix ) are — Jl — V^, — V 2 + V3, and \fl — V3. It follows from Theorem 4.7(h) that if 
G — Gal(£/Q), then |G| =4; hence, either G = I 4 or G = V. 

We also saw, in Example 3.122, that E contains V2 and V3. If a is an automorphism 
of E fixing Q, then er(V 2 ) = mV 2, where u — ±1, because (er(V2 ) 2 = 2 . Therefore, 
( 7 2 (V 2 ) = it(mV 2 ) = mct(V 2 ) = m 2 V 2 = V 2 ; similarly, cr 2 (V3) = V3. If a is a root of 
fix), then a — mV 2 + i>V3, where u, v — ± 1 . Hence, 

cr 2 (a) — ua 2 iV2) + i><t 2 (V3) = mV 2 + uV3 = a. 

Lemma 4.2 gives a 2 = lg for all a e Gal(£/Q), and so Gal(£/Q) = Y. 

Here is another way to compute G = Gal(£/Q). We saw in Example 3.122 that E — 
Q(V2 + V3) = Q(V2, V3) is also a splitting field of g(x) — ix 2 — 2)(x 2 — 3) over Q. By 
Proposition 3.120(h), there is an automorphism tp : Q(V2) —> Q(V2) taking V2 i-> — V2. 
But V3 i Q(V2), as we noted in Example 3.122, so that x 2 — 3 is irreducible over 
Q(V2). Lemma 3.130 shows that tp extends to an automorphism T>: E —>• E ; of course, 
<J> e Gal( E /Q). There are two possibilites: <I>(V3) = ±V3. Indeed, it is now easy to see 
that the elements of GaHE/Q) correspond to the four-group, consisting of the identity and 
the permutations (in cycle notation) 

(V 2 , -V2)(V3, V3), (V 2 , -V2)(V3, -V3), (V 2 , V2)(V3, -V3). ◄ 

Here are two more general computations of Galois groups. 

Proposition 4.11. If m is a positive integer, ifk is a field, and if E is a splitting field of 
x m — 1 over k, then Ga \)E/k) is abelian', in fact, Gal iE/k) is isomorphic to a subgroup 
of the multiplicative group U( I m ) of all [/] with (i, m) — 1. 

Proof Assume first that the characteristic of k does not divide m. By Example 4.6, E 
contains a primitive mth root of unity, to, and so E — kito). The group of all roots of 
x m — 1 in E is cyclic, say, with generator to, so that if o e G'd\(E/k), then its restriction is 
an automorphism of the cyclic group ( a>). Hence, a (a>) = to' must also be a generator of 
(to ); that is, (i, m) = 1, by Theorem 2.33(i). It is easy to see that i is uniquely determined 
mod m, so that the function tp: Gal ikiw)/k) —> I/(I m ), given by tpicr) = [/] if o-(&>) = 
is well-defined. Now <p is a homomorphism, for if r (&>) = oof then 

roico) = r(ftj') = (co') J — off 


Finally, Lemma 4.2 shows that <p is injective. 
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Suppose now that k has characteristic p and that m — p e n, where p j n. By Exam¬ 
ple 4.6, there is a primitive nth root of unity co, and we claim that E = kico) is a splitting 
field of x m — 1. If = 1, then 1 = ^ p ‘ n = (t; n ) pe . But the only p e th root of unity is 
1, since k has characteristic p. and so = 1; that is, f e k(oi). We have reduced to the 
case of the first paragraph. [In fact, more is true in this case: Gal(£/ k ) is isomorphic to a 
subgroup of the multiplicative group U (I,,)-] • 


Remark. We cannot conclude more from the proposition; given any finite abelian group 
G, there is some integer m with G isomorphic to a subgroup of U (I m ). ◄ 


Theorem 4.12. If p is a prime, then 


GaKF^n /¥ p ) = I„, 


and a generator is the Frobenius F: u i—>• u p . 

Proof. Let q — p n , and let G = GalfF^/F^). Since F ? has characteristic p , we have 
(i a + b) p = a p + b p , and so the Frobenius F is a homomorphism of fields. As any 
homomorphism of fields, F is injective; as F (/ is finite, F must be an automorphism, by 
Exercise 1.58 on page 36; that is, F e G. 

If it e ¥ q is a primitive element, then d(x) = irrfjr, F ; ,) has degree n, by Corol¬ 
lary 3.128, and so |G| = n, by Theorem 4.7(h). It suffices to prove that the order j of F is 
not less than n. But if F' — If for j < n, then u pJ — u for all of the q — p n elements 
u e Fq, giving too many roots of the polynomial x pl — x. • 

The following nice corollary of Lemma 3.130 says, in our analogy between Galois the¬ 
ory and symmetry of polygons, that irreducible polynomials correspond to regular poly¬ 
gons. 

Proposition 4.13. Let k be afield and let p(x) € A’[x] have no repeated roots. If E / k is 
a splitting field of p(x), then p(x) is irreducible if and only ifGa\(E/k ) acts transitively 
on the roots of p(x). 

Proof. Assume that p(x) is irreducible, and let a, f e E be roots of p(x ). By Theo¬ 
rem 3.120(i), there is an isomorphism q> : k(a ) -> kifi) with (plot) — f and which fixes 
k. Lemma 3.130 shows that <p extends to an automorphism T of E that fixes k\ that is, 
<t> e Gal (E/k). Now <t>(a) = (p(a ) = fi, and so Gal (E/k) acts transitively on the roots. 

Conversely, assume that Ga l(E/k) acts transitively on the roots of p(x). If p(x) = 
qi(x) ■ ■ ■ q t (x) is a factorization into irreducibles in k[x], where t > 2, choose a root 
a e E of qiix) and choose a root f e E of <72 U')- By hypothesis, there is a e Gal (E/k) 
with cr(a) = f. Now 0 permutes the roots of qi(x), by Proposition 4.1. However, f is not 
a root of £/i (x), because p(x ) has no repeated roots, and this is a contradiction. Therefore, 
t = 1; that is, p(x) is irreducible. • 
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We can now give another proof of Corollary 4.9. Theorem 2.98 says that if X is a G-set, 
then |G| = \0(x)\\G x |, where 0{x) is the orbit of x e X. In particular, if X is a transitive 
G-set, then \X | is a divisor of |G|. Let fix) e k[x] be a separable irreducible polynomial 
of degree n, and let E/k be its splitting field. If X is the set of roots of f(x), then X is 
a transitive Gal(£/A;)-set, by Proposition 4.13, and so n — deg(/) = \X | is a divisor of 
|Gal(£7fc)|. 

The analogy 3 is complete. 

Polygon £2.polynomial f(x) e k[x) 

Regular polygon.irreducible polynomial 

Vertices of £2.roots of f(x ) 

Plane.splitting field E of fix) 

Motion.automorphism fixing k 

Symmetry group £(G).Galois group Gal {E/k) 

Here is the basic strategy. First, we will translate the classical formulas (giving the 
roots of polynomials of degree at most 4) in terms of subfields of a splitting field E over k. 
Second, this translation into the language of fields will itself be translated into the language 
of groups: If there is a formula for the roots of fix), then Gal {E/k) must be a solvable 
group (which we will soon define). Finally, polynomials of degree at least 5 can have 
Galois groups that are not solvable. The conclusion is that there are polynomials of degree 
5 for which there is no formula, analogous to the classical formulas, giving their roots. 


Formulas and Solvability by Radicals 

Without further ado, here is the translation of the existence of a formula for the roots of a 
polynomial in terms of subfields of a splitting field. 

Definition. A pure extension of type m is an extension k{u)/k, where u m e k for some 
m > 1. An extension K/k is a radical extension if there is a tower of fields 

k = K 0 c K\ c • • • c K, = K 

in which each K, + i/Ki is a pure extension. 

If u m = a e k, then k(u) arises from k by adjoining an with root of a. If k C C, there 
are m different with roots of a, namely, u, cou, aru , ..., co m ~ l u, where co — e 27r! /" ! j s a 
primitive with root of unity. More generally, if k contains the with roots of unity, then a pure 
extension k{u) of type m, that is, u' n = a e k, then k(u) is a splitting field of x m — a. Not 
every subfield k of C contains all the roots of unity; for example, 1 and — 1 are the only 
roots of unity in Q. Since we seek formulas involving extraction of roots, it will eventually 
be convenient to assume that k contains appropriate roots of unity. 

3 Actually, a better analogy would involve polyhedra in euclidean space R" instead of only polygons in the 
plane. 
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When we say that there is a formula for the roots of a polynomial fix) analogous to the 
quadratic formula, we mean that there is some expression giving the roots of fix) in terms 
of the coefficients of fix). The expression may involve the field operations, constants, 
and extraction of roots, but it should not involve any other operations involving cosines, 
definite integrals, or limits, for example. We maintain that a formula as we informally 
described exists precisely when fix) is solvable by radicals, which we now define. 

Definition. Let fix) e £[x] have a splitting field E. We say that fix) is solvable by 
radicals if there is a radical extension 

k = K 0 c K x c • • • c K t 


with E C K,. 

Actually, there is a nontrivial result of Gauss that we are assuming. It is true, but not 
obvious, that x n — 1 is solvable by radicals in the sense that there is the desired sort of 
expression for 

e 2ni/n =cos(^:) +/sin(=f) 

(see van der Waerden, Modern Algebra I, pages 163-168, or Tignol, Galois’ Theory of 
Algebraic Equations, pages 252-256). This theorem of Gauss is what enabled him to 
construct a regular 17-gon with ruler and compass. 

Let us illustrate this definition by considering the classical formulas for polynomials of 
small degree. 


Quadratics 

If f(x) = x 2 + bx + c, then the quadratic formula gives its roots as 

\ b ± s/b 2 — 4c\. 

Let k — Qib, c). Define K\ — kin), where u — \jb 2 — 4c. Then K\ is a radical extension 
of k, for ii 2 e k. Moreover, the quadratic formula implies that K\ is the splitting field of 
fix), and so fix) is solvable by radicals. 

Cubics 

Let fiX) — X 3 + bX 2 + cX + d, and let k = Qib, c, d). The change of variable 
X — x — \b yields a new polynomial fix) — x 3 + qx + r e k[x] having the same 
splitting field E [for if u is a root of fix), then u — j b is a root of fix)]', it follows that 
fix) is solvable by radicals if and only if fix) is. Special cases of the cubic formula were 
discovered by Scipio del Ferro around 1515, and the remaining cases were completed by 
Niccolo Fontana (Tartaglia) in 1535 and by Giralamo Cardano in 1545. The formula gives 
the roots of fix) as 

g + h, cog + a> h, and co g + coh. 
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where g 3 = j r + V~R^j, h = —q/3g, R — r 2 + yjq 3 , and a> = — ^ +i ^ is a primitive 
cube root of unity. 

The cubic formula is derived as follows. If u is a root of f(x) — x 3 + qx + r, write 

u — g + h, 


and substitute: 


0 = f(u) = fig + h) = g 3 + /z 3 + (3 gh + q)u + r. 

Now the quadratic formula can be rephrased to say, given any pair of numbers u and v, that 
there are (possibly complex) numbers g and h with u — g + h and i> = gh. Therefore, we 
can further assume that 3gh + q — 0; that is, 

g 3 + /z 3 = —r and gh = — ^q. 

After cubing the latter, the resulting pair of equations is 

g 3 + h 3 = -r 

■? 3/;3 = - 77 ^’ 

giving the quadratic in g 3 : 

g 6 + rg 3 -±q 3 =0. 

The quadratic formula gives 

.? 3 = + f^) = H~ r + SK) 

[note that h 3 is also a root of this quadratic, so that h 3 — k(—r — There are three 

cube roots of g 3 : g. cog. and arg. Because of the constraint gh — — ^q, each of these has 
a “mate,” namely, h = —q/(3g). —q/(3cog) = arh, and —q/(3co 2 g) = wh. 

Let us now see that fix) is solvable by radicals. Define K\ = k{\fR). where R — 
r 2 + ^q 3 , and define K2 = Kfa), where a 3 = \{—r + \/R). The cubic formula shows 
that K2 contains the root a + f of fix), where ft = ~q/ 3 a. Finally, define K3 = K2ico), 
where co 3 — 1 . The other roots of fix) are coa + w 2 f and u> 2 a + cof. both of which lie in 
K3, and so E C K3. 

A splitting field E need not equal K 3 , for if all the roots of fix) are real, then E C 
K, whereas Kt, % R. An interesting aspect of the cubic formula is the so-called casus 
irreducibilis\ the formula for the roots of an irreducible cubic in Q[x] having all roots real 
requires the presence of complex numbers (see Rotman, Galois Theory, 2d ed., page 99). 

Casus Irreducibilis. If fix) = x 3 + qx + r e Q[x] is an irreducible polynomial having 
three real roots, then any radical extension K,/ Q containing the splitting field of fix) is 
not real; that is, K, £ R. 
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Example 4.14. 

If f(x) = x 3 - 15* - 126, then q = -15, r = -126, R = 15376, and J~R = 124. Hence, 
g 3 = 125, so that g — 5. Thus, h = —q/(3g ) = 1. Therefore, the roots of fix) are 

6 , 5w T co~ = —3 + 2/V3, 5co~ T co — —3 — 2i\f2. 

Alternatively, having found one root to be 6 , the other two roots can be found as the roots 
of the quadratic /(*)/(* — 6 ) = x 2 + 6 * + 21 . ◄ 

Example 4.15. 

The cubic formula is not very useful because it often gives the roots in unrecognizable 
form. For example, let 

fix) — (x — l)(x — 2)(x + 3) = x 3 — lx + 6 . 

The cubic formula gives 

g + A = ^(- 6 + v /^) + ^(- 6 - v /^). 

It is not at all obvious that g + h is a real number, let alone an integer. There is another 
version of the cubic formula, due to F. Viete, which gives the roots in terms of trigonometric 
functions instead of radicals (see my book, A First Course in Abstract Algebra, pp. 360- 
362). 


Quartics 

Let fiX) = X 4 + bX 3 +cX 2 +dX + e, and let k = Qib, c, d, e). The change of variable 
X — x — ^b yields a new polynomial fix) — x 4 + qx 1 + rx + s e k[x]; moreover, the 
splitting field E of fix) is equal to the splitting field of fix), for if u is a root of fix), 
then u — ^b is a root of fix). The quartic formula was found by Luigi Ferrari in 1545, but 
here is the version presented by R. Descartes in 1637. Factor fix) in C[x]: 

fix) = x 4 + qx 2 + rx + s = (x 2 + jx + f)(x 2 — jx + m), 

and determine j, i and m. Expanding and equating like coefficients gives the equations 

9 

l + m — j — q\ 
j(m -i) = r\ 

Im — s. 


2m = j 2 +q + r/j\ 
21 = j 2 +q- r/j. 


The first two equations give 





210 


Fields Ch. 4 


Substituting these values for m and i into the third equation yields the resolvent cubic: 
O' 2 ) 3 + 2q(j 2 ) 2 + ( q 2 - 4s)j 2 - r 2 . 

The cubic formula gives j 2 , from which we can determine m and l, and hence the roots of 
the quartic. 

Define pure extensions 

k = Kq c K\ c K2 c K 3 , 

as in the cubic case, so that j 2 e K 3. Define K4 = K 3 (j) (so that t, m e Kf). Finally, 
define K5 = K4 ^j 2 — 4lJ and K(, = K5 j 2 — 4 [giving roots of the quadratic 
factors x 2 + jx + £ and x 2 — jx + m of f(x)]. The quartic formula gives E c K g. 

We have just seen that quadratics, cubics, and quartics are solvable by radicals. Con¬ 
versely, if f(x) is a polynomial that is solvable by radicals, then there is a formula of the 
desired kind that expresses its roots in terms of its coefficients. For suppose that 

k = K 0 C Ki C • • • C K, 

is a radical extension with splitting field E C K, . Let z be a root of fix). Now K, = 
K t -i(u), where u is an wth root of some element a e K,-\: hence, z can be expressed in 
terms of u and K t _ 1; that is, z can be expressed in terms of %/a and K t _\. But K,_\ = 
Kt-2(v), where some power of v lies in K t - 2. Hence, z can be expressed in terms of m, 
u, and K,-2- Ultimately, z is expressed by a formula analogous to those of the classical 
formulas. 


Translation into Group Theory 

The second stage of the strategy involves investigating the effect of f(x) being solvable by 
radicals on its Galois group. 

Suppose that k(u)/k is a pure extension of type 6; that is, u 6 e k. Now k(u 2 )/k is a 
pure extension of type 2, for (m 3 ) 2 — u 6 e k, and k(u)/k(u 3 ) is obviously a pure extension 
of type 3 . Thus, k(u)/k can be replaced by a tower of pure extensions k C k(n 3 ) C k(u) 
of types 2 and 3 . More generally, we may assume, given a tower of pure extensions, that 
each field is of prime type over its predecessor: If k C k{u) is of type m, then factor 
m — p 1 • • • p q , where the p’s are (not necessarily distinct) primes, and replace k C k(u) 
by 

k c k(u m/pi ) c k(u m/p 1P2 ) c ... C kiu). 

Here is a key result allowing us to translate solvability by radicals into the language of 
Galois groups. 

Theorem 4 . 16 . Let k C B C E be a tower of fields, let fix), g(x ) e k[x], let B be a 
splitting field of fix) over k, and let E be a splitting field ofg(x) overk. Then Gal (E/B) 
is a normal subgroup of Gal(£/ k), and 


Gal(E / k) / Gal(£ /B) = Gal (B/k). 
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Proof. Let B — k(z 1 ,..., Zt). where zi, ■ ■ ■, Zt are the roots of fix) in E. If cr e 
Gal(£/k), then o permutes zi, ■ ■ ■, Zt, by Proposition 4. l(i) (for o fixes k), and so o(B) = 
B. Define p: Gal (E/k) —> Gal (B/k) by cr i-^ <j\ B. It is easy to see, as in the proof of 
Theorem 4.3, that p is a homomorphism and that kerp = Gal(£//i). It follows that 
Gal {E/B) is a normal subgroup of Gal( E / k). But p is surjective: If r e Gal {B/k), then 
Lemma 3.130 applies to show that there is o e Gal (E/k) extending r [i.e.,p(cr) — o\B — 
r]. The first isomorphism theorem completes the proof. • 

The next technical result will be needed when we apply Theorem 4.16. 

Lemma 4.17. 

(i) If B = k(a i, ..., a n ) is a finite extension of a field k, then there is a finite extension 
E/B that is a splitting field of some polynomial fix) e k[x] (such an extension 
of smallest degree is called a normal^ closure of B/k). Moreover, if each a, is 
separable over k, then fix) can be chosen to be a separable polynomial. 

(ii) If B is a radical extension of k, then the extension E/B in part ( i ) is a radical 
extension ofk. 

Proof, (i) By Theorem 3.120(i), there is an irreducible polynomial p, (x ) = irr(a,-,k) 
in k[x\, for each i, with Pi(oti) — 0, and a splitting field E of f{x) — p\ (x) - - ■ p n (x) 
containing B. If each a, is separable over k, then each p, (x ) is a separable polynomial, 
and hence fix) is a separable polynomial. 

(ii) For each pair of roots a and a' of any p, (x), there is an isomorphism y: k(a) —> 
k(a') which fixes k and which takes a a' , for both k(a) and k{a') are isomorphic 
to k[x\/(pj(x)). By Lemma 3.130, each such y extends to an automorphism a e G = 
Gal (E/k). It follows that E = k{o{u i), ..., o(u t ) : o e G). 

If B/k is a radical extension, then 


k c k(u i) c k(u i, M2) c ... c k(u 1 ,..., u,) = B , 


where each k(u 1 , ..., m,-+ 1 ) is a pure extension of k{u\, . .., m,); of course, cr (B) — 
k{<j(u 1 ), ..., cr(u t )) is a radical extension of k for every cr e G. We now show that E 
is a radical extension of k. Define 

B\ = k(o(u 1 ) : o e G). 

Now if G — {1, cr, r, ...}, then the tower 


k c k(u 1) c k(u 1, oiu 1)) c k(ui, ct(mi), r iu\)) c c fij 


displays B\ as a radical extension of k. For example, if u 1 " lies in k, then r (mi )"' = t(m" ! ) 
lies in r ik) — k, and hence riu\) m lies in k C kiu\, ct(mi)). Assuming, by induction, that 

4 We often call an extension E/k a normal extension if it is the splitting field of some set of polynomials in 
k[xf 
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a radical extension Bj/k containing {cj(uj) : o e G) for all j < i has been constructed, 
define 

Bi +1 = Bi(a(u i+ 1 ) : a e G). 

It is easy to see that Bj + i/Bj is a radical extension: If e k(u\,... ,u{), then 

r(M i _|_i) m e k{r{u\), ..., r( m,)) C it follows that B , + 1 is a radical extension of k. 
Finally, since E — B t , we have shown that £ is a radical extension of k. • 

We can now give the heart of the translation we have been seeking. 

Lemma 4.18. Let 

Kq c Ki c K 2 c • • • c K, 

be a radical extension of a field Kq. Assume, for each i > 1, that each Kj is a pure 
extension of prime type p , over £;_i, where p, char(Ko), and that Kq contains all the 
Pith roots of unity. If K t is a splitting field over Kq, then there is a sequence of subgroups 

Gal(£,/£ 0 ) = Go > Gi > G 2 > ■ ■ ■ > G, = {1}, 

with each G;+i a normal subgroup of Gi and with G ; /G,+i cyclic of prime order pi+\. 
Proof For each i, define G, = Gal( K, /Ki). It is clear that 

Gh\{K,/Kq) = Go > Gi > G 2 > • • • > G, = {1} 

is a sequence of subgroups. Since K\ = Kq(u), where it P] e Kq, the assumptions that 
char(£o) Pi an d that Kq contains all the pith roots of unity implies that Kq contains 
a primitive pith root of unity or, hence, K\ is a splitting field of the separable polynomial 
x pi — U P ', for the roots are u, cou,..., a> pt ~ l u. We may thus apply Theorem 4.16 to see 
that Gi = Gal {K,/K\ ) is a normal subgroup of Go = Gal( K,/ Kq) and that Gq/Gi = 
Gal(£i/£o)- By Theorem 4.7(ii), Gq/G\ = I pi . This argument can be repeated for 
each i. • 

We have been led to the following definition. 

Definition. A normal series 5 of a group G is a sequence of subgroups 
G = Go > Gi > G 2 > • • • > Gf = {1} 

with each G,+i a normal subgroup of G, ; the factor groups of this series are the quotient 
groups 

G 0 /G 1 , G 1 /G 2 ,..., G„_i/G„. 

A finite group G is called solvable if it has a normal series each of whose factor groups has 
prime order (see the definition of infinite solvable groups on page 286). 

5 This terminology is not quite standard. We know that normality is not transitive; that is, if H < K are 
subgroups of a group G, then H <3 K and K <3 G does not force H <3 G. A subgroup H < G is called a 
subnormal subgroup if there is a chain 

G = G 0 > Gi > G 2 > • • • > G, = H 

with Gj <1 G,_i for all / > 1, Normal series as defined in the text are called subnormal series by some authors; 
they reserve the name normal series for those series in which each G, is a normal subgroup of the big group G. 
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In this language. Lemma 4.18 says that Ga\(K t / Kq ) is a solvable group if K, is a radical 
extension of Kq and Kq contains appropriate roots of unity. 

Example 4.19. 

(i) By Exercise 2.86(h) on page 113, every finite abelian group G has a (necessarily normal) 
subgroup of prime index. It follows, by induction on |G|, that every finite abelian group is 
solvable. 

(ii) Let us see that S 4 is a solvable group. Consider the chain of subgroups 

S 4 > A 4 > V > W > {1}, 

where Y is the four-group and W is any subgroup of V of order 2. Note, since Y is 
abelian, that W is a normal subgroup of V. Now IS 4 /A 4 I = |S 4 |/|A 4 | = 24/12 = 2, 
|A 4 /V| = |A 4 |/|V| = 12/4 = 3, |V/W| = |V|/|W| = 4/2 = 2, and \W/{\}\ = |W| = 2. 
Since each factor group has prime order, S 4 is solvable. 

(iii) A nonabelian simple group G, for example, G = A 5 , is not solvable, for its only 
proper normal subgroup is {1}, and G/{ 1} = G is not cyclic of prime order. 4 

The awkward hypothesis in the next lemma, about roots of unity, will soon be removed. 

Lemma 4.20. Let k be a field and let f(x) e k[xf be solvable by radicals, so there is 
a radical extension k = Kq C K\ C ■ ■ ■ C K t with K, containing a splitting field E of 
f(x). If each Kj/K ,-1 is a pure extension of prime type pi, where pi charik), and if 
k contains all the pith roots of unity, then the Galois group Ga \{E / k) is a quotient of a 
solvable group. 

Proof. There is a tower of pure extensions of prime type 

k = K 0 c Ki c K 2 c • • • c K, 

with E C K,; by Lemma 4.17, we may assume that K, is also a splitting field of some poly¬ 
nomial in k[jc]. The hypothesis on k allows us to apply Lemma 4.18 to see that Gal (K t /k) 
is a solvable group. Since E and K t are splitting fields over k. Theorem 4.16 shows that 
Gal{K,/k)/Gal(K,/E) = Gal (E/k), as desired. • 

Proposition 4.21. Every quotient G/N of a solvable group G is itself a solvable group. 

Proof. Let G = Go > Gi > G 2 > • • • > G t — {1} be a sequence of subgroups as in the 
definition of solvable group. Since N < G, we have NGj a subgroup of G for all i, and so 
there is a sequence of subgroups 

G = GqN > GiN > > G,N = N > {1}. 

This is a normal series: With obvious notation, 

(gin)Gi +l N(gin)- 1 < gi G i+l Ngf l = giG i+l gf l N < G i+l N ; 
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the first inequality holds because n{Gi+\N)n ~ 1 < NGi+\N < (G;+iA0(G,+iA0 = 
Gi+\N (for Gi+iN is a subgroup); the equality holds because Ngf l = gf l N (for N <S G, 
and so its right cosets coincide with its left cosets); the last inequality holds because 
Gi- i-i <1 Gj. 

The second isomorphism theorem gives 

Gj _ GjiGj+iN) = GjN 
G, n (G i+ iN) ~ G i+l N G i+ iN’ 

the last equation holding because G/G /+1 = G,. Since G,+i <1 Gi fl G l+ \ N, the third 
isomorphism theorem gives a surjection Gi /G,+i —> Gj/[Gi fl G,+i IV], and so the com¬ 
posite is a surjection G,/G ;+ i —> GjN / Gj + \N. As Gj/Gi+\ is cyclic of prime order, its 
image is either cyclic of prime order or trivial. Therefore, G/N is a solvable group. • 

Proposition 4.22. Every subgroup H of a solvable group G is itself a solvable group. 
Proof Since G is solvable, there is a sequence of subgroups 

G = Go > G i > G 2 > • • • > Gf = {1} 

with Gj normal in G,_ 1 and G, _ 1 / G, cyclic, for all i. Consider the sequence of subgroups 
H = HnG 0 >HnGi>HnG 2 >--->HnG, = {1}. 

This is a normal series: If hi+\ e H fl G/+i and g,- e H C 1 Gi, then gjh l+ \gf ] e H, 
for gi,hj + i e H; also, g,h ! + \gf 1 e G,+i because G/+i is normal in Gi. Therefore, 
gihi+igf 1 e H fl G, + i, and so // fl G ; +i <3 // fl G;. Finally, the second isomorphism 
theorem gives 

(H n Gi)/{H n Gf +1 ) = (// n G,■)/[(// n g { ) n G i+l ] 

= Gi + i(HnGi)/G i+l . 

But the last (quotient) group is a subgroup of G;/G, + i. Since the only subgroups of a 
cyclic group C of prime order are C and {1}, it follows that the nontrivial factor groups 
(H fl Gi)/(H IT G,-+ 1 ) are cyclic of prime order. Therefore, H is a solvable group. • 

Example 4.23. 

In Example 4.19(h), we showed that S 4 is a solvable group. However, if n > 5, the 
symmetric group S n is not a solvable group. If, on the contrary, S n were solvable, then 
so would each of its subgroups be solvable. But A 5 < S 5 < S„ , and A 5 is not solvable 
because it is a nonabelian simple group. < 

Proposition 4.24. If H <\ G and if both H and G/H are solvable groups, then G is 
solvable. 

Proof. Since G/H is solvable, there is a normal series 

G/H >K^>K^>-- K* = {1} 
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having factor groups of prime order. By the correspondence theorem for groups, there are 
subgroups Ki of G, 

G > Ki > K 2 > • • • > K m = H, 

with Kj / H = K* and K l+ \ <| K, for all i. By the third isomorphism theorem, 

K*/K* +l = Kj/Kj+i 

for all i, and so Ki/K[ + 1 is cyclic of prime order for all i. 

Since H is solvable, there is a normal series 

H > H x > H 2 > •• H q = {1} 

having factor groups of prime order. Splice these two series together, 

G > Ki > K 2 > ■ ■ ■ > K m >H l >H 2 >--H q = {1}, 
to obtain a normal series of G having factor groups of prime order. • 

Corollary 4.25. If H and K are solvable groups, then H x K is solvable. 

Proof. Since (H x K)/H = K , the result follows at once from Proposition 4.24. • 

We return to fields, for we can now give the main criterion that a polynomial be solvable 
by radicals. 

Theorem 4.26 (Galois). Let f(x) e k[x], where k is a field, and let E be a splitting 
field of f(x) over k. If fix) is solvable by radicals, then its Galois group Gal (E/k) is a 
solvable group. 

Remark. The converse of this theorem is false if k has characteristic p > 0 (see Propo¬ 
sition 4.56), but it is true when k has characteristic 0 (see Theorem 4.53). ◄ 

Proof. In the proof of Lemma 4.20, we assumed that the ground field contained certain 
Pi th roots of unity (the primes p ,• were types of pure extensions). Define m to be the 
product of all these pi, define E* to be a splitting field of x m — 1 over E, and define 
k* = k(£2), where £2 is the set of all wth roots of unity in E*. Now E* is a splitting field 
of f(x) over k*, and so Gal( E*/k*) is solvable, by Proposition 4.21. 


E* 



k 
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Consider the tower k Cfc £*; we have Gal(£’*/A:*)<lGal(£ , */A:), by Theorem 4.16, 
and 

Gal(E*/k)/Gal(E*/k*) = Gal(k*/k). 

Now Gal(£*/A:*) is solvable, while Gal (k*/k) is abelian, hence solvable, by Proposi¬ 
tion 4.11; therefore. Gal {E*/k) is solvable, by Proposition 4.24. Finally, we may use 
Theorem 4.16 once again, for the tower k C E C E* satisfies the hypothesis that both E 
and E* are splitting fields of polynomials in k[x ] [E* is a splitting field of (x m — 1 )/(x)]. 
It follows that Gal(E*/k)/ Gal(E*/E) = GalfE /k), and so Gal (E/k) is solvable, for it is 
a quotient of a solvable group. • 

Recall that if k is a field and E — k(y i, .... y„ ) = Frac(T[vi, ..., y,i]) is the field of 
rational functions, then the general polynomial of degree n over k is 


(x - y\)(x - y 2 ) ■ ■ ■ (x - y n ). 


Galois’s theorem is strong enough to prove that there is no generalization of the quadratic 
formula for the general quintic polynomial. 

Theorem 4.27 ( Abel-Ruffini ). If n > 5, the general polynomial of degree n 
f(x) = (x - yi)(x - y 2 ) ■ ■ ■ (x - y n ) 
over a field k is not solvable by radicals. 

Proof In Example 3.125, we saw that if E = k{y \, ..., y „) is the field of all rational 
functions in n variables with coefficients in a field k, and if F — k(ao ,..., a„- 1 ), where 
the ai are the coefficients of fix), then E is the splitting field of f(x) over F. 

We claim that Ga \{E/F) = S„. Exercise 3.47(i) on page 150 says that if A and R 
are domains and <f>\ A -> R is an isomorphism, then a/b (p(a)/q){b) is an isomor¬ 
phism Frac(A) —> Frac( R). In particular, if a e S „, then there is an automorphism a of 
A'| vi, .... y n \ defined by a : f{y\, ..., y„) i-> f(y a i, ..., y a n)~ that is, a just permutes 
the variables, and a extends to an automorphism ct* of E = Frac(/c[ vi, ..., y„]). Equa¬ 
tions (1) on page 198 show that a* fixes F , and so a* e Galf E /F). Using Lemma 4.2, it is 
easy to see that a i-^ a* is an injection S„ —*■ Gal(£/ F ), so that \S n \ < \ Gal(£ /F)\. On 
the other hand. Theorem 4.3 shows that Gi\\(E/F) can be imbedded in S „, giving the re¬ 
verse inequality | Ga\(E/F)\ < S n \. Therefore, Gal (E/F) = S n . But S n is not a solvable 
group if n > 5, by Example 4.23, and so Theorem 4.26 shows that fix) is not solvable by 
radicals. • 

We know that some quintics in Q[x] are solvable by radicals; for example, x 5 — 1 is 
solvable by radicals, for its Galois group is abelian, by Proposition 4.11. On the other 
hand, we can give specific quintics in Q[x] that are not solvable by radicals. For example, 
f(x) — x 5 — 4x + 2 e Q[x] is not solvable by radicals, for it can be shown that its Galois 
group is isomorphic to S 5 (see Exercise 4.13 on page 218). 
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Exercises 

4.1 Given u, v e C, prove that there exist g, h e C with u = g + h and v — gh. 

4.2 Show that the quadratic formula does not hold for ax~ + bx + c e k\x] when characteristic!^) 

= 2 . 

4.3 (i) Find the roots of f(x) = x 3 — 3x + 1 e Q[jr], 

(ii) Find the roots of f(x) — x 4 — 2x~ + 8* — 3 e Q[.v], 

4.4 Let f{x) e E[x], where £ is a field, and let o': E —> £ be an automorphism. If f(x) splits 
and a fixes every root of /( x), prove that a fixes every coefficient of fix). 

4.5 iAccessory Irrationalities) Let E/k be a splitting field of fix) e k[x\ with Galois group 
G = Gal(£/ k). Prove that if k*/ k is a field extension and E* is a splitting field 


E* 



k 


of fix) over k* , then restriction, a i->- a\E, is an injective homomorphism 

Gal(£*/fc*) Gal(£/L). 

Hint. If a e Gal(£*/L*), then a permutes the roots of fix), so that cr|£ e Gal(£/L). 

4.6 (i) Let K/k be a field extension, and let fix) e k[x] be a separable polynomial. Prove that 

fix) is a separable polynomial when viewed as a polynomial in K\x). 

(ii) Let k be a field, and let fix), gix) e k[x). Prove that if both fix) and gix) are separable 
polynomials, then their product fix)gix) is also a separable polynomial. 

4.7 Let k be a field and let fix) e k \x ] be a separable polynomial. If E/k is a splitting field of 
fix), prove that every root of fix) in £ is a separable element over k. 

4.8 Let K/k be a field extension that is a splitting field of a polynomial fix) e k[x\. If pix) e 
k\x | is a monic irreducible polynomial with no repeated roots, and if 

P(x) = giix) ■ ■ ■ g r ix) in K[x ], 

where the giix) are monic irreducible polynomials in K[x}, prove that all the giix) have the 
same degree. Conclude that deg(p) = r deg(g, ). 

Hint. In some splitting field E/K of pix) fix), let a be a root of giix) and f) be a root of 
gjix), where i j. There is an isomorphism <p: kia) —> k(f>) with (pia) = ft, which fixes k 
and which admits an extension to <t>: £ —>■ £. Show that <t>| K induces an automorphism of 
K[x ] taking giix) to gjix). 

4.9 (i) Give an example of a group G having a subnormal subgroup that is not a normal sub¬ 

group. 

(ii) Give an example of a group G having a subgroup that is not a subnormal subgroup. 
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4.10 Prove that the following statements are equivalent for a quadratic f(x) = ax~ + bx + c e 

QM. 

(i) f{x) is irreducible in Q[jr], 

(ii) \Jb^ — 4 ac is not rational. 

(iii) Gal(Q(\/ b 2 — 4 ac), Q) has order 2. 

4.11 Let k be a field, let f{x) e k[x] be a polynomial of degree p, where p is prime, and let E/k 
be a splitting field. Prove that if Gal (E/k) = I p , then f{x) is irreducible. 

Hint. Show that fix) has no repeated roots. 

4.12 (i) Prove that if a is a 5-cycle and r is a transposition, then S$ is generated by {a, r j. 

Hint. Use Exercise 2.94(iii) on page 114. 

(ii) Give an example showing that S n , for some n, contains an »-cycle a and a transposition 
r such that (o', t) 7 ^ S„. 

4.13 Let f(x) = x 5 — 4x + 2 e Q[x] and let G be its Galois group. 

(i) Assuming that fix) is an irreducible polynomial, prove that |G| is a multiple of 5. 
[We can prove that fix) is irreducible using Eisenstein's criterion. Theorem 6.34 on 
page 337.] 

(ii) Prove that fix) has three real roots and two complex roots, which are, of course, com¬ 
plex conjugates. Conclude that if the Galois group G of fix) is viewed as a subgroup of 
55 , then G contains complex conjugation, which is a transposition of the roots of fix). 

(iii) Prove that G = S 5 . and conclude that fix) is not solvable by radicals. 

Hint. Use Exercise 4.12. 

4.2 Fundamental Theorem of Galois Theory 

Galois theory analyzes the connection between algebraic extensions £ of a field k and 
the corresponding Galois groups Ga l{E/k). This connection will enable us to prove the 
converse of Galois’s theorem: If k is a field of characteristic 0, and if f (x ) e k[x] has 
a solvable Galois group, then fix) is solvable by radicals. The fundamental theorem of 
algebra is also a consequence of this analysis. 

We have already seen several theorems about Galois groups whose hypothesis involves 
an extension being a splitting field of some polynomial. Let us begin by asking whether 
there is some intrinsic property of an extension E/k that characterizes its being a splitting 
field, without referring to any particular polynomial in k[x]. It turns out that the way to 
understand splitting fields E/k is to examine them in the context of both separability and 
the action of the Galois group Gal {E/k) on E. 

Let £ be a field and let Aut(£) be the group of all (field) automorphisms of E. If k is 
any subfield of E , then Gal iE/k) is a subgroup of Aut(£), and so it acts on E. Whenever 
a group acts on a set, we are interested in its orbits and stabilizers, but we now ask for those 
elements of E stabilized by every a in some subset H of Aut(£). 

Definition. If E is a field and H is a subset of Aut(£), then the fixed field of H is defined 
by 

E h = {a e E : aia) — a for all a e H}. 
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The most important instance of a fixed field E H arises when H is a subgroup of Aut(E), 
but we will meet a case in which it is merely a subset. 

It is easy to see that if o e Aut(£), then E° — {a e E : cr(a) — a} is a subfield of E; 
it follows that E H is a subfield of E. for 

e h = pi e ° . 

a eH 


In Example 3.125, we considered E — k(y\. ..., y „). the rational function field in n 
variables with coefficients in a field k, and its subfield K = k(ao, ..., a„- 1 ), where 

fix) = (x - yi)(x - y 2 ) ■ ■ ■ {x - y n ) = a Q + a\x H-h + x” 

is the general polynomial of degree n over k. We saw that £ is a splitting field of fix) 
over K , for it arises from K by adjoining to it all the roots of /Or), namely, all the y’s. 
Now the symmetric group S„ < Aut(£), for every permutation of vj, ..., y n extends to an 
automorphism of E, and it turns out that K — E Sn . The elements of K are usually called 
the symmetric functions in n variables over k. 


Definition. A rational function g(x i, ... , x n )/h{x \,..., x n ) e k(x i ,..., x n ) is a sym¬ 
metric function if it is unchanged by permuting its variables: For every cr e S „, we have 

g(x a 1,..., Xon ) / h (Xfj i,..., 

Xcrn ) = g(x 1,.. ,,x„)/h(xi - ,x„). 

The various polynomials in Eqs. (1) on page 198 define examples of symmetric func¬ 
tions; they are called the elementary symmetric functions. 

The proof of the following proposition is almost obvious. 


Proposition 4.28. If E is afield, then the function H i-^ E H , from subsets H ofAut(E) 
to subfields of E, is order-reversing: If H < L < Aut(£), then E L C E H . 

Proof. If a e E l , then a (a ) — a for all cr e L. Since H < L, it follows, in particular, 
that o(a ) = a for all o e H. Hence, E L C E H . • 


Example 4.29. 

Suppose now that A: is a sub field of E and that G — Gal (E/k). It is obvious that k C E ( f 
but the inclusion can be strict. For example, let E — Q(4^2). If a e G = Gal(£/Q), 
then o must fix Q, and so it permutes the roots of fix) = x 3 — 2. But the other two roots 
of fix) are not real, so that oly/l) — y/l. It now follows from Lemma 4.2 that o is the 
identity; that is, E G — E. Note that E is not a splitting field of fix). < 

Our immediate goal is to determine the degree | E : E (; ], where G < Aut( E). To this 
end, we introduce the notion of characters. 
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Definition. A character 6 of a group G in a field £ is a (group) homomorphism 
a: G -* E x , where E x denotes the multiplicative group of nonzero elements of the 
field E. 

If a e Aut(£), then its restriction <j\E x : E x —> £ x is a character in E. 

Definition. If £ is a field and G < Aut(£), then a list o\...., o„ of characters of G in 
E is independent if, whenever c\,... .c n e E and 

Y. c/ct/ (x) = 0 for all x e G , 


then all the c,- = 0 . 

In Example 3.82(iii), we saw that the set E x of all the functions from a set A to a field 
£ is a vector space over £, where addition of functions is defined by 

a + x : x i-> er(x) + r(x), 

and scalar multiplication is defined, for c e £, by 

co : x i—> cer(x). 

Independence of characters, as just defined, is linear independence in the vector space E x 
when X is the group G. 

Proposition 4.30 (Dedekind). Every list o \, ..., o n of distinct characters of a group G 
in afield E is independent. 

Proof. The proof is by induction on n > 1. The base step n = 1 is true, for if co (x) = 0 
for all x e G. then either c — 0 or o(x) = 0; but o(x) f 0, because imo C £ x . 

Assume that n > 1; if the characters are not independent, there are c, e E. not all zero, 
with 

ciffi(x) H-h c n —\ o n — \ (x) + c„o n (x) = 0 ( 2 ) 

for all x e G. We may assume that all c, f 0, or we may invoke the inductive hypothesis 
and reach a contradiction, as desired. Multiplying by cf 1 if necessary, we may assume that 
c n — 1. Since o n ^ cri, there exists y e G with cri(y) o n (y). In Eq. (2), replace x by 
yx to obtain 


ci< 7 i(y)cri(x) H-h c n -io n - l (y)o n - l (x) + o n {y)o n (x) = 0 , 

6 This definition is a special case of character in representation theory: If a : G — > GL («, E) is a homomor- 
phism, then its character Xo '■ G ^ E is defined, for a e G, by 

Xa(x) = trace (a (a)), 

where the trace of an n x n matrix is the sum of its diagonal entries. When n = 1, then GL(1, E) = E x and 
Xo ( x ) = cr(x ) is called a linear character. 
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for <jj(yx ) = <t ( - (y)cri (x). Now multiply this equation by er„(y) 1 to obtain the equation 

c\o n (y)~ l ofy)o\{x) H-b c„-i(j n (y)~ l <7 n -i(y)<7 ll -i(x) + o n {x) = 0. 

Subtract this last equation from Eq. (2) to obtain a sum of n — 1 terms: 

ct[l - a„(y) _1 CTi(y)]CTi(x) + c 2 [l - (T,,(y)~ l V 2 (y)]v 2 (x) H-= 0. 

By induction, each of the coefficients c, [ 1 — cr„(y) _ 1 cr, (y)] = 0. Now c, ^ 0, and so 
cr„(y)~ 1 cri(y) — 1 for all i < n. In particular, a n (y) — eri(y), contradicting the definition 
of y. • 

Lemma 4.31. If G — {eri, ... ,cr n ] is a set ofn distinct automorphisms of a field E, then 

[E : E g ] > n. 

Proof Suppose, on the contrary, that [E : E G ] = r < n, and let a\ ,..., a r be a basis of 
E/E G . Consider the homogeneous linear system over E of r equations in n unknowns: 

cti( ai)vi H-b cr n {a\)x n = 0 

a-i(a 2 )xi H-b a n (a 2 )x n = 0 


oi(a r )x\ H-b o n (a r )x„ — 0. 


Since r < n, there are fewer equations than variables, and so there is a nontrivial solution 
(ci, ..., c„) in E n . 

We are now going to show that oi{f)c\ + • • • + cr n (f)c n — 0 for any f e E x . which 
will contradict the independence of the characters 0 [ \E X ,..., er„ \E X . Since a \, .... a r is 
a basis of E over E G , every f e E can be written 

f> = 'EbiC'i, 

where bj e E G . Multiply the /th row of the system by o\ {bf) to obtain the system with ;th 
row: 

o\{bi)o\(ai)c\ H-b <r, (bj)cr n (aj)c n = 0. 

But (7 1 (fii ) = bj = <jj(bj) for all i , j, because bj e E G . Thus, the system has /th row: 

Oi(bjaj)ci H-b o n (bjCij)c n = 0. 


Adding all the rows gives 


o\(f)ci H-b a n (f)c n = 0, 




which contradicts the independence of the characters o \,..., o n . 
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Proposition 4.32. If G = [cri,. .. , a,,} is a subgroup of AvX(E), then 

[E : E g ] — |G|. 

Proof. In light of Lemma 4.31, it suffices to prove [E : E G ] < |G|. If, on the contrary, 
[E : E g ] > n, let {&>i,.... &>„+i} be a linearly independent list of vectors in E over E G . 
Consider the system of n equations in n + 1 unknowns: 

eri(<ui)xi H-b G\ (a) n+ i)x n +\ = 0 


a n {to\)x\ H-b G n {(o n +\)x n+ 1 = 0. 


There is a nontrivial solution («i, ..., a„+i) over E\ we proceed to normalize it. Choose 
a solution ..., f r , 0,..., 0) having the smallest number r of nonzero components (by 
reindexing the &>;, we may assume that all nonzero components come first). Note that 
r / 1 , lest eri(&>i)/3i = 0 imply — 0. Multiplying by its inverse if necessary, we may 
assume that fi r = 1 . Not all /f, e E G , lest the row corresponding to cr = 1 f violates 
the linear independence of {a>\, ..., o> n+ \ |. Our last assumption is that f>\ does not he 
in E g (this, too, can be accomplished by reindexing the &>,). There thus exists (Tf : with 
(Tkifii) f\ . Since f r — 1 , the original system has /'th row 

Oj{w\)P\ H-b (Tj((O r -l)P r -l + Gj{co r ) = 0. (3) 


Apply o> to this system to obtain 


G k Gj{C0\)G k {Pl) -I-b G k Gj{w r -l)G k (P r -\) + G k Gj(w r ) = 0. 

Since G is a group, G k G\,.. . , G k o n is just a permutation of o\ ,..., ct„ . Setting G k a ; = cr;, 
the system has i th row 


Gi(an)G k (Pi) H-b Gi(co r -i)G k (p r -i) + Gi{co r ) = 0. 

Subtract this from the /th row of Eq. (3) to obtain a new system with / th row: 

er;(tt>l)[j8i - G k (fi)] H-b Gi (co r -l)[Pr-l - VkiPr- 1)] = o. 

Since f \ — G k (fi) 0, we have found a nontrivial solution of the original system having 

fewer than r nonzero components, a contradiction. • 

These ideas give a result needed in the proof of the fundamental theorem of Galois 
theory. 

Theorem 4.33. If G and H are finite subgroups ofAut(E) with E G = E H , then G = H. 
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Proof. We first show that if o e Aut(£), then o fixes E G if and only if a e G. Clearly, 
o fixes E g if a e G. Suppose, conversely, that a fixes E G but rr (/ G. If |G| = n. then 

n = |G| = [E : E G ], 

by Proposition 4.32. Since er fixes E G . we have E G C E G U 1‘ T '. But the reverse inequality 
always holds, by Proposition 4.28, so that E G — E GG ^. Hence, 

n = [E: E g ] = [E : £ ,GU * <T *] > |G U {a}| = n + 1, 

by Lemma 4.31, giving the contradiction n > n + 1. 

If a e H, then a fixes E H — E G , and hence a e G; that is, // < G; the reverse 
inclusion is proved the same way, and so H = G. • 

We can now give the characterization of splitting fields we have been seeking. 

Theorem 4.34. If E/k is a finite extension with Galois group G — Gal (E/k), then the 
following statements are equivalent. 

(i) E is a splitting field of some separable polynomial f(x) € k[x\ 

(ii) k = E g . 

(iii) Every irreducible p(x) e k[x] having one root in E is separable and splits in E[x\ 

Proof, (i) (ii) By Theorem 4.7(h), |G| = [E : k]. But Proposition 4.32 gives |G| = 
[E : E g ], so that 

[E : k] = [E : E G ]. 

Since k < E G , we have [E : k] = [E : E G ][E G : k], so that [E G : k] = 1 and k = E G . 

(ii) =t- (iii) Let p (x) e k[x\ be an irreducible polynomial having a root a in E, and let the 
distinct elements of the set {er(a): a e G} be a \,..., a n . Define g(x) e E[x ] by 

g(x) = ]~[(x - di). 

Now each o e G permutes the a,-, so that each o fixes each of the coefficients of g(x)\ 
that is, the coefficients of g(x) lie in E G — k. Hence g(x) is a polynomial in A'[v] having 
no repeated roots. Now p(x) and g(x ) have a common root in E. and so their gcd in 
L[v] is not 1; it follows from Corollary 3.41 that their gcd is not 1 in k [v]. Since p(x) is 
irreducible, it must divide g(x). Therefore, p(x) has no repeated roots, hence is separable, 
and it splits over E. 

(iii) => (i) Choose a\ e E with £ k. Since E/k is a finite extension, a\ must be 

algebraic over k\ let p\(x) — irr(at, k) e k[x] be its minimal polynomial. By hypothesis, 
pi(x) is a separable polynomial that splits over E; let K\ C £ be its splitting field. If 
K i = E, we are done. Otherwise, choose 0/2 e E with «2 f K\. By hypothesis, there 
is a separable irreducible p 2 (x) e k[x] having 012 as a root. Let K 2 L E he the splitting 
field of p\ (x)P 2 (x), a separable polynomial. If K 2 — E. we are done; otherwise, repeat 
this construction. This process must end with K m — E for some m because E/k is finite. 
Thus, £ is a splitting field of the separable polynomial p\ ix) ■ ■ ■ p m (x). • 
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Definition. A field extension E/ k is a Galois extension if it satisfies any of the equivalent 
conditions in Theorem 4.34. 

Example 4.35. 

If E/k is a finite separable extension, then the radical extension of E constructed in 
Lemma 4.17 is a Galois extension. ◄ 

Corollary 4.36. If E / k is a Galois extension and if B is an intermediate field, that is, a 
subfield B with k C B C E, then E/B is a Galois extension. 

Proof We know that £ is a splitting field of some separable polynomial fix) e k[x)\ 
that is, E = k{a \, ..., a n ), where a \, ..., a n are the roots of fix). Since k C B c E, we 
have f(x) e B[x] and E — B(a i,..., a n ). • 

Recall that the elementary symmetric functions of n variables are the polynomials, for 

7 = 1- ,n, _ 

e j{x\ - ,x n )= ^ x it ■■ ■ x ij . 

i\<—<ij 

If zi, ■■■,?.,! are the roots ofx"+a„_ix" _1 4-|-ao, then ej (z,],... ,z n ) — (—1 y a n -j. 

Theorem 4.37 (Fundamental Theorem of Symmetric Functions). Ifk is afield, every 
symmetric function in k(x\,...,x n ) is a rational function in the elementary symmetric 
functions e\,... ,e„. 

Proof. Let F be the smallest subfield of E — k(x\, ..., x n ) containing the elementary 
symmetric functions. As we saw in Example 3.125, E is the splitting field of the general 
polynomial /(f) of degree n: 

n 

fit) — f[(f - Xi). 
i= 1 

As f(t) is a separable polynomial, E/F is a Galois extension. We saw, in the proof of 
Theorem 4.27, the Abel-Ruffini theorem, that Gal ( E/F) = S„. Therefore, E Sn = F, by 
Theorem 4.34. But to say that d(x) — g(x i,..., x n )/h(x i,..., x„) lies in E Sn is to say 
that it is unchanged by permuting its variables; that is, 0 (x) is a symmetric function. • 

Exercise 6.84 on page 410 shows that every symmetric polynomial in k[x \,..., x n ] lies 
in k[e\, ..., e„]. 

Definition. If A and B are subfields of a field E, then their compositum , denoted by 
A V B, is the intersection of all the subfields of E that contain AL) B. 

It is easy to see that A V B is the smallest subfield of E containing both A and B. For 
example, if E/k is an extension with intermediate fields A — k(a i,..., a„) and B — 
k(/3 1 ./„,), then their compositum is 


k(a i,- a n ) V k(fi,.... f m ) = k(a i, ... ,a„, /i, ..., /„,). 
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Proposition 4.38. 

(i) Every Galois extension E/k is a separable extension ofk. 

(ii) If E / k is an algebraic field extension and S C E is any, possibly infinite, 7 set of 
separable elements, then k{S)/k is a separable extension. 

(iii) Let E/k be an algebraic extension, where k is afield, and let B and C be interme¬ 
diate fields. If both B/k and C/k are separable extensions, then their composition 
B V C is also a separable extension ofk. 

Proof, (i) If f e E, then p(x) = irr(/3, k) e k[x~\ is an irreducible polynomial in k[x\ 
having a root in E. By Theorem 4.34(iii), p(x) is a separable polynomial (which splits in 
E[xf). Therefore, f is separable over k, and E/k is a separable extension. 

(ii) Let us first consider the case when S is finite; that is, B — k(a i,..., a,) is a finite 
extension, where each a, is separable over k. By Lemma 4.17(i), there is an extension E / B 
that is a splitting field of some separable polynomial f(x) e k[x\, hence, E/k is a Galois 
extension, by Theorem 4.34(i). By part (i) of this proposition, E/k is a separable extension; 
that is, for all a e E, the polynomial irr(a, k) has no repeated roots. In particular, irr(«, k) 
has no repeated roots for all a e B, and so B / k is a separable extension. 

We now consider the general case. If a e k(S), then Exercise 3.95 on page 197 says 

that there are finitely many elements a\,... ,a n e S with a e B — k(a\ . a n ). As 

we have just seen, B/k is a separable extension, and so a is separable over k. As a is an 
arbitrary element of k(S), it follows that k(S)/k is a separable extension. 

(iii) Apply part (i) to the subset S — B U C, for B V C = k(B U C). • 

Query. If E/k is a Galois extension and B is an intermediate field, is B/k a Galois 
extension? The answer is no; in Example 4.29, we saw that E = Q(\/2, o>) is a splitting 
field of x 3 — 2 over Q, where to is a primitive cube root of unity, and so it is a Galois 
extension. However, the intermediate field B — Q(\/2) is not a Galois extension, for 
x 3 — 2 is an irreducible polynomial having a root in B, yet it does not split in B[x~\. 

The following proposition determines when an intermediate field B does give a Galois 
extension. 

Definition. If E/k is a Galois extension and if B is an intermediate field, then a conjugate 
of B is an intermediate field of the form 

B a = { o(b) :be B} 


for some er e Gal (E/k ). 


7 This result is true if finitely many transcendental elements are adjoined (remember that transcendental el¬ 
ements are always separable, by definition), but it may be false if infinitely many transcendental elements are 
adjoined. 
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Proposition 4.39. If E/k is a Galois extension, then an intermediate field B has no 
conjugates other than B itself if and only if B / k is a Galois extension. 

Proof Assume that B a — B for all o e G, where G = Gal (E/k). Let p{x) e £[x] be 
an irreducible polynomial having a root f> in II. Since B C E and E/k is Galois, p(x) 
is a separable polynomial and it splits in E[x\. If ft e E is another root of p(x), there 
exists an isomorphism o e G with a {ft — ft' (for G acts transitively on the roots of an 
irreducible polynomial, by Proposition 4.13). Therefore, ft — a (ft) e B a = B, so that 
p(x) splits in B[x]. Therefore, B/k is a Galois extension. 

Conversely, since B/k is a splitting held of some polynomial / (x ) over k, we have 
B — k(a i, ..., a n ), where ct\ , ..., a„ are all the roots of fix). Since every a e Ga \(E/k) 
must permute the roots of fix), it follows that a must send B to itself. • 

We are now going to show, when E/k is a Galois extension, that the intermediate fields 
are classified by the subgroups of Gal (E/k). 

We begin with some general definitions. 

Definition. A set X is a partially ordered set if it has a binary relation x < y defined on 
it that satisfies, for all x, y, z e X, 

(i) Reflexivity: x < x; 

(ii) Antisymmetry. If x < y, and y < x, then x — y, 

(iii) Transitivity: If x < y and y < z, then x < z. 

An element c in a partially ordered set X is an upper bound of a.h e X if a w c and 
b < c, an element d e A is a least upper bound of a , b if d is an upper bound and if d < c 
for every upper bound c of a and b. Lower bounds and greatest lower bounds are defined 
similarly, everywhere reversing the inequalities. 

We will discuss partially ordered sets more thoroughly in the Appendix. Here, we are 
more interested in special partially ordered sets called lattices. 

Definition. A lattice is a partially ordered set C in which every pair of elements a, b e C 
has a greatest lower bound a A b and a least upper bound a V b. 


Example 4.40. 

(i) If U is a set, define C to be the family of all the subsets of U, and define A < B to mean 
A C B. Then C is a lattice, where A A B — A (T B and Aw B — AD B. 

(ii) If G is a group, define C = Sub(G) to be the family of all the subgroups of G, and 
define A < B to mean A < B: that is, A is a subgroup of B. Then C is a lattice, where 
A A B — A D B and Aw B is the subgroup generated by A U B. 

(iii) If E/k is a field extension, define C = Int( E/k) to be the family of all the intermediate 
fields, and define K < B to mean K c B: that is, K is a subfield of B. Then C is a lattice, 
where K A B = K fl B and K V B is the compositum of K and B. 
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(iv) If n is a positive integer, define Div(n) to be the set of all the positive divisors of 
n. Then Div(n) is a partially ordered set if one defines d < d' to mean d \ d'. Here, 
d /\d' — gcd (d, d') and d V d’ = lcm (d, d'). ◄ 


Definition. If C and C! are lattices, a function /:£—»■ C! is called order-reversing if 
a < b in C implies f(b ) < f (a) in C. 


Example 4.41. 

There exist lattices C and C! and an order-reversing bijection <p\ C —>• C! whose inverse 
(p~ l : C! -> C is not order-reversing. For example, consider the lattices 


a 4 



d 2 


1 

The bijection <p: C —> C!, defined by 

(p(a) = 1, q>{b) — 2, <p(c) — 3, <p(d) — 4, 

is an order-reversing bijection, but its inverse <p~ l : C £ is not order-reversing, because 
2 ^ 3butc = <p-*(3) £ =b. <* 

The De Morgan laws say that if A and B are subsets of a set X , and if A' denotes the 
complement of A, then 

(A (T B)' = A' L) B' and (A U B)' = A' n B'. 

These identities are generalized in the next lemma. 

Lemma 4.42. Let C and C be lattices, and let (p: C —* C be a bijection such that both 
<p and (p -1 are order-reversing. Then 

(p(a A b) — (p(a) V < p(b) and q>(a V b) — q)(a) A q>(b). 

Proof. Since a, b < a V b, we have (p(a V b) < <p(a), (p(b)\ that is, <p(a V b) is a lower 
bound of (p(a),(p{b). It follows that q>(a V b) < (p(a) A <p(b). 

For the reverse inequality, surjectivity of <p gives c e C with <p(a) A ip(b) — <p(c). Now 
(p(c) = (p(a) A (p(b) < <p(a), <p(b). Applying (p ~ l , which is also order-reversing, we have 
a, b < c. Hence, c is an upper bound of a , b, so that a V b < c. Therefore, (p(a V b) a 
(p(c) = (p(a ) A cp(b). A similar argument proves the other half of the statement. • 
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Theorem 4.43 (Fundamental Theorem of Galois Theory). Let E/k be a finite Galois 
extension with Galois group G — Ga l(E/k). 

(i) The function y: Sub (Gal (£/£)) —> lnt(E/k), defined by 

y: H h-> E h , 

is an order-reversing bijection whose inverse, 8: lnt(E/k) —> Sub(Gal(£/A;)), is 
the order-reversing bijection 

8: B i-» Ga 1{E/B). 

(ii) For every B e Int (E/k) and H e Sub(Gal(£’/A:)), 

£°ai (E/B) = B Qnd Ga \( E / E h ) = h. 

(iii) For every H, K e Sub(Gal(£/A:)) and B, C e Int (E/k), 

e H vK = e h n e k \ 

E HnK = e h v E k \ 

Gal (E/(B V C)) = Gal (E/B) n Gal(£/C); 

Gal (E/{B n C)) = Gal (E/B) V Gal(£/C). 

(iv) For every B e In t(E/k) and H e Sub(Gal(£/A:)), 

[B :k] = [G : Gal (E/B)] and [G : H] = [E H : k], 

(v) If B e Int(£’/^:), then B / k is a Galois extension if and only //’Gal(£’ / B) is a normal 
subgroup of G. 

Proof, (i) Proposition 4.28 proves that y is order-reversing, and it is also easy to prove 
that 8 is order-reversing. Now injectivity of y is proved in Theorem 4.33, so that Propo¬ 
sition 1.47 shows that it suffices to prove that yS: Int (E/k) —> Int (E/k) is the iden¬ 
tity; it will follow that y is a bijection with inverse 8. If B is an intermediate field, 
then 8y: B h-> e g ' a(E / B) . But E/E B is a Galois extension, by Corollary 4.36, and so 
^Gai (E/B) _ ky xheorem 4.34. 

(ii) This is just the statement that yS and Sy are identity functions. 

(iii) These statements follow from Lemma 4.42. 

(iv) By Theorem 4.7(h) and the fact that E/B is a Galois extension, 

[B ; k] = [E : k]/[E : B] = \G\/\ Gal(£/B)| = [G : Gal (E/B)]. 

Thus, the degree of B/k is the index of its Galois group in G. The second equation follows 
from this one; take B = E H , noting that (ii) gives Gal( E/E H ) — H: 

[E h :k] = [G: Gal (E/E H )] = [G : H]. 
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(v) It follows from Theorem 4.16 that Gal (E/B) < G when B/k is a Galois extension 
(both B/k and E/k are splitting fields of polynomials in £[x]). For the converse, let 
H = Gal (E/B). and assume that H <\ G. Now E H — E Gal(E I B) — B. by (ii), and so 
it suffices to prove that ( E H )° = E H for every o e G. by Proposition 4.39. Suppose 
now that a e E H \ that is, ifia) = a for all ;; e H. If a e G , then we must show that 
q(o(a)) — o (a) for all ;/ e H. Now H <d G says that if i] e II and a e G, then there is 
v/ e H with r]<j = or/ (of course, r/ — a~ l qo). But 

j]o(a) — or/(a) — o{a ), 

because r/(a) = a , as desired. Therefore, B/k = E H /k is Galois. • 

Here are some corollaries. 

Theorem 4.44. If E/k is a Galois extension whose Galois group is abelian, then every 
intermediate field is a Galois extension. 

Proof. Every subgroup of an abelian group is a normal subgroup. • 

Corollary 4.45. A Galois extension E/k has only finitely many intermediate fields. 
Proof. The finite group Ga\(E/k) has only finitely many subgroups. • 

Definition. A field extension E/k is a simple extension if there is it e E with E = k(u). 
The following theorem of E. Steinitz characterizes simple extensions. 

Theorem 4.46 (Steinitz). A finite extension E/k is simple if and only if it has only 
finitely many intermediate fields. 

Proof. Assume that E/k is a simple extension, so that E = k(u)\ let p(x) — irr(n, k ) e 
A'[v] be its minimal polynomial. If B is any intermediate field, let 

q(x) — irr (u, B) — bo + b[X H-+ b n -\x n ~ { + x n e B[x] 

be the monic irreducible polynomial of u over B , and define 

B' = k(bo,, b n -\) c B. 

Note that q(x) is an irreducible polynomial over the smaller field B'. Now 

E = k(u) c B\u) c B{u) c E, 

so that B'(u) = E = B(u). Hence, [E : B] = [B(u) : B] and [E : B’] = [B’{u) : B’]. 
But each of these is equal to deg(g), by Proposition 3.117(v), so that [E : B] = deg(g) = 
[E : B']. Since B' C B, it follows that [B : B’] = 1; that is. 


B = B' = k(bo, , b n -i). 
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We have characterized B in terms of the coefficients of q{x), a monic divisor of p(x) = 
irr (u, k) in E[x\. But p(x) has only finitely many monic divisors, and hence there are only 
finitely many intermediate fields. 

Conversely, assume that E/k has only finitely many intermediate fields. If A: is a finite 
field, then we know that E/k is a simple extension (take u to be a primitive element); 
therefore, we may assume that k is infinite. Since E/k is a finite extension, there are 
elements mi, ..., m„ with E — k(u \, ..., m„). By induction on n > 1, it suffices to prove 
that E = k(a, b) is a simple extension. Now there are infinitely many elements c e E of 
the form c — a + tb, where t e k, for k is now infinite. Since there are only finitely many 
intermediate fields, there are, in particular, only finitely many fields of the form kic). By 
the pigeonhole principle, 8 there exist distinct elements t,t'ek with k(c ) = k(c'), where 
c’ = a + t’b. Clearly, k(c) C k(a , b ). For the reverse inclusion, the field k(c) — k(c') 
contains c — c’ = (f — t')b , so that b e k(c) (because t — t' 0). It follows that 
a — c — tb e k(c), and so k(c) = k(a, b). • 

An immediate consequence is that every Galois extension is simple; in fact, even more 
is true. 

Theorem 4.47 (Theorem of the Primitive Element). If B/k is a finite separable 
extension, then there is u € B with B — k(u). In particular, if k has characteristic 0, then 
every finite extension B/k is a simple extension. 

Proof. By Example 4.35, the radical extension E/k constructed in Lemma 4.17 is a Ga¬ 
lois extension having B as an intermediate field, so that Corollary 4.45 says that the exten¬ 
sion E/k has only finitely many intermediate fields. It follows at once that the extension 
B/k has only finitely many intermediate fields, and so Steinitz’s theorem says that B/k 
has a primitive element. • 

The theorem of the primitive element was known by Lagrange, and Galois used a mod¬ 
ification of it in order to construct the original version of the Galois group. 

We now turn to finite fields. 

Theorem 4.48. The finite field F ? , where q — p n , has exactly one subfield of order p d 
for every divisor d of n, and no others. 

Proof. First, ¥ q /¥ p is a Galois extension, for it is a splitting field of the separable poly¬ 
nomial x q — x. Now G — Gal(F 9 /F p ) is cyclic of order n, by Theorem 4.12. Since a 
cyclic group of order n has exactly one subgroup of order d for every divisor d of n, by 
Lemma 2.85, it follows that G has exactly one subgroup H of index n/d. Therefore, there 
is only one intermediate field, namely, E H , with [E H : F ;) ] = [G : H] — n/d, and 

E H = W p n/d. • 

We now give two algebraic proofs of the fundamental theorem of algebra, proved by 
Gauss (1799): The first, due to P. Samuel (which he says is “by a method essentially due 

8 If there is an infinite number of pigeons in only finitely many pigeonholes, then at least one of the holes 
contains an infinite number of pigeons. 
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to Lagrange”), uses the fundamental theorem of symmetric functions; the second uses the 
fundamental theorem of Galois theory, as well as a Sylow theorem which we will prove in 
Chapter 5. 

Assume that R satisfies a weak form of the intermediate value theorem: If fix) e R[x] 
and there exist a, b e R such that f(a) > 0 and fib) < 0, then fix) has a real root. Here 
are some preliminary consequences. 

(i) Every positive real number r has a real square root. 

If fix) = x 2 — r, then 

/(I + r) = (1 + r) 2 - r = 1 + r + r 2 > 0 , 

and /( 0 ) = —r < 0 . 

(ii) Every quadratic g(x) € C[x] has a complex root. 

First, every complex number z has a complex square root: When z is written in polar 
form z — re 10 , where r > 0, then ^fz, = *Jre 10 ! 2 . The quadratic formula gives the 
(complex) roots of g(x). 

(iii) The field C has no extensions of degree 2. 

Such an extension would contain an element whose minimal polynomial is an irre¬ 
ducible quadratic in C[x]; but Item (ii) shows that no such polynomial exists. 

(iv) Every fix) e R[x] having odd degree has a real root. 

Let fix) = ao + a\x H-+ a n -\x n ~ l + x" e M[x], Define t = 1 + |a,-|. Now 

\at \ < t — 1 for all i and, if hix) — fix) — x", then 

\hit)\ — |o 0 + a\t + • • • + a n -\t n * | 

< it- l)(l+f -t-.-. + f"- 1 ) 

= t n - 1 

< t n . 

Therefore, -t n < hit) and 0 = -t n + t n < hit) + t n = fit). 

A similar argument shows that |/z(—f)| < t n , so that 

fi-t) = hi-t) + i-t)" < t" + i-t) n . 

When n is odd, (—t) n — —t n , and so /(— t) < t n — t" = 0. Therefore, the 
intermediate value theorem provides a real number r with fir) = 0 ; that is, fix) 
has a real root. 

(v) There is no field extension E /R of odd degree > 1. 

If u e E, then its minimal polynomial irr(u, R) must have even degree, by Item (iv), 
so that [R(m) : R] is even. Hence [E : R] = [£ : R(m)][R(m) : R] is even. 
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Theorem 4.49 (Fundamental Theorem of Algebra). If fix) e C[x] has degree n > 1, 
then fix) has a complex wot, and hence fix) splits'. There are c, u \,..., u n € C with 


fix) = cix - u\) ■ ■ ■ (x - u n ). 

Proof We show that fix) — a,x' e C[x] has a complex root. Define fix) — 
Jfdjx', where a,- is the complex conjugate of a;. Now fix) fix) — JfckX k , where 
Ck = J2i+j=k a idj\ hence, Ck = Ck, so that fix) fix) e R[x]. If fix) has a complex root 
z, then z is a root of fix) fix). Conversely, if z is a complex root of fix) fix), then z is a 
root of either fix) or fix). But if z is a root of fix), then z is a root of fix). Therefore, 
fix) has a complex root if and only if fix) fix) has a complex root, and so it suffices to 
prove that every real polynomial has a complex root. 

To summarize, it suffices to prove that every nonconstant monic fix) e R[x] has a 
complex root. Let deg if) = 2 k m, where m is odd; we prove the result by induction on 
k > 0. The base step k — 0 is proved in Item (iv), and so we may assume that k > 1. Let 
ai,..., a„ be the roots of fix) in some splitting field of fix). For fixed t e R, define 

gtix) = ]~[(x - fij), 

UJ) 


where fjj — j+ta,a j and {/, j } varies over all the two-element subsets of {1, ..., n). 

First, 

deg(g f ) = \nin - 1) = 2 A_1 w(u - 1). 

Now n = 2 k m is even, because k > 1, so that n — 1 is odd; hence, min — 1) is odd. Thus, 
the inductive hypothesis will apply if g t (x) e R[x], 

For each coefficient c of gt(x), there is an elementary symmetric function 

e(..., y/j,...) e R[..., y,-j -] 

with c = e(..., fij ,...). If we define 

hix i,..., x„) — ef .., Xi + Xj + tx,Xj,...), 

then 

c = e(..., a,- + a j + totjoij,...) — hia i,..., a„). 

Each o e S n acts on R[xi,..., x„] via a : x,- + xj + txixj i-> x„i + x a j + tx a ix a j, and 
hence it permutes the set of polynomials of this form. Since the elementary symmetric 
function e(..., yij,...) is invariant under every permutation of the variables yij, it follows 
that hix i,... ,x n ) — Fi..., x,- + x; + tx'iXj ,...) is a symmetric function of xi, ..., x n . 
By the fundamental theorem of symmetric polynomials (Exercise 6.84 on page 410), there 
is a polynomial tpix) e R[xi,..., x„\ with 


hix i, - Xn) = (pie i(xi, ... ,x„), ..., e„ix i,... ,x„)). 
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The evaluation (xi,..., x n ) i-> (ai. a n ) gives 

c = h(a\, ..., a„) = (p{e\(ct\,..., or,,),..., e n (a\,..., a n )). 

But e r (ai, ... ,a n ) is just the rth coefficient of fix), which is real, and so c is real; that is, 
gt (x) e R[x], 

By induction, g, (x) has a complex root for each t e E. There are infinitely many f el 
and only finitely many two-element subsets {i, j). By the pigeonhole principle, there exists 
a subset [i, j } and distinct reals t and s with both a; + aj + ta,aj and a; + a j + saja j 
complex [for the f)jj are the roots of gtix)]. Subtracting, (1 — s)aiaj e C; as t s, we 
have ai a j e C; say, ajaj — u. Since a; +a j +tajaj e C, it follows that a; +aj e C; say, 
+ aj = v. Therefore, a, is a root of x 2 — vx + u, and the quadratic formula. Item (ii), 
gives a, e C, as desired. That fix) splits now follows by induction on n > 1 . • 

Here is a second proof. 

Theorem (Fundamental Theorem of Algebra). Every nonconstant fix) e C[x] has a 
complex root. 

Proof. As in the proof just given, it suffices to prove that every nonconstant fix) e R[x] 
has a complex root. Let £/R be a splitting field of ix 2 + 1 )f(x) that contains C. Since 
IR has characteristic 0, E/M. is a Galois extension; let G — GalfL/R) be its Galois group. 
Now |G| = 2 '"k, where m > 0 and k is odd. By the Sylow theorem (Theorem 5.36), 
G has a subgroup H of order 2”'; let B — E H be the corresponding intermediate field. 
By the fundamental theorem of Galois theory, the degree [B : R] is equal to the index 
[G : H] = k. But we have seen, in Item (v), that R has no extension of odd degree 
greater than 1; hence k — 1 and G is a 2-group. Now E /C is also a Galois extension, and 
Gal (E /C) < G is also a 2-group. If this group is nontrivial, then it has a subgroup K of 
index 2. By the fundamental theorem once again, the intermediate field E K is an extension 
of C of degree 2, and this contradicts Item (iii). We conclude that [£ : C] = 1; that is, 
E = C. But £ is a splitting field of fix) over C, and so fix) has a complex root. • 

We now prove the converse of Galois’s theorem (which holds only in characteristic 0): 
Solvability of the Galois group implies solvability by radicals of the polynomial. It will 
be necessary to prove that a certain field extension is a pure extension, and we will use the 
norm (which arises quite naturally in algebraic number theory; for example, it was used in 
the proof of Theorem 3.66, Fermat’s two-squares theorem). 

Definition. If E/k is a Galois extension and u e E x , define its norm N(u) by 

N(u) = J~| ct(m). 

<rsGal (E/k) 

Here are some preliminary properties of the norm, whose simple proofs are left as ex¬ 
ercises. 

(i) If u e E x , then Niu) e k x (because Niu) e E G — k). 
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(ii) N(uv) = N(u)N(v), so that N : E x -> k x is a homomorphism. 

(iii) If a e k, then N(a ) = a", where n = [E: k], 

(iv) If a e G and u e E x , then N(a(u)) — N(u). 

Given a homomorphism, we ask about its kernel and image. The image of the norm is 
not easy to compute; the next result (which was the ninetieth theorem in an 1897 exposition 
of Hilbert on algebraic number theory) computes the kernel of the norm in a special case. 

Theorem 4.50 (Hilbert’s Theorem 90). Let E / k be a Galois extension whose Galois 
group G = Gal (E / k) is cyclic of order n, say, with generator a. If u e E x , then N (u) — 1 
if and only if there exists v e E x with u — i>er(i>) -1 . 

Proof. If n = vct (r)~ 1 , then 

N(u) = N(va(v )~ l ) 

= N(v)N((j(v)~ l ) 

= N(v)N(o(v)r l 
= N(v)N(v)~ l = 1. 

Conversely, let N(u) = 1. Define “partial norms” in E x : 

Sq = u, 

<$! = uo{u), 

82 = ucr(u)cr 2 (u), 


8 n -1 = uo(u) ■ ■ ■ a" -1 (m). 

Note that 8 n -\ — N(u) — 1. It is easy to see that 

uo(8{) — <5,_|_i for all 0 < i < n — 2. (4) 

By independence of the characters 1, cr, <r 2 , ..., ct" _ 1 , there exists y e E with 

8 0 y + 5ict(v) H-h 8 n - 2 o n ~ 2 (y) + a n ~\y) 0; 

call this sum z■ Using Eq. (4), we easily check that 

o(z) = o(8o)(j(y ) + o(S l )o 2 (y) H-h cr(,5„_2)o-'' _1 (y) + a n (y ) 

= u~\8\o(y) + u~ l S 2 (T 2 (y) H-h t/ -1 ,5„_ier" -1 (y) + y 

= M _1 ^icr(y) + 8 2 a 2 (y) H-1- (y)j + u~ l 8 0 y 

— U~ l Z. • 
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Corollary 4.51. Let E / k be a Galois extension of prime degree p. Ifk contains a prim¬ 
itive pth root of unity co, then E — k{z), where z p € k, and so E/k is a pure extension of 
type p. 

Proof. The Galois group G — Gal(£ /k) has order p, hence is cyclic; let cr be a generator. 
Observe that Nico) — io p — 1, because co e k. By Hilbert’s Theorem 90, we have 
a> — zeGz)" 1 for some z e E. Hence cr(z) — co~ l z. Thus, cr (z p ) — (co~ l z) p = Z p , and so 
Z p e E G . because cr generates G; since E/k is Galois, however, we have E G — k, so that 
z p e k. Note that z f k, lest co = 1, so that k(z) k is an intermediate field. Therefore 
E — k(z), because [E : k] = p is prime, and hence E has no proper intermediate fields. • 

We confess that we have presented Hilbert’s Theorem 90, not only because of its corol¬ 
lary, which will be used to prove Galois’s theorem, but also because it is a well-known 
result that is an early instance of homological algebra (see Corollary 10.129). Here is an 
elegant proof of Corollary 4.51 due to E. Houston (we warn the reader that it uses eigen¬ 
values, a topic we have not yet introduced). 

Proposition 4.52. Let E/k be a Galois extension of prime degree p. Ifk contains a 
primitive pth root of unity co, then E — k(z), where z p € k, and so E / k is a pure extension 
of type p. 

Proof. Since E/k is a Galois extension of degree p, its Galois group G = Gal ( E/k) has 
order p, and hence it is cyclic: G — (a). View £ as a vector space over k. If a <= k and 
u e E, then o(au) — a(a)a{u) = a<r(u), because a e Gal (E/k) (so that it fixes k), and 
so we may view o : E —>■ £ as a linear transformation. Now a satisfies the polynomial 
x p — 1, because o p — I f , by Lagrange’s theorem. But a satisfies no polynomial of smaller 
degree, lest we contradict independence of the characters 1, o, o 2 ,..., a p ~ l . Therefore, 
x p — 1 is the minimum polynomial of a, and so every pth root of unity co is an eigenvalue of 
a. Since co~ l € k , by hypothesis, there is some eigenvector z e £ of cr with cr(z) = co~ l z 
(note that z f k because it is not fixed by cr). Hence, cr(z p ) — (cr(z)) p = ( co~ [ ) p z p — z p , 
from which it follows that z p e E G = k. Now p — [£ : k] = [£ : k(z)][k(z) : k\, since 
p is prime and [k(z) : k] 1, we have [£ : k(z)] = 1; that is, £ = k(z), and so E/k is a 
pure extension. • 

Theorem 4.53 (Galois). Let k be a field of characteristic 0, let E/k be a Galois exten¬ 
sion, and let G = Gal (E/k) be a solvable group. Then E can be imbedded in a radical 
extension ofk. 

Therefore, the Galois group of a polynomial over a field of characteristic 0 is a solvable 
group if and only if the polynomial is solvable by radicals. 

Remark. A counterexample in characteristic p is given in Proposition 4.56. ◄ 

Proof. Since G is solvable, it has a normal subgroup H of prime index, say, p. Let 
co be a primitive pth root of unity, which exists in some extension field, because k has 
characteristic 0. We distinguish two cases. 
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Case (i): &> e k. 

We prove the statement by induction on [ E : k]. The base step is obviously true, for 
k = E is a radical extension of itself. For the inductive step, consider the intermediate field 
E H . Now E/E h is a Galois extension, by Corollary 4.36, and Gal (E/E H ) is solvable, 
being a subgroup of the solvable group G. Since [E : E H \ < [E : k], the inductive 
hypothesis gives a radical tower E H C R\ C ••• C R t , where E C R t . Now E H /k 
is a Galois extension, because H <3 G, and its index [G : H] = p = [E H : k], by the 
fundamental theorem. Corollary 4.5 1 (or Proposition 4.52) now applies to give E H — k(z ), 
where z p e k\ that is, E H /k is a pure extension. Hence, the radical tower above can be 
lengthened by adding the prefix k C E H , thus displaying R t /k as a radical extension. 

Case (ii): General case. 

Let k* = k(a>), and define E* = Eito). We claim that E* / k is a Galois extension. 
Since E/k is a Galois extension, it is the splitting field of some separable fix) e k[x), 
and so E* is a splitting field over k of f(x)(x p — 1). But x p — 1 is separable, because 
k has characteristic 0, and so E* / k is a Galois extension. Therefore, E*/k* is also a 
Galois extension, by Corollary 4.36. Let G* = GalfL*/ k*). By Exercise 4.5 on page 217, 
accessory irrationalities, there is an injection i/r: G* -» G = Gal (E/k), so that G* is 
solvable, being isomorphic to a subgroup of a solvable group. Since a> e k*, the first case 
says that there is a radical tower k* c R* C • • • C R* n with E C E* C R* n . But k* — k(a >) 
is a pure extension, so that this last radical tower can be lengthened by adding the prefix 
k C k*, thus displaying R* n / k as a radical extension. • 

We now have another proof of the existence of the classical formulas. 

Corollary 4.54. If k has characteristic 0, then every fix) e £[x] with deg(/) < 4 is 
solvable by radicals. 

Proof. If G is the Galois group of f(x), then G is isomorphic to a subgroup of S 4 . But 
S 4 is a solvable group, and so every subgroup of .S 4 is also solvable. By Galois’s theorem, 
f(x) is solvable by radicals. • 

Suppose we know the Galois group G of a polynomial fix) e Q[x] and that G is 
solvable. Can we use this information to find the roots of /(x)? The answer is affirmative; 
we suggest the reader look at the book by Gaal, Classical Galois Theory with Examples, 
to see how this is done. 

In 1827, N. H. Abel proved that if the Galois group of a polynomomial fix) is commu¬ 
tative, then fix) is solvable by radicals (of course, Galois groups had not yet been defined). 
This result was superseded by Galois’s theorem, proved in 1830, but it is the reason why 
abelian groups are so called. 

A deep theorem of W. Feit and J. G. Thompson (1963) says that every group of odd 
order is solvable. It follows that if k is a field of characteristic 0 and fix) e A'[x] is a 
polynomial whose Galois group has odd order, equivalently, whose splitting field has odd 
degree over k, then fix) is solvable by radicals. 
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The next proposition gives an example showing that the converse of Galois’s theorem 
is false in prime characteristic. 

Lemma 4.55. Ifk — F p (t), the field of rational functions overW p , then f(x ) = x p —x — t 
has no mots in k. 

Proof. If there is a root a of f(x) lying in k, then there are g(t), h(t) e F p [t] with 
a — g(t)/h(t)\ we may assume that (g, h) — 1. Since a is a root of f(x), we have 
(g/h) p — (g/h) — t\ clearing denominators, there is an equation g p — h p ~ l g — th p in 
F p [I], Hence, g \ th p . Since (g, h) = 1, we have g | t, so that g(t) = at or g(t ) is a 
constant, say, g(t) — b , where a,b e Fp. Transposing h p ~ 1 g in the displayed equation 
shows that h \ g p ; but (g, h) = 1 forces h to be a constant. We conclude that if a — g/h, 
then a = at or a = b. In the first case, 

0 = a p — a — t 
= ( at) p — (at) — t 
= a p t p — at — t 

= at p — at — t by Fermat’s theorem in F p 
= t(at p ~ l — a — 1 ). 

It follows that at p ~ l — a — 1=0. But a ^ 0, and this contradicts t being transcendental 
over F p . In the second case, a = b e Fp. But b is not a root of f(x), for f(b) = 
b p — b — t — —t, by Fermat’s theorem. Thus, no root a of fix) can lie in k. • 

Proposition 4.56. Let p be a prime, and let k — F p (t). The Galois group of f(x) = 
x p — x — t over k is cyclic of order p, but f(x) is not solvable by radicals over k. 

Proof. Let a be a root of fix). It is easy to see that the roots of f(x) are a + i, where 
0 < i < p, for Fermat’s theorem gives i p — i in F p , and so 

(a + i) p — (a + i) — t — a p + i p — a — i — t — a p — a — t — 0. 

It follows that f(x) is a separable polynomial and that k(a) is a splitting field of / (x) 
over k. We claim that f(x) is irreducible in £[;r]. Suppose that f(x) — g(x)h(x), where 

g(x) — x d + Cd~ix d ~ l H-h c 0 e k[x] 

and 0 < d < deg(/) = p\ then gix) is a product of d factors of the form a + i. 
Now — Cd-i e k is the sum of the roots: — Cd-\ — da + j, where j e Fp, and so 
da e k. Since 0 < d < p, however, d 0 in k, and this forces a e k, contradicting the 
lemma. Therefore, f(x) is an irreducible polynomial in k[x]. Since deg(/) = p, we have 
[k(a) : k] = p and, since f(x) is separable, we have | Gal(A:(a)/A:)| = [ k(a ) : k] — p. 
Therefore, Ga\.(k(a)/k) = Ip. 

It will be convenient to have certain roots of unity available. Let Q be the set of all 
qth roots of unity, where q < p is a prime divisor of pi. We claim that a f k(Q). On 
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the one hand, if n — W q < p L F then Q is contained in the splitting field of x" — 1 , and so 
[k(Q) : k\ | n!, by Theorem 4.3. It follows that p { [k(£2) : k]. On the other hand, if 
a e ffc(£2), then k(a) C k(tt) and [k(Q) : k] = [*(S2) : k(a)][k(a) : k] = p[k(Q) : k(a )]. 
Hence, p | [A: (£2) : k], and this is a contradiction. 

If f(x) were solvable by radicals over k(Q), there would be a radical extension 


k(Q) = B 0 c B x c • • • c B r 

with k(£2,a) C B r . We may assume, for each i > 1, that Bj/Bj-i is of prime type; 
that is, Bj = Bi-i(uj), where uf e B/_i and q, is prime. There is some j > 1 with 
a e B j but a B/_i. Simplifying notation, we set it ; = u, q f — q , B ; _ | = B, 
and Bj = B'. Thus, B' = B(u ), u q — b e B, a e B', and a, u B. We claim 
that f(x) — x p — x — t, which we know to be irreducible in k[x], is also irreducible in 
B[. r]. By accessory irrationalities. Exercise 4.5 on page 217, restriction gives an injection 
Gal (B(a)/B) -> Gal (k(a)/k)) = l p . If Gal (B(a)/B) = {1}, then B(a) = B and 
a e B, a contradiction. Therefore, Gal( B(a)/B) = I p , and fix) is irreducible in B[x), 
by Exercise 4.11 on page 218. 

Since u f. B' and B contains all the r/th roots of unity. Proposition 3.126 shows that 
x q — b is irreducible in B[x), for it does not split in B[x], Now B' — B(u) is a splitting 
field of x q — b, and so [B' : B] = q. We have B C B(a ) c B', and 

q = [B 1 : B] = [B' : B(a)][B(a) : B], 

Since q is prime, [B' : B{a)] = 1; that is, B' = B(a ), and so q — [ B’ : B], As a is a root 
of the irreducible polynomial fix) — x p — x — t e B[x), we have [B(a) : B] — p; there¬ 
fore, q = p. Now B(u) = B' = B(a) is a separable extension, by Proposition 4.38, for 
a is a separable element. It follows that u e B' is also a separable element, contradicting 
irr( m, B) = x q — b = x p — b = (x — u) p having repeated roots. 

We have shown that fix) is not solvable by radicals over k(£2). It follows that fix) is 
not solvable by radicals over k , for if there were a radical extension k = Rq c R\ C • • • C 
R, with k(a) C R t , then k(£i) — /?o(f2) C R\i£2) C • • • C R t )Q) would show that fix) 
is solvable by radicals over k(£l), a contradiction. • 

The discriminant of a polynomial is useful in computing its Galois group. 

Definition. If fix) — ]~[, (x — a,) e k[x), where A; is a field, define 

A = 

i<j 

and define the discriminant to he I) — Z)(/) = A 2 = P|; < /^ a '' — a ji¬ 
lt is clear that fix) has repeated roots if and only if its discriminant I) — 0. 

The product A = Y\i < j ( a i ~ a j) has one factor a, — a j for each distinct pair of indices 
(i, j ) (the restriction i < j prevents a pair of indices from occurring twice). If E/k is a 
splitting field of fix) and if G — Gal iE/k), then each a e G permutes the roots, and so 
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o permutes all the distinct pairs. However, it may happen that i < j while the subscripts 
involved in cr(a ; ) — <r(oij ) are in reverse order. For example, suppose the roots of a cubic 
are o , i,a 2 , and 013 , and suppose there is o e G with er(o , i) = 012 , < 7 ( 0 : 2 ) — «t> and 
( 7 ( 0 : 3 ) = < 73 . Then 

<7(A) = (a(o' 1 ) - a(a 2 ))(<7(o:i) - <7(a 3 ))(a(a 2 ) - <7(a 3 )) 

= (<X 2 - oil )(012 - 0 : 3 )(art - 0 : 3 ) 

= -(a 1 - 012 )(ci 2 - a 3 )(o:i - a 3 ) 

= -A. 

In general, each term a, — a j occurs in <7 (A) with a possible sign change. We conclude, 
for all a e Ga l(E/k), that cr(A) = ±A. It is natural to consider A 2 rather than A, for 
A depends not only on the roots of fix), but also on the order in which they are listed, 
whereas D — A 2 does not depend on the listing of the roots. For a connection between 
discriminants and the alternating group A n , see Proposition 4.59(ii) on page 241. 

Proposition 4.57. If fix) e k[x) is a separable polynomial, then its discriminant D lies 
in k. 

Proof. Let E /k be a splitting field of fix); since fix) is separable. Theorem 4.34 applies 
to show that E/k is a Galois extension. Each a e Gal (E/k) permutes the roots it \. ..., u n 
of fix), and 0 (A) = ±A, as we have just seen. Therefore, 

a(D) = cr(A 2 ) = cr(A) 2 = (±A) 2 = D, 

so that D e E°. Since E/k is a Galois extension, we have E G — k, and so D e k. • 

If fix) — x 2 + bx + c, then the quadratic formula gives the roots of fix): 
a = \{—b + s/b 2 — 4c) and ft = \{—b — V b 2 — 4c). 

It follows that 

D = A 2 = ia - f) 2 = b 2 - 4c. 

If f(x) is a cubic with roots a, ft, y, then 

D = A 2 = ia - f) 2 ia - y) 2 if - y) 2 ; 

it is not obvious how to compute the discriminant D from the coefficients of fix). 

Definition. Apolynomial fix) = x n +c„-\x"~ x -\ -|-co e k[x] is reduced ifc„_i = 0. 

If fix) is a monic polynomial of degree n and if c„_i f 0 in k, where char(L) = 0, then 
its associated reduced polynomial is 

fix) = fix - y t C„-l). 

If fix) = x n + +-1- co e k[ x] and f e k is a root of fix), then 

0 =/(/?) = fif -\cn-l). 

Hence, f is a root of fix) if and only if f — ^c„_ 1 is a root of fix). 
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Theorem 4.58. Let k be afield of characteristic 0. 

(i) A polynomial f(x) e k[x] and its associated reduced polynomial f(x) have the 
same discriminant. 

(ii) The discriminant of a reduced cubic f(x) = x 3 + qx + r is 

D = -4 q 3 - Hr 2 . 


Proof, (i) If the roots of f(x) — CjX 1 are oq ,..., a n , then the roots of f{x) are 
fii,.. ,,f n , where = a; + Therefore, fj — fj — a, — aj for all i, j, 

Y\{cq - a j ) = \\{fii-(l j ), 

i<j i<j 

and so the discriminants, which are the squares of these, are equal. 

(ii) The cubic formula gives the roots of f{x) as 

a — g + h, f> — cog + w 2 h, and y — co 2 g + coh, 

where g = [j(—r + -//?)] *^ 3 , h — —q/3g , R — r 2 + A q 3 , and w is a cube root of unity. 
Because to 2 = 1, we have 

a - P = (g + h) - (cog + co 2 h) 

— (g~ orh) - (cog - h) 

— (g — orh) — (g — arh)co 
= (g ~ arh)(l - w). 


Similar calculations give 

a - y = (g + h) - (arg + coh) = (g — wh)( 1 - or) 


and 


2 2 

f — y = (cog + co h) — (co g + coh) — (g — h)co( 1 — &>). 


It follows that 


A = (g — h)(g — coh)(g — co 2 h)co( 1 — &> 2 )(1 — co) 2 . 

By Exercise 4.14 on page 246, we have co(\ — w 2 )(l — co) 2 — 3/V3; moreover, the identity 
x 3 — 1 — (x — l)(x — co)(x — CO 2 ), 


with x — gjh, gives 


(g ~ h)(g - coh)(g - co 2 h) = g 3 - h 3 = Vr 
(we saw on page 208 that g 3 — h 3 — */R). Therefore, A = 3iV3V~R, and 
D = A 2 = -27 R = —Hr 2 - 4q 3 . • 
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Remark. Let k be a field, and let f(x ) = a m x m + a m -\x m 1 + • • • + a\x + ao and 
g(x) = b„x n + b„-ix n ~ l + ■ ■ ■ + b\x + bo e k[x]. Their resultant is defined as 

Res (/, g) = det(M), 

where M — M(f, g) is the (m + n ) x (m + n ) matrix 



— 1 


a\ 

a 0 


Mm 

a m —\ 


a i 

ao 



®m— 1 


a i 

bn bn— 1 


b\ 

bo 


bn 

bn—\ 


b\ 

bo 


bn 

bn — l 


b] 


there are n rows for the coefficients a,- of fix) and m rows for the coefficients bj of 
g(x); all the entries other than those shown are assumed to be 0. It can be proved that 
Res(/, g) = 0 if and only if / and g have a nonconstant common divisor. We mention the 
resultant here because the discriminant can be computed in terms of it: 

D(f) = (—l)" ( " -1)/2 Res(/, f'), 

where f'(x) is the derivative of fix). See the exercises in Dummit and Foote, Abstract 
Algebra, pages 600-602. < 

Here is a way to use the discriminant in computing Galois groups. 

Proposition 4.59. Let k be a field with characteristic 2, let f(x) € k[x] be a polyno¬ 
mial of degree n with no repeated roots, and let D — A 2 be its discriminant. Let E/k be 
a splitting field of f(x), and let G — Gal (E/k) be regarded as a subgroup of S„ (as in 
Theorem 4.3). 

(i) If H = A n n G, then E H = k( A). 

(ii) G is a subgroup of A„ if and only if \f~D e k. 

Proof, (i) The second isomorphism theorem gives H = ( G fl A „) <] G and 

[G : H] = [G : A n n G] = [A n G : A n ] < [S n : A„] = 2. 

By the fundamental theorem of Galois theory (which applies because fix) has no repeated 
roots, hence is separable), [E H : k] = [G : H], so that [E H : k] = [G : H] < 2. By 
Exercise 4.25 on page 248, we have k( A) C E A " , and so k{ A) C E H . Therefore, 

[E h : k] = [E h : k(A)][k(A) : k] < 2. 


( 5 ) 
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There are two cases. If [E H : k] = 1, then each factor in Eq. (5) is 1; in particular, 
[E h : ifc(A)] = 1 and E H = k{ A). If [E H : k] = 2, then [G : H] = 2 and there 
exists a e G, a <f A n , so that er(A) = — A. Now A / 0, because fix) has no repeated 
roots, and —A f A, because k does not have characteristic 2. Hence, A (ft E G = k and 
[k( A) : k] > 1. It follows from Eq. (5) that [E H : Ar(A)] = 1 and E H = k( A). 

(ii) The following are equivalent: G < A„; H = G IT A n = G; E H = E G — k. Since 
E h = k{ A), by part (i), E H — k is equivalent to k( A) = k\ that is, A = \f~D e k. • 

We now show how to compute Galois groups of polynomials over Q of low degree. 

If f(x) e Q[jc] is quadratic, then its Galois group has order either 1 or 2 (because the 
symmetric group has order 2). The Galois group has order 1 if fix) splits; it has order 2 
if f(x) does not split; that is, if fix) is irreducible. 

If fix) e Q[jc] is a cubic having a rational root, then its Galois group G is the same as 
that of its quadratic factor. Otherwise fix) is irreducible; since |G| is now a multiple of 3, 
by Corollary 4.9, and G < S 3 , it follows that either G = A 3 = I 3 or G = S 3 . 

Proposition 4.60. Let fix) e Q[x] be an irreducible cubic with Galois group G and 
discriminant D. 

(i) fix) has exactly one real root if and only if D <0, in which case G = S 3 . 

(ii) fix) has three real roots if and only if D > 0. In this case, either \flf e Q and 
G = I 3 , or VD ft Q and G = S 3 . 

Proof. Note first that D 0: Since Q has characteristic 0, irreducible polynomials over 
Q have no repeated roots. If fix) has three real roots, then A is real and D — A 2 > 0. 
The other possibility is that fix) has one real root a and two complex roots: ft = it + iv 
and ft — u — iv. Since ft — ft = 2iv and a — a, we have 

A = ia- ft)ia-ft)ift-ft) 

= ict — ft) (a — ft)ift — ft) 

= \a-ft\ 2 i 2 iv), 

and so D = A 2 = —4i> 2 |q: — ft \ 4 <0. 

Let E /Q be the splitting field of fix). If fix) has exactly one real root a , then E 
Q(a). Hence |G| > 3 and G = S 3 . If fix) has three real roots, then D > 0 and \[T) is 
real. By Proposition 4.59(h), G = A 3 = I 3 if and only if \flf is rational; hence G = S 3 if 
s/~D is irrational. • 


Example 4.61. 

The polynomial fix) — x 2 — 2 e Q[x] is irreducible, by Theorem 3.43. Its discriminant 
is D — —108, and so it has one real root; since V—108 ft Q (it is not even real), the Galois 
group of fix) is not contained in A 3 . Thus, the Galois group is S 3 . 
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The polynomial x 3 — 4x + 2 e Q[x] is irreducible, by Theorem 3.43 or by Eisenstein’s 
criterion; its discriminant is D — 148, and so it has 3 real roots. Since Vl48 is irrational, 
the Galois group is S 3 . 

The polynomial fix) = x 3 — 48x + 64 e Q[x] is irreducible, by Theorem 3.43; the 
discriminant is D = 2 12 3 4 , and so /(x) has 3 real roots. Since \[Y) is rational, the Galois 
group is A 3 = I 3 . < 

Before examining quartics, let us note that if d is a divisor of .S 4 = 24, then it is known 
that S4 has a subgroup of order d (see Exercise 5.23 on page 277). If d — 4, then Y and 
I 4 are nonisomorphic subgroups of order d ; for any other divisor d, any two subgroups of 
order d are isomorphic. We conclude that the Galois group G of a quartic is determined to 
isomorphism by its order unless | G \ —4. 

Consider a (reduced) quartic fix) = x 4 + gx 2 + rx + s e Q[x]; let E/Q be its splitting 
field and let G — Gal( E/Q) be its Galois group. [By Exercise 4.15 on page 246, there is 
no loss in generality in assuming that /(x) is reduced.] If /(x) has a rational root a, then 
fix) = (x — a)c(x), and its Galois group is the same as that of the cubic factor c(x); but 
Galois groups of cubics have already been discussed. Suppose that /(x) = h(x)£(x) is the 
product of two irreducible quadratics; let a be a root of h (x) and let ft be a root of (Ax). If 
Q(a ) fl Q(j3) = Q, then Exercise 4.17(iv) on page 246 shows that G = Y, the four group; 
otherwise, a e Q(/3), so that Q(/3) — Q(a, f) = E, and G has order 2. 

We are left with the case /(x) irreducible. The basic idea now is to compare G with the 
four group V, namely, the normal subgroup of 54 

V = {(1),(1 2)(3 4), (1 3)(2 4), (1 4)(2 3)}, 

so that we can identify the fixed field of V fl G. If the four (necessarily distinct) roots of 
f{x) are oq, 012 , 013 , 04 , consider the numbers [which are distinct, by Proposition 4.63(h)]: 

u — (oq + (X2K013 + (X4), 

v — (oq + oqXoq + «4), (6) 

w — (oq + a 4 )(a 2 + a 3 ). 

It is clear that if 0 e Y fl G, then o fixes m, i>, and w. Conversely, if a e .S 4 fixes 
u = (ai + Q! 2 )(a 3 + 0 : 4 ), then 

a eVUj(l 2), (3 4), (1 3 2 4), (1 4 2 3)}. 

However, none of the last four permutations fixes both v and w, and so 0 e G fixes each 
of u, v, w if and only if a e V fl G. Therefore, 

E vnc = q ( m> l)> u)) _ 

Definition. The resolvent cubic of fix) — x 4 + qx 2 + rx + s is 

g(x) = (x — u ) (x — v)ix — w), 
where u, v, w are the numbers defined in Eqs. ( 6 ). 
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Proposition 4.62. The resolvent cubic of f (x) = x 4 + qx 2 + rx + s is 
g(x ) = x 3 — 2 qx 2 + (q 2 — 4s) x + r 2 . 

Proof. If fix) = (x 2 + jx + £)(x 2 — jx + m), then we saw, in our discussion of the 
quartic formula on page 209, that j 2 is a root of 

h(x) — x 3 + 2 qx 2 + (q 2 — 4 s)x — r 2 , 

a polynomial differing from the claimed expression for g(x) only in the sign of its quadratic 
and constant terms. Thus, a number ft is a root of h(x) if and only if —f is a root of g(x). 

Let the four roots cq, cq, cq, cq of fix) be indexed so that a i, as are roots of x 2 +jx+i 
and cq, cq are roots of x 2 —jx+m. Then /' = — (oq+cq) and —7 = — (cq+cq); therefore, 

u = (aq + a 2)(«3 + “4) = — j 2 

and — u is a root of h(x) since h(j 2 ) = 0. 

Now factor f{x) into two quadratics, say, 

f(x) — (x 2 + jx + l)(x 2 — jx + in), 

where a\, oq are roots of the first factor and aj. «4 are roots of the second. The same 
argument as before now shows that 

d = (ari + (*3) ((*2 + 014) — - j 2 ; 


hence — v is a root of h{x). Similarly, — w = —(«i + oqHoq + “ 3 ) i s a root °f h(x)- 
Therefore, 

h(x) — {x + u){x + v)(x + u>), 


and so 


g(x) = (x — u)(x — v)(x — w) 

is obtained from h(x) by changing the sign of the quadratic and constant terms. 




Proposition 4.63. 

(i) The discriminant D(f) of a quartic polynomial fix) e Q[x] is equal to the dis¬ 
criminant Dig) of its resolvent cubic g)x). 

(ii) If fix) is irreducible, then g)x) has no repeated roots. 

Proof, (i) One checks easily that 

u — v — 0/1013 + a 20/4 — o/\ai — 0/30/4 — — (aq — aaXaq — 0/3). 

Similarly, 

u — w — — (oq — a 3 )(a 2 — a 4) and v — w — (ai — a 2 )(a 3 — 0/4). 

We conclude that Dig) = [(« — v)(u — w)iv — w)] 2 — [— fl i<j( a i ~ a i^]~ — Dif). 

(ii) If fix) is irreducible, then it has no repeated roots (for it is separable because Q has 
characteristic 0), and so D(f) 0. Therefore, Dig) — D)f) 0, and so gix) has no 
repeated roots. • 
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In the notation of Eqs. ( 6 ), if fix ) is an irreducible quartic, then u, v, w are distinct. 

Proposition 4.64. Let fix) e Q[x] be an irreducible quartic with Galois group G with 
discriminant D, and let m be the order of the Galois group of its resolvent cubic g(x). 

(i) If m = 6 , then G = S 4 . In this case, gix) is irreducible and \fT) is irrational. 

(ii) If m = 3, then G = A 4 . In this case, gix) is irreducible and ~J~D is rational. 

(iii) Ifhi — 1, then G = V. In this case, gix) splits in Q[x]. 

(iv) If m = 2, then G = D% or G = I4. In this case, gix) has an irreducible quadratic 

factor. 

Proof. We have seen that £’ vnc = Q(u, v, w). By the fundamental theorem of Galois 
theory, 

[G : VGG] = [£ VnG : Q] 

= [Q(«, V, w) : Q] 

= I Gal(Q(M, v, u>)/Q)| 

= m. 

Since fix) is irreducible, |G| is divisible by 4, by Corollary 4.9, and the group-theoretic 
statements follow from Exercise 4.28 on page 248 and Exercise 4.29 on page 248. Finally, 
in the first two cases, |G| is divisible by 12, and Proposition 4.59(ii) decides whether 
G = S 4 or G = A 4 . The conditions on gix) in the last two last two cases are easy to 
see. • 

We have seen that the resolvent cubic has much to say about the Galois group of the 
irreducible quartic from which it comes. 

Example 4.65. 

(i) Let fix) — x 4 — 4x + 2 e Q[x]; fix) is irreducible [the best way to see this is with 
Eisenstein’s criterion. Theorem 6.34, but we can also see that fix) has no rational roots, 
using Theorem 3.43, and then showing that fix) has no irreducible quadratic factors by 
examining conditions imposed on its coefficients]. By Proposition 4.62, the resolvent cubic 
is 

g(x) = x 3 — 8 x + 16. 

Now gix) is irreducible (again, the best way to see this uses some results of Chapter 6 : 
specifically. Theorem 6.30, for if we reduce mod 5, we obtain x 3 + 2x + 1, and this poly¬ 
nomial is irreducible over I5 because it has no roots). The discriminant of gix) is —4864, 
so that Theorem 4.60 shows that the Galois group of gix) is S 3 , hence has order 6 . Theo¬ 
rem 4.64 now shows that G = S 4 . 

(ii) Let fix) = x 4 — 10x 2 + 1 e Q[x]; fix) is irreducible, by Exercise 6.23(viii) on 
page 339. By Proposition 4.62, the resolvent cubic is 

x 3 + 20x 2 + 96x = x(x + 8 )(x + 12). 
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In this case, Q(u, v. w) — Q and m = 1. Therefore, G = V. [This should not be a surprise 
if we recall Example 3.122, where we saw that / (x) arises as the irreducible polynomial 
of a = V2 + V3, where Q(a) = Q(\/2, \/3).] ◄ 

An interesting open question is the inverse Galois problem : Which finite abstract 
groups G are isomorphic to GaKE/Q), where E /Q is a Galois extension? D. Hilbert 
proved that the symmetric groups S n are such Galois groups, and I. Shafarevich proved 
that every solvable group is a Galois group (see Neukirch-Schmidt-Wingberg, Cohomol¬ 
ogy of Number Fields). After the classification of the finite simple groups in the 1980s, it 
was shown that most simple groups are Galois groups. For more information, the reader is 
referred to Malle-Matzat, Inverse Galois Theory. 

Exercises 

4.14 Prove that co( 1 — co 2 )( 1 — a>) 2 — 3is/3, where to = e 2jn / 3 . 

4.15 (i) Prove that if a 0, then f(x) and af(x) have the same discriminant and the same 

Galois group. Conclude that it is no loss in generality to restrict attention to monic 
polynomials when computing Galois groups. 

(ii) Let k be a field of characteristic 0. Prove that a polynomial f(x) e k[x] and its associ¬ 
ated reduced polynomial / (x) have the same Galois group. 

4.16 (i) Let k be a field of characteristic 0. If f(x) = x 3 + ax 2 + bx + c e k[x\, then its 

associated reduced polynomial is x 3 + qx + r, where 

q = b — j« 2 and r = ^a 3 — ^ ab + c. 

(ii) Show that the discriminant of f(x) is 

D = a 2 b 2 — 4 Z? 3 — 4« 3 c — 27c 2 + 18 abc. 

4.17 Let k be a field, let fix) e k[x] be a separable polynomial, and let E/k be a splitting field of 
fix). Assume further that there is a factorization 

fix) = g{x)h{x) 

in k[x\ and that B/k and C/k are intermediate fields that are splitting fields of g(x) and h(x ), 
respectively. 

(i) Prove that Gal(£/6) and Gal(£/ C ) are normal subgroups of Gal iE/k). 

(ii) Prove that Gal(£/B) n Gal(£/C) = (1). 

(iii) If B fl C =k, prove that Gal (£'/ B) Gal {E/C) — Gal(i?/ k). (Intermediate fields B and 
C are called linearly disjoint if B Pi C = k.) 

(iv) Use Proposition 2.80 and Theorem 4.16 to show, in this case, that 

Gal (E/k) = Gal (B/k) x Gal(C/k). 

(Note that Gal (B/k) is not a subgroup of Gal (E/k).) 

(v) Use (iv) to give another proof that Gal(£/Q) = V, where E = Q(V? + v^3) [see 
Example 3.122 on page 190], 
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(vi) Let f(x) = (x 3 — 2)(x 3 — 3) e Q[x], If B/Q and C /Q are the splitting fields of x 3 — 2 
and x 3 — 3 inside C, prove that Gal(E/Q) Gal(6/Q) x Gal(C/Q), where E is the 
splitting field of fix) contained in C. 

4.18 Let k be a field of characteristic 0, and let fix) e k[x\ be a polynomial of degree 5 with 
splitting field E / k. Prove that fix) is solvable by radicals if and only if [E : k] < 60. 

4.19 (i) If C and C! are lattices, a function /: C -*■ Cf is called order-preserving if a < b in 

C implies fia) < fib) in Cf . Prove that if C and L'! are lattices and ip: C -¥ Cl is a 
bijection such that both i p and ip~ l are order-preserving, then 

i pia A b) = ipia) A i pib) and i pia V b) = ipia) V i pib). 

Hint. Adapt the proof of Lemma 4.42. 

(ii) Let E/k be a Galois extension with Gal iE/k) cyclic of order n. Prove that 

i p: Int iE/k) —*■ Div(n), 

[see Example 4.40(iv)] defined by i piL) = [L : k], is an order-preserving lattice iso¬ 
morphism. 

(iii) Prove that if L and K are subfields of ¥ p n , then 

[L V K : Fp] = lcm ([L : F p ], [K : F p ]) 

and 

[L n K : F p ] = gcd([L : F p ], [K : F p ]) . 

4.20 Find all finite fields k whose subfields form a chain ; that is, if k’ and k" are subfields of k, then 
either k’ C k" or k" C k!. 

4.21 (i) Let k be an infinite field, let fix) e L[x] be a separable polynomial, and let E = 

kia\, ..., a n ), where ai,... ,a n are the roots of fix). Prove that there are c,- e k so 
that E — kif), where f — cja i + • • • + c n a n . 

Hint. Use the proof of Steinitz’s theorem. 

(ii) iJanusz). Let k be a finite field and let E = kia, fi). Prove that if kia) and kif) are 
linearly disjoint [that is, if kia) n ki/3) — k], then E = kia + f). (This result is false in 
general. For example, N. Boston used the computer algebra system MAGMA to show 
that there is a primitive element a of F 2 6 and a primitive element f of F 2 io such that 
F 2 (a, f) = F 2 3o while F 2 (a + /?) = F->i5.) 

Hint. Use Exercise 4.19(iii) and Exercise 1.26 on page 13. 

4.22 Let E/k be a Galois extension with Galois group G = Gal(£ /k). Define the trace T : E -*■ E 
by 

Tin) — o)u). 

oeG 

(i) Prove that irn T C k and that Tin + v) = Tiu) + T)v) for all u,v e E. 

(ii) Use independence of characters to prove that T is not identically zero. 

4.23 Let E/k be a Galois extension with [E : k] = n and with cyclic Galois group G = Gal(£/ k), 
say, G = [a). 

(i) Define r = a — \p, and prove that ker T = ker r. 

Hint. Show that ker r = k, so that dim(im r) = n — 1 = dim(ker T). 
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(ii) Trace Theorem : Prove that if E/k is a Galois extension with cyclic Galois group 
Gal (E/k) = {a), then 

ker T — [a e E : a — er(u) — u for some u e E}. 

4.24 Let k be a field of characteristic p > 0, and let E/k be a Galois extension having a cyclic 
Galois group G = ( a) of order p. Using the trace theorem, prove that there is an element u e 
E with o(u) — u — 1. Prove that E = k(u) and that there is c e k with irr(u, k) = x p — x — c. 

4.25 If a e S n and fix \, ..., x n ) e k[x \, ..., x n ], where k is a field, define 

(of) Ot, ...,x n ) = f(x a i, ... ,x„„). 

(i) Prove that (a, fix i, ..., x n ) of defines an action of S n on k[x\ . x n ], 

(ii) Let A = A {x \, ..., x n ) = ]”[/<;' ( x i ~ x j ) (on P a g e 239, we saw that o A = ± A for all 
a G S n ). If a e S n . prove that a e A n if and only if o'A = A. 

Hint. Define i p: S n -¥ G, where G is the multiplicative group {1, —1}, by 

f 1 if o A = A; 

<p(o) = \ .... . 

I —1 if ct A = — A. 

Prove that <p is a homomorphism, and that ker^> = A„. 

4.26 Prove that if fix) e Q[jr] is an irreducible quartic whose discriminant is rational, then its 
Galois group has order 4 or 12. 

4.27 Let fix) = x 4 + rx + 4 1 e Q[x] have Galois group G. 

(i) Prove that the discriminant of fix) is —27r 4 + 256,s 3 . 

(ii) Prove that if s < 0, then G is not isomorphic to a subgroup of A 4 . 

(iii) Prove that fix) = x 4 + x + 1 is irreducible and that G = S 4 . 

4.28 Let G be a subgroup of S 4 with |G| a multiple of 4, and define m = |G/(G H V)|. 

(i) Prove that m is a divisor of 6 . 

(ii) If m — 6, then G = 54 ; if m = 3, then G = A 4 ; if m — 1, then G = V; if m = 2, then 
G = Dg, G = I 4 , or G = V. 

4.29 Let G be a subgroup of S 4 . If G acts transitively on X = (1, 2, 3, 4) and |G/(V fl G)| = 2, 

then G = Dg or G = I 4 . [If we merely assume that G acts transitively on X, then |G| is a 

multiple of 4 (Corollary 4.9). The added hypothesis |G/(V fl G)| = 2 removes the possibility 
G = V when m = 2 in Exercise 4.28.] 

4.30 Compute the Galois group over Q of x 4 + x 2 — 6 . 

4.31 Compute the Galois group over Q of fix) = x 4 + x 2 + x + 1. 

Hint. Use Example 3.35(ii) to prove irreducility of fix), and prove irreducibility of the 
resolvent cubic by reducing mod 2 . 

4.32 Compute the Galois group over Q of fix) — 4x 4 + I2x + 9. 

Hint. Prove that fix) is irreducible in two steps: First show that it has no rational roots, 
and then use Descartes's method (on page 209) to show that fix) is not the product of two 
quadratics over Q. 
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We now seek some structural information about groups. Finite abelian groups turn out to 
be rather uncomplicated: They are direct sums of cyclic groups. Returning to nonabelian 
groups, the Sylow theorems show, for any prime p, that finite groups G have subgroups of 
order p e , where // is the largest power of p dividing | G |, and any two such are isomorphic. 
The ideas of normal series and solvability that arose in Galois theory yield invariants of 
groups (the Jordan-Holder theorem), showing that simple groups are, in a certain sense, 
building blocks of finite groups. Consequently, we display more examples of simple groups 
to accompany the alternating groups A„, for n > 5, which we have already proved to be 
simple. This chapter concludes by investigating free groups and presentations, for they are 
useful in constructing and describing arbitrary groups. The chapter ends with a proof that 
every subgroup of a free group is itself a free group. 


5.1 Finite Abelian Groups 

We continue our study of groups by classifying all finite abelian groups; as is customary, 
we use the additive notation for the binary operation in these groups. We are going to prove 
that every finite abelian group is a direct sum of cyclic groups and that this decomposition 
is unique in a strong sense. 


Direct Sums 

Groups in this subsection are arbitrary, possibly infinite, abelian groups. 

Let us say at the outset that there are two ways to describe the direct sum of abelian 
groups Si,S n . The easiest version is sometimes called their external direct sum , 
which we denote by Si x • • • x S n \ its elements are the n-tuples (si,..., s„), where s,- e Si 
for all i , and its binary operation is 

(si, . . . , S„) + s',) = (*1 + 4-- S„ + 4). 


249 




250 


Groups II Ch. 5 


However, the most useful version, isomorphic to Si x • • • x S„, is sometimes called their 
internal direct sum ; it involves subgroups .S', of a given group G with G = Si x • • • x S„. 
We will usually omit the adjectives external and internal. 

The definition of the direct sum of two subgroups is the additive version of the statement 
of Proposition 2.80. 

Definition. If S and T are subgroups of an abelian group G, then G is the direct sum , 
denoted by 

G = 5 © T, 

if S + T — G (i.e., for each a e G, there are s e S and t e T with a = s + t) and 

snr = { 0 }. 

Here are several characterizations of a direct sum. 

Proposition 5.1. The following statements are equivalent for an abelian group G and 
subgroups S and T of G. 

(i) G = S ® T. 

(ii) Every g € G has a unique expression of the form 

g = s + t. 


where s e S and t e T. 

(iii) There are homomorphisms p: G —> S and q: G —> T, called projections, and 
i: S —> G and j : T —> G, called injections , such that 

pi = Is, qj — It, pj = 0, qi — 0, and ip + jq = Iq- 

Remark. The equations pi — Is and qj — 1 j imply that the maps i and j must be 
injections and the maps p and q must be surjections. -4 

Proof (i) =>• (ii) By hypothesis, G — S + T, so that each g e G has an expression 
of the form g — s + t with s e S and t e T. To see that this expression is unique, 
suppose also that g — s' + t ', where s' e S and t' e T. Then s + t = s' + t' gives 
s — s' = t' — teSHT = {0}. Therefore, s = s' and t = t', as desired. 

(ii) =>■ (iii) If g e G, then there are unique s e S and t e T with g = s + t. The functions 
p and q, given by 

Pig) = * and q(g) — t, 

are well-defined because of the uniqueness hypothesis. It is routine to check that p and q 
are homomorphisms and that all the equations in the statement hold. 

(iii) => (i) If g e G, the equation 1 q = ip + jq gives 

g = ip(g) + jqig) e S + T, 


because S = im i and T — i m j. 
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If g e S , then g = ig and pg = pig = g; if g e T, then g = jg and pg = pjg = 0. 
Therefore, if g e S fl T, then g — 0. Hence, S (IT — {0}, S + T — G. and G — S © T. • 

The next result shows that there is no essential difference between internal and external 
direct sums. 

Corollary 5.2. Let S and T be subgroups of an abelian group G. If G = S © T, then 
S®T = S xT. 

Conversely, given abelian groups S and T, define subgroups S' = S and T' = T of 
S x T by 

S' = {(s, 0) : s e S] and T 1 = {(0, t) : t e T}\ 
then S x T — S' ® T'. 

Proof. Define f:S®T->SxT as follows. If a e .S’® T, then the proposition says that 
there is a unique expression of the form a — s + t , and so/:ai-> (s, t) is a well-defined 
function. It is routine to check that / is an isomorphism. 

Conversely, if g — (s,t) e S x T. then g — ( s , 0) + (0, t) e S' + T’ and S' fl T' = 
{(0,0)}. Hence, S x T = S' © T'. • 


Definition. If Si, S 2 , • •. , S n , ... are subgroups of an abelian group G, define the finite 
direct sum .S’i ® .SS ® • ■ ■ ® S„ using induction on n > 2: 

Si © S 2 © • • • © S n+ 1 = [Si © S 2 © • • • © S„] © S n+ 1 . 


We will also denote the direct sum by 

n 

J2 s i = Si®S 2 ®---®S„. 

i= 1 

Given Si, S 2 , ■ ■ •, S n subgroups of an abelian group G, when is the subgroup they gen¬ 
erate, (Si, S 2 , ■ ■ ■, S n ), equal to their direct sum? A common mistake is to say that it is 
enough to assume that S; fl Sj — {0} for all i ^ j, but the following example shows that 
this is not enough. 

Example 5.3. 

Let V be a two-dimensional vector space over a field k , which we view as an additive 
abelian group, and let x, y be a basis. It is easy to check that the intersection of any 
two of the subspaces (x), (y), and (x + y) is {0}. On the other hand, we do not have 
V — [(x) © (y)] © (x + y) because [(x) © (y)] (T (x + y) # {0}. ◄ 

In the context of abelian groups, we shall write S c. G to denote S being a subgroup 
of G, as we do when denoting subrings and ideals; in the context of general, possibly 
nonabelian, groups, we will continue to write S < G to denote a subgroup. 
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Proposition 5.4. Let G = .S'i + Si + • ■ • + .S - „, where the Si are subgroups', that is, for 
each a e G, there are st e Si for all i, with 


a = si + *2 H-b s n . 


Then the following conditions are equivalent. 

(i) G — 5] © S 2 © • • • © S n . 

(ii) Every a e G has a unique expression of the form a — si + S 2 + • • • + s n , where 
Si e Si for all i. 

(iii) For each i, 

$i n (5i + $2 + • • • + Sj + • • • + s n ) = { 0 }, 

where Si means that the term Si is omitted from the sum. 

Proof, (i) =>■ (ii) The proof is by induction on n > 2. The base step is Proposition 5.1. 
For the inductive step, define T — S\ + S 2 + ■ ■ ■ + S„, so that G — T © S n + 1 . If a e G, 
then a has a unique expression of the form a — t + s n+ 1 , where t e T and s„ + i e S„+i 
(by the proposition). But the inductive hypothesis says that t has a unique expression of 
the form t — si + ■ ■ ■ + s n , where ,v, e Sj for all i < n, as desired. 

(ii) =>■ (iii) Suppose that 

x €. Si IT (Si + 52 + • • • + Si + • • • + 5 n ). 

Then x = Si e Si and ,v; = ff j-tj Sj, where sj e Sj. Unless all the sj — 0, the element 0 
has two distinct expressions: 0 = — s, + ^ ; sj and 0 = 0 + 0 + -- - + 0. Therefore, all 
sj — 0 and x = Sj — 0. 

(iii) => (i) Since S n+ 1 fl (5i + ^2 +-b S n ) = {0}, we have 

G = © (5i + ^2 + • • • + 5„). 

The inductive hypothesis gives Si + S 2 + ■ ■ ■ + S n = Si © S 2 © ■ • • © S n , because, for all 
j < n , we have 

Sj n (Si + • • • + Sj + • • • + sfj c Sj n (5i + • • • + Sj + • • • + s n + 

= {0}. . 

Corollary 5.5. Let G — (yi,..., y n ). If for all m ; e Z, we have JT m i y ( ' = 0 implies 
niiyt — 0; then 

G = (yi) © • • • © (y„). 

Proof. By Proposition 5.4(ii), it suffices to prove that if JT hi yi — iiyi, then Ayy,- = 
for all i. But this is clear, for JT (ki — if)yi — 0 implies (kf — ii)yi — 0 for all i. • 
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Example 5.6. 

Let V be an n -dimensional vector space over a field k, which we view as an additive abelian 
group. If ui,..., v n is a basis, then 

y = ( Vl }©(„ 2 ) 

where (i >,} = {rvt : r e k} is the one-dimensional subspace spanned by Vj. Each v e V 
has a unique expression of the form v = si + •••+«„, where Sj = r,Vi e (i>,}, because 
v \,..., v n is a basis. ◄ 

Now that we have examined finite direct sums, we can generalize Proposition 2.79 from 
two summands to a finite number of summands. Although we state the result for abelian 
groups, it should be clear that the proof works for nonabelian groups as well if we assume 
that the subgroups //, are normal subgroups (see Exercise 5.1 on page 267). 

Proposition 5.7. If G i, G 2 , .... G„ are abelian groups and II, C G, are subgroups, 
then 

(Gi ® • • • ® G n )/(H\ ©•••©//„) = (Gi/ffi) x • • • x (G n /H n ). 

Proof. Define / : G\ © • • • © G„ -> ( G\/H\ ) © • • • © ( G n /H n ) by 

(gl, . . . , gn) (g| + H\, .... g n + H„). 

Since / is a surjective homomorphism with ker / = Hi © • • • © H n , the first isomorphism 
theorem gives the result. • 

If G is an abelian group and m is an integer, let us write 

mG = {ma : a e G}. 

It is easy to see that mG is a subgroup of G. 

Proposition 5.8. If G is an abelian group and p is a prime, then G/pG is a vector space 
over F p . 

Proof. If [r] e F p and a e G, define scalar multiplication 

[r](a + pG) — ra + pG. 

This formula is well-defined, for if k = r mod p, then k = r + pm for some integer m, 
and so 

ka + pG — ra + pma + pG — ra + pG, 

because pma e pG. It is now routine to check that the axioms for a vector space do 
hold. • 
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Direct sums of copies of Z arise often enough to have their own name. 

Definition. Let F = {x\,, x n ) be an abelian group. If 

F = (x\) ® • • • © <x„), 

where each (x,-) = Z, then F is called a (finitely generated) free abelian group with basis 
x\, .... x„. More generally, any group isomorphic to F is called a free abelian group. 

For example, Z m = Z x • • • x Z, the group of all m-tuples n m ) of integers, is 

a free abelian group. 

Proposition 5.9. IfL m denotes the direct sum of m copies ofh, then Z m = Z" if and 
only if m = n. 

Proof. Only necessity needs proof. Note first, for any abelian group G, that if G — 
Gi © • • • © G n . then 2 G — 2G\ © • • • © 2 G n . It follows from Proposition 5.7 that 

G/2G = (Gi/2Gi) © • • • © (G n /2G n ), 

so that |G/2G| = 2". Similarly, if H = Z m , then \H/2H\ = 2 m . Finally, if G = IT = 
Z" 1 = H. then G/2G = H/2H and 2" = 2 m . We conclude that n = m. • 

Corollary 5.10. If F is a (finitely generated) free abelian group, then any two bases of F 
have the same number of elements. 

Proof. If xi,..., x„ is a basis of F, then F = Z", and if yi, ..., y m is another basis of 
F, then F = Z'". By the proposition, m — n. • 

Definition. If F is a free abelian group with basis xi,..., x„, then n is called the rank 
of F, and we write ranl<( F) = n. 

Corollary 5.10 says that rank) F) is well-defined; that is, it does not depend on the 
choice of basis. In this language. Proposition 5.9 says that two finitely generated free 
abelian groups are isomorphic if and only if they have the same rank; that is, the rank of 
a free abelian group plays the same role as the dimension of a vector space. Comparing 
the next theorem with Theorem 3.92 shows that a basis of a free abelian group behaves as 
does a basis of a vector space. 

Theorem 5.11. Let F be a free abelian group with basis X = {xi,..., x„}. If G is any 
abelian group and if y : X —»• G is any function, then there exists a unique homomorphism 
g: F -» G with g(x, ) = y(x;) for all x,. 
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Proof. Every element a e F has a unique expression of the form 

n 

a — J^miXi, 
i =l 

where m, e Z. Define g: F G by 

n 

8(a) = 2>y(*i). 

f=l 

If /;: F ■—> G is a homomorphism with h{xf) — g(xi) for all i, then h = g. for two 
homomorphisms that agree on a set of generators must be equal. • 

Theorem 5.11 characterizes free abelian groups. 


Proposition 5.12. Let A be an abelian group containing a subset X = {xi,..., x n }, and 
let A have the property in Theorem 5.11: For every abelian group G and every function 
y : X —>■ G, there exists a unique homomorphism g: A —> G with g(xj) = y(xj) for all 
Xi. Then A = Z”; that is, A is a free abelian group of rank n. 

Proof. Consider the diagrams 


A 

A 




and 


A 

-^Z" 


Z" 


q I 


h 

A 


—s- A, 

p 


where p: X —> A and q: X -> Z" are inclusions. The first diagram arises from the given 
property of A , and so gp — q\ the second arises from Theorem 5.11, which shows that Z" 
enjoys the same property; hence, hq — p. We claim that the composite g: A —> Tl l is an 
isomorphism. To see this, consider the diagram 


A 

" ■■ hg 

1 

A 


Now hgp — hq — p. By hypothesis, hg is the unique such homomorphism. But 1 
is another such, and so hg — 1 a- A similar diagram shows that the other composite 
gh = 1 in, and so g is an isomorphism. • 


Basis Theorem 

It will be convenient to analyze finite abelian groups “one prime at a time.” 

Recall that a p-group is a finite group G of order p k for some k > 0. When working 
wholly in the context of abelian groups, p-groups are called p-primary groups. 
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Definition. If p is a prime, then an abelian group G is p-primary if, for each a e G, there 
is n > 1 with p n a = 0 . 

If G is any abelian group, then its p-primary component is 

G p — {a e G : p n a = 0 for some n > 1}. 

It is easy to see, for every prime p, that G p is a subgroup of G (this is not the case when 
G is not abelian; for example, G 2 is not a subgroup if G — .S 3 ). 

If we do not want to specify the prime p, we may write that an abelian group is primary 
(instead of p-primary). 

Theorem 5.13 (Primary Decomposition). 

(i) Every finite abelian group G is a direct sum of its p-primary components: 

G = G Pl ® • • • ® G Pn . 

(ii) Two finite abelian groups G and G' are isomorphic if and only if G p = G' p for 
every prime p. 

Proof, (i) Let x e G be nonzero, and let its order be d. By the fundamental theorem of 
arithmetic, there are distinct primes pi,..., p„ and positive exponents e\,... ,e n with 

d = P V-P e n r 

Define r ; = d / p e f , so that p e 'r,- = d. It follows that r,x e G Pi for each i (because 
dx — 0). But the gcd d of r 1 ,..., r n is 1 (the only possible prime divisors of d are 
pi,..., p„; but no p; is a common divisor because p; j" r,); hence, there are integers 
,V|. ... ,s„ with 1 = s^r,. Therefore, 

X = Yl s i r i x e G PI H-t- G Pr . 

i 

Write Hi — G pi + G P2 + ■ ■ ■ + G Pi + ■ ■ ■ + G Pn . By Proposition 5.4, it suffices to 
prove that if 

x £ G Pi n Hj , 

then x = 0 . Since x e G Pl , we have pfx = 0 for some l > 0; since x e Hi. we have 
x — Jfj^i yj , where pV yj — 0 ; hence, ux — 0 , where u = Ylj^i P 8 / ■ Pi anc * u are 
relatively prime, so there exist integers 5 and t with 1 = spf + tu. Therefore, 

x — (spj + tu)x = sp t x + tux = 0 . 

(ii) If f:G —> G' is a homomorphism, then f(G p ) C G' p for every prime p, for if 
p e a = 0, then 0 = f(p l a) = p e f(a). If / is an isomorphism, then f~ l : G' —> G is 
also an isomorphism [so that f~ l (G l p ) c G p for all p]. It follows that each restriction 
f\G p : G p —> G' p is an isomorphism, with inverse / - 1 1 G' p . 

Conversely, if there are isomorphisms f p ; G p —> G' p for all p, then there is an isomor¬ 
phism <p: G p ->■ Yj P G ' P given by a p fp(a p ). • 
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The next type of subgroup will play an important role. 

Definition. Let p be a prime and let G be a p -primary abelian group. 1 A subgroup 5 C G 
is a pure 2 subgroup if, for all n > 0, 

SDp n G = p n S. 

The inclusion S fi p n G > p"S is true for every subgroup S C G. and so it is only 
the reverse inclusion S fl p n G C p" S that is significant. It says that if ,v e S satisfies an 
equation ,v = p n u for some a <= G, then there exists s' e S with ,v = p n s'. 

Example 5.14. 

(i) Every direct summand S of G is a pure subgroup. If G = S © T and (,v, 0) = p n (u , v), 
where u e S and v e T, then it is clear that (,v, 0) = p"{u. 0). (The converse: “Every 
pure subgroup S is a direct summand” is true when S is finite, but it may be false when S 
is infinite.) 

(ii) If G = (a) is a cyclic group of order p 2 , where p is a prime, then 5 = {pa) is not a 
pure subgroup of G, for if s — pa e S , then there is no element s' e S with s — pa = ps'. 


Lemma 5.15. If p is a prime and G is a finite p-primary abelian group, then G has a 
nonzero pure cyclic subgroup. 

Proof. Since G is finite, we may choose an element y e G of largest order, say, p l . We 
claim that S = {y) is a pure subgroup of G. 

Suppose that s e S, so that s — nip'y, where t > 0 and p \ m, and let 


for some a e G; an element s' e S must be found with s — p n s'. We may assume that 
n < l\ otherwise, s = p"a — 0 (for p'g = 0 for all g e G because y has largest order 
p l ), and we may choose s' — 0 . 

If t > n, define s' — mp'~"y e S , and note that 

yi ! n t—n t 

p s = p mp y = mp y = s. 


If t < n, then 


= p 1 n p n 


p e ~ n s = 


1 "mp'y — mp L n+t y. 


But p \ m and i — n + t < l, because —n+t < 0, and so p l a f 0. This contradicts y 
having largest order, and so this case cannot occur. • 


1 If G is not a primary group, then a pure subgroup S c G is defined to be a subgroup that satisfies SC\mG = 
mS for all m e Z (see Exercises 5.2 and 5.3 on page 267). 

^Recall that pure extensions k(u)/k arose in our discussion of solvability by radicals on page 206; in such an 
extension, the adjoined element u satisfies the equation u n = a for some a e k. Pure subgroups are defined in 
terms of similar equations (written additively), and they are probably so called because of this. 
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Definition. If p is a prime and G is a finite p-primary abelian group, then 

d(G) = dim (G/pG). 

Observe that d is additive over direct sums, 

d(G ® H) = d(G) + d(H), 


for Proposition 2.79 gives 

(G © H)/p(G © //) = (G © H)/{pG © pH) 

= 0 G/pG ) © ( H/pH). 

The dimension of the left side is d(G © II) and the dimension of the right-hand side 
is d(G) + d(H), for the union of a basis of G/pG and a basis of H/pH is a basis of 
(G/pG) © (7 1/pH). 

The nonzero abelian groups G with d(G) = 1 are easily characterized. 

Lemma 5.16. If G f {0} is p-primary, then d(G) — 1 if and only if G is cyclic. 

Proof. If G is cyclic, then so is any quotient of G; in particular, G/pG is cyclic, and so 
dim(G/pG) = 1. 

Conversely, if G/pG = {z + pG >, then G/pG = I p . Since I p is a simple group, the 
correspondence theorem says that pG is a maximal proper subgroup of G; we claim that 
pG is the only maximal proper subgroup of G. If L C G is any maximal proper subgroup, 
then G/L = \ p , for G/L is a simple abelian group of order a power of p, hence has order 
p (by Proposition 2.107, the abelian simple groups are precisely the cyclic groups of prime 
order). Thus, if a e G, then p(a + L) — 0 in G/L, so that pa e L; hence pG C L. But 
pG is maximal, and so pG = L. It follows that every proper subgroup of G is contained 
in pG (for every proper subgroup is contained in some maximal proper subgroup). In 
particular, if (z) is a proper subgroup of G, then (. z ) c pG , contradicting z + pG being a 
generator of G/pG. Therefore, G = (z), and so G is cyclic. • 

Lemma 5.17. Let G be a finite p-primary abelian group. 

(i) IfS c G, then d(G/S) < d(G). 

(ii) If S is a pure subgroup of G, then 

d(G) = d(S) + d(G/S). 

Proof, (i) By the correspondence theorem, p(G/S ) = (pG + S)/S, so that 

(< G/S)/p(G/S) = ( G/S)/[( P G + S)/S] = G/ipG + S), 

by the third isomorphism theorem. Since pG c pG + S, there is a surjective homomor¬ 
phism (of vector spaces over F p ), 


G/pG -* G/(pG + S), 
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namely, g + pG i-> g + ( pG + S). Hence, dim(G/^G) > dim(G/(pG + 5)); that is, 
d(G) > d(G/S). 

(ii) We now analyze (pG + S)/pG, the kernel of G/pG —»■ G/(pG + S ). By the second 
isomorphism theorem, 

(pG + S)/pG = S/(Sn P G). 

Since S is a pure subgroup, S D pG = pS\ therefore, 

(pG + S)/pG = S/pS, 

and so dim [(pG + S)/pG ] = d(S). But if W is a subspace of a finite-dimensional vector 
space V, then dim(V) = dim(W) + dim(V/W), by Exercise 3.72 on page 170. Hence, if 
V — G/pG and W = (pG + S)/pG, we have 

d{G) =d(S) + d{G/S). • 


Theorem 5.18 (Basis Theorem). Every finite abelian group G is a direct sum of cyclic 

groups of prime power orders. 

Proof. By the primary decomposition. Theorem 5.13, we may assume that G is ^-primary 
for some prime p. We prove that G is a direct sum of cyclic groups by induction on 
d(G) > 1. The base step is easy, for Lemma 5.16 shows that G must be cyclic in this case. 

To prove the inductive step, we begin by using Lemma 5.15 to find a nonzero pure cyclic 
subgroup S c G. By Lemma 5.17, we have 

d(G/S) = d(G) - d(S) = d(G) - 1 < d(G). 

By induction, G/S is a direct sum of cyclic groups, say, 

G/S = ][>>, 

i=i 


where x; = x,- + S. 

Let x e G and let x have order p 1 , where x = x + S. We claim that there is z e G with 
Z + S — x — x + S such that 

order z = order (x). 

Now x has order p n , where n > i. But p 1 (x + S) = p'x — 0 in G/S , so there is some 
s e S with //x = s. By purity, there is s' e S with p e x — p e s'. If we define z = x — s', 
then p t z — 0 and z + S=x + S = x. If z has order p m , then m > l because z i-> x; 
since p^z = 0, the order of z equals p e . 

For each i, choose Zi e G with z.i + S = x,- = x, + S and with order z.i = order x,; 
define T by 


T = <Z1, . .. , Zq). 
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Now S + T — G, because G is generated by S and the 74 ’s. To see that G — S © T, it now 
suffices to prove that S HT = {0}. If y e S fl 7\ then y — JT mm, where m, e Z. Now 
y e S, and so JT m ( x ( - = 0 in G/S. Since this is a direct sum, each /«,*,- = 0; after all, 
for each i, 

—mjXi — y^mjXj e (xt) n (<jci> H-h (xi) H-h { x q }) = {0}. 

j¥=‘ 

Therefore, mm = 0 for all i, and hence y = 0. 

Finally, G — S © T implies d(G) — d(S ) + d(T ) = 1 + d(T), so that d(T) < d{G). 
By induction, T is a direct sum of cyclic groups, and this completes the proof. • 

The shortest proof of the basis theorem that 1 know is due to G. Navarro, American 
Mathematical Monthly 110 (2003), pages 153-154. 

Lemma 5.19. A finite p-primary abelian group G is cyclic if and only if it has a unique 
subgroup of order p. 

Proof. Recall the unnumbered theorem on page 94: If G is an abelian group of order n 
having at most one cyclic subgroup of order p for every prime divisor p of n , then G is 
cyclic. The lemma follows at once when n is a power of p. The converse is Lemma 2.85. • 

Remark. We cannot remove the hypothesis that G be abelian, for the group Q of quater¬ 
nions is a 2-group having a unique subgroup of order 2. However, if G is a (possibly 
nonabelian) finite /r-group having a unique subgroup of order p, then G is either cyclic 
or generalized quaternion (the latter groups are defined on page 298). A proof of this last 
result can be found in Rotman, An Introduction to the Theory of Groups, pages 121-122. 

One cannot remove the finiteness hypothesis, for Proposition 9.25(iii) shows that the 
infinite ^-primary group Z(f>°°) has a unique subgroup of order p. a 

Lemma 5.20. If G is a finite p-primary abelian group and if a is an element of largest 
order in G, then A — (a) is a direct summand of G. 

Proof. The proof is by induction on |G| > 1; the base step is trivially true. We may as¬ 
sume that G is not cyclic, for any group is a direct summand of itself (with complementary 
summand {0}). Now A has a unique subgroup of order p\ call it C. By Lemma 5.19, G 
contains another subgroup of order p, say C'. Of course, A fi C' = {0}. By the second 
isomorphism theorem, (A + C')/C' = A/(AC\C') = Aisacyclic subgroup of G/C 1 . But 
no homomorphic image of G can have a cyclic subgroup of order greater than |A| (for no 
element of an image can have order larger than the order of a). Therefore, (A + C')/C' is 
a cyclic subgroup of G/C’ of largest order and, by the inductive hypothesis, it is a direct 
summand: There is a subgroup B/C', where C' C B C G, with 

G/C' = ((A + C')/C') © {B/C'). 

We claim that G — A © B. Clearly, G — A + C' + B — A + B (for C' C B), while 
A n b c a n ((A + C) n b) = a n c' = {0}. • 
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Theorem 5.21 (Basis Theorem). Every finite abelian group G is a direct sum of cyclic 
groups. 

Proof. The proof is by induction on |G| > 1, and the base step is obviously true. To 
prove the inductive step, let p be a prime divisor of |G|. Now G = G p (& II. where p \\H\ 
(either we can invoke the primary decomposition or reprove this special case of it). By 
induction, H is a direct sum of cyclic groups. If G p is cyclic, we are done. Otherwise, 
Lemma 5.20 applies to write G p — A © B, where A is cyclic. By the inductive hypothesis, 
B is a direct sum of cyclic groups, and the theorem is proved. • 

Another short proof of the basis theorem is due to R. Rado, Journal London Mathemat¬ 
ical Society 26 (1951), pages 75-76 and 160. We merely sketch the proof. 

Let G be an additive abelian group, and let x\,, x n be elements of G. Form the 
1 x n matrix A whose /th entry is xj. If U is an n x n matrix with entries in Z, then 
AG is another 1 xn matrix with entries in G, for its entries are Z-linear combinations of 
x\, .... x„. It is easy to check associativity: If U and V are n x n matrices with entries in 
Z, then X(UV) = (XU)V. Moreover, there is an obvious relation between the subgroups 
generated by AG and by A; namely, (AG) C (A). 

Lemma A. Let G be an additive abelian group, let x\,..., x n be elements of G, let X be 
the 1 x n matrix X whose jth entry is xj, and let U be an n x n matrix with entries in Z. 
//det(G) = 1, then (XU) = (A). 


Definition. An n x 1 matrix [a\,..., a„] with entries in a PID R is called a unimodular 
column if gcd (a\, ..., a„) = 1. 

Lemma B. If R is a PID, then every unimodular column [ai , ..., a„] is the first column of 
some n x n matrix U over R with det(G) = 1. 


Sketch of Proof. The proof is by induction on n > 2. If n = 2, then there are elements 

d i —$ 

s and r in R with ta i + sax — 1, and U = is a matrix of determinant 1. 

a 2 t 

The inductive step begins by setting d — gcd(ai, ..., a„_i) and defining bj — aj/d for 
i < n — 1. Since [b i,..., b„ _i] is a unimodular column, the inductive hypothesis says it 
is the first column of an (n — 1) x (n — 1) matrix U' of determinant 1. Now (a„, d) = 1, 
since [ai, ..., a n ] is a unimodular column, and so there are s,teR with td + sa„ — 1. 
These data are used, in a clever way, to modify U' and then augment it to form an n x n 
unimodular matrix with first column [a\,..., a„]. • 


Theorem, (i) If an abelian group G = (x \,..., x n ) and if[a \,.... a„] is a unimodular 
column, then there is a set of n generators ofG one of whose elements is a\x\ + ■ ■ ■ +a n x n . 


(ii )If G — (xi,..., x n ) is a finite abelian group, then G is a direct sum of cyclic groups. 

Proof, (i) By Lemma B, there is an n x n matrix G with det(G) = 1 whose first column 
is [ai, ..., a„]. Since det(G) = 1, Lemma A applies to say that the elements of AG, the 
first of which is a\X\ + ■ ■ ■ + a n x„, generate G. 
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(ii) Let n be the smallest cardinal of any generating set of G, and call such a generating 
set a minimal generating set. The proof is by induction on the number n of elements in a 
minimal generating set. If n — 1, then G is cyclic, and we are done. Of all the elements 
in minimal generating sets, choose one, say x, having smallest order, say k (so no minimal 
generating set contains an element of order less than k). Choose a minimal generating set 
[xi ,..., x n _ 1 , x\ containing x, and define x„ — x. Now II — (x\,, x n _i} is a proper 
subgroup of G, by minimality of n, and H is a direct sum of cyclic groups, by the inductive 
hypothesis. It suffices to prove that H fl (x n ) = {0}, for then G = H + (x n ) = H ® (x„), 
as desired. If, on the contrary, ( x n ) D H ^ {0}, then there are integers a\,..., a n with 
a n x n ^ 0 and a n x n = Yl'iZi a i x i e H (of course, we may assume that 0 < a n < k). Let 
d = gcd(«i,..., a n ). Now [a\/d, ..., a n /d] is a unimodular column, and so the element 
g = —(a n /d)x„ + Yl’iZi (a; /d)xj is part of a minimal generating set of G, by part (i). 
But dg — 0, and so the order of g is a divisor of d\ hence, g is an element of a minimal 
generating set that has order smaller than k, a contradiction. Therefore, (x n ) D H — {0}, 
and so G is a direct sum of cyclic groups. • 


Fundamental Theorem 

When are two finite abelian groups G and G' isomorphic? By the basis theorem, such 
groups are direct sums of cyclic groups, and so our first guess is that G = G’ if they 
have the same number of cyclic summands of each type. But this hope is dashed by Theo¬ 
rem 2.81, which says that if m and n are relatively prime, then I mn = I m x I„; for example, 
Ig = I 2 x I 3 . Thus, we retreat and try to count primary cyclic summands. But how can we 
do this? As in the fundamental theorem of arithmetic, we must ask whether there is some 
kind of unique factorization theorem here. 

Before stating the next lemma, recall that we have defined 

d(G) = dim (G/pG). 

In particular, d(pG) — dim(/;G/ p 2 G) and, more generally, 

d(p n G) = dim (p n G/p' ,+l G). 

Lemma 5.22. Let G be a finite p-primary abelian group, where p is a prime, and let 
G = JG Cj, where each Cj is cyclic. Ifb„ > 0 is the number of summands Cj having 
order p n , then there is some t > 1 with 


d(p n G ) — b n +1 + b „ + 2 + • • • + b t . 

Proof. Let B n be the direct sum of all Cj, if any, with order p". Thus, 

G = Bi © B 2 © • • • © B t 


for some t. Now 


p n G = p n B n+ i®---®p"B„ 
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because p n Bj = {0} for all j < n. Similarly, 

p n+1 G = p n+1 B n+2 © • • • © p n+l B t . 

Now Proposition 5.7 shows that p n G / p n+l G is isomorphic to 

\p n B n+l /p n+l B n+l ] © [p n B n+2 /p n+l B n+2 ] © • • • © [p n B,/p n+1 B t ]. 

Exercise 5.7 on page 267 gives d(p n B m /p’ ,+ l B m ) = dim ip"B m ) — b m for all n < m\ 
since d is additive over direct sums, we have 

d(p"G) — b n .|_i + b n+ 2 + ■ ■ ■ + b t . • 


The numbers b n can now be described in terms of G. 

Definition. Let G be a finite p-primary abelian group, where p is a prime. For n > 0, 
define 

U p (n, G) = d(p n G) - d(p n+l G). 

Lemma 5.22 shows that 

d(p n G) = b n+l + --- + b t 
and 

d( P >‘+ l G) = b n+2 + ---+b,, 

so that U p (n , G ) = b n +\. 

Theorem 5.23. If p is a prime, then any two decompositions of a finite p-primary abelian 
group G into direct sums of cyclic groups have the same number of cyclic summands of 
each type. More precisely, for each n > 0, the number of cyclic summands having order 
p" +1 is U p (n, G). 

Proof. By the basis theorem, there exist cyclic subgroups C, with G — ffj C,. The 
lemma shows, for each n > 0, that the number of C, having order p" +1 is U p (n , G), a 
number that is defined without any mention of the given decomposition of G into a direct 
sum of cyclics. Thus, if G — ffj Dj is another decomposition of G, where each Dj is 
cyclic, then the number of D j having order p" +1 is also U p (n, G), as desired. • 

Corollary 5.24. If G and G' are finite p-primary abelian groups, then G = G' if and 
only if Up(n, G) = U p (n, G') for all n > 0. 

Proof. If q> : G -> G' is an isomorphism, then (p(p n G ) = p"G' for all n > 0, and 
so (p induces isomorphisms of the F p -vector spaces p n G/p n+l G = p n G'/p n+i G' for all 
n > Oby p"g + p n+l G i-> p"(p(g) + p n+1 G l . Thus, their dimensions are the same; that 
is, U p {n, G) = Up in, G'). 

Conversely, assume that U p in,G) = U p in, G') for all n > 0. If G — ; C, and 

G' = ff j Cl, where the C, and C' are cyclic, then Lemma 5.22 shows that there are 
the same number of summands of each type, and so it is a simple matter to construct an 
isomorphism G -» G'. • 
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Definition. If G is a p- primary abelian group, then its elementary divisors are the num¬ 
bers in the sequence having U p (0, G) p’s, U p (l, G) p 2 ’ s, .. U p (t — 1, G) p 1 ’ s, where 
p' is the largest order of a cyclic summand of G. 

If G is a finite abelian group, then its elementary divisors are the elementary divisors of 
all its primary components. 

Theorem 5.25 (Fundamental Theorem of Finite Abelian Groups). Two finite abelian 
groups G and G' are isomorphic if and only if they have the same elementary divisors; that 
is, any two decompositions of G and G' into direct sums of primary cyclic groups have the 
same number of summands of each order. 

Proof. By the primary decomposition. Theorem 5.13(h), G = G' if and only if, for each 
prime p, their primary components are isomorphic: G p = G' p . The result now follows 
from Corollary 5.24. • 

Example 5.26. 

How many abelian groups are there of order 72? Now 72 = 2 3 3 2 , so that any abelian group 
of order 72 is the direct sum of groups of order 8 and order 9. There are three groups of 
order 8 , described by the elementary divisors 

(2,2,2), (2,4), and ( 8 ); 

there are two groups of order 9, described by the elementary divisors 

(3,3) and (9). 

Therefore, to isomorphism, there are six abelian groups of order 72. ■* 

Here is a second type of decomposition of a finite abelian group into a direct sum of 
cyclics that does not mention primary groups. 

Proposition 5.27. Every finite abelian group G is a direct sum of cyclic groups 

G = S(ci) ® S(c 2 ) © • • • © S(c t ), 
where t > 1, S(cj) is a cyclic group of order c,, and 


C\ I C 2 | • • • | C t . 

Proof. Let p \,..., p n be the prime divisors of |G|. By the basis theorem, we have, for 
each pi, 

G P , = Sip? 1 ) © Sip? 2 ) 0 • • • © S{p?'). 

We may assume that 0 < e,i < e,2 < • • • < moreover, we may allow “dummy” 
exponents ejj — 0 so that the same last index t can be used for all i. Define 
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It is plain that ci | ci \ ■ ■ ■ \ c t . Finally, Theorem 2.81 shows that 
S(p e i J ) © S(p e 2 2j ) 0 • • • 0 S{p e n ni ) = S(cj) 


for every j. • 


Definition. If G is an abelian group, then its exponent is the smallest positive integer m 
for which mG — {0}. 


Corollary 5.28. If G is a finite abelian group and G — 5(ci)05(c 2)©-• -0S(c,), S(c/) 
is a cyclic group of order Cj and ci | ci \ ■ ■ ■ \ c t , then c t is the exponent of G. 

Proof. Since c, | c, for all i, we have c,S(ci) = 0 for all i, and so c,G = {0}. On the 
other hand, there is no number e with 1 < e < c, with eS(c t ) = {0}, and so c, is the 
smallest positive integer annihilating G. • 


Corollary 5.29. Every noncyclic finite abelian group G has a subgroup isomorphic to 
© I cfor some c > 1. 

Proof. By Proposition 5.27, G = I fl © I C2 © • • • © I c ,, where t > 2, because G is not 
cyclic. Since ci | C 2 , the cyclic group I,.., contains a subgroup isomorphic to I n , and so G 
has a subgroup isomorphic to I C1 © I C1 . • 

Let us return to the structure of finite abelian groups. 

Definition. If G is a finite abelian group, and if 

G — S(ci) © S(C 2 ) © • • • © S(c t ), 

where t > 1 , S(cj) is a cyclic group of order cj > 1, and ci | C 2 | • • • | c t , then 
c i, c '2 .... , c t are called the invariant factors of G. 


Corollary 5.30. If G is a finite abelian group with invariant factors c\,... ,c t and ele¬ 
mentary divisors {pfi}, then |G| = Ylj=i c J = lhj pfi, and its exponent is c t . 

Proof. We have 


G — Z/(cj) © • • • © Z/(q) 

= Ici ® • ■ ■ © If, • 

Since the underlying set of a direct sum is the cartesian product, we have |G| = fl/= t c j 
and | G| = ]”[,/ p fi • That c, is the exponent was proved in Corollary 5.28. • 
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Example 5.31. 

In Example 5.26, we displayed the elementary divisors of abelian groups of order 72; here 
are their invariant factors. 


elementary divisors ** invariant factors 


(2,2,2, 3,3) = (2,2,2, 1,3,3) 
(2,4, 3,3) 
(8,3,3) = (1,8, 3, 3) 
(2, 2, 2, 9) = (2,2,2, 1, 1,9) 
(2,4,9) = (2,4, 1,9) 



2 

6 6 


6 

1 12 


3 

| 24 


2 

1 2 | 18 


2 

| 36 


(8,9) 72 <* 


Theorem 5.32 (Invariant Factors). 7vvo finite abelian groups are isomorphic if and 
only they have the same invariant factors. 


Proof Given the elementary divisors of G, we can construct invariant factors, as in the 
proof of Proposition 5.27: 


Ci 


eij e2j e„j 

P l Pi ■■■Pn , 


where those factors p e .’ 1 , //' 2 , • • • not equal to p9 = 1 are the elementary divisors of the 
Pi -primary component of G. Thus, the invariant factors depend only on G because they 
are defined in terms of the elementary divisors. 

To prove isomorphism, it suffices, by the fundamental theorem, to prove that the ele¬ 
mentary divisors can be computed from the invariant factors. Since Cj = p\ lj p*f j • ■ ■ Pn‘ J , 
the fundamental theorem of arithmetic shows that Cj determines all those prime powers 
p e .' j which are distinct from 1; that is, the invariant factors cj determine the elementary 
divisors. • 


In Example 5.31, we started with elementary divisors and computed invariant factors. 
Let us now start with invariant factors and compute elementary divisors. 


invariant factors -c* elementary divisors 


2|6|6 = 2|2-3|2-3 
6 | 12 = 2 • 3 | 2 2 • 3 
3 | 24 = 3 | 2 3 • 3 
2 I 2 | 18 = 2 | 2 | 2 • 3 2 
2 | 36 = 2 | 2 2 • 3 2 
72 = 2 3 • 3 2 


(2,2,2, 3,3) 
(2,4, 3,3) 
(8,3,3) 

(2, 2, 2, 9) 

(2.4.9) 

(8.9) . 


The results of this section will be generalized, in Chapter 9, from finite abelian groups 
to finitely generated abelian groups, where an abelian group G is finitely generated if there 
are finitely many elements ai,... ,a„ e G so that every x e G is a linear combination 
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of them: x — m ( a,-, where m,- e Z for all i. The basis theorem generalizes: Every 

finitely generated abelian group G is a direct sum of cyclic groups, each of which is a 
finite primary group or an infinite cyclic group; the fundamental theorem also generalizes: 
Given two decompositions of G into a direct sum of cyclic groups (as in the basis theorem), 
the number of cyclic summands of each type is the same in both decompositions. The basis 
theorem is no longer true for abelian groups that are not finitely generated; for example, 
the additive group Q of rational numbers is not a direct sum of cyclic groups. 

Exercises 

5.1 (i) Let G be an arbitrary, possibly nonabelian, group, and let S and T be normal subgroups 

of G. Prove that if S fl T = {1}. then st — ts for all s e 5 and t e T. 

Hint. Show that st.? -1 ? -1 e S fl T . 

(ii) Prove that Proposition 5.4 holds for nonabelian groups G if we assume that all the 
subgroups Sj are normal subgroups. 

5.2 Let G be an abelian group, not necessarily primary. Define a subgroup S C G to be a pure 
subgroup if, for all in e Z, 

5 fl mG — mS. 

Prove that if G is a p-primary abelian group, then a subgroup S C G is pure as just defined if 
and only if 5 fl p" G = p n S for all n > 0 (the definition in the text). 

5.3 Let G be a possibly infinite abelian group. 

(i) Prove that every direct summand 5 of G is a pure subgroup. 

Define the torsion 3 subgroup tG of G as 

tG = (a e G : a has finite order}. 

(ii) Prove that tG is a pure subgroup of G. [There exist abelian groups G whose torsion 
subgroup tG is not a direct summand (see Exercise 9. 1 (iii) on page 663); hence, a pure 
subgroup need not be a direct summand.] 

(iii) Prove that G/tG is an abelian group in which every nonzero element has infinite order. 

5.4 Let p be a prime and let q be relatively prime to p. Prove that if G is a p-group and g e G. 
then there exists x e G with qx = g. 

5.5 Let G = (a) be a cyclic group of finite order m. Prove that G/nG is a cyclic group of order 
d, where d = (in, n). 

5.6 For a group G and a positive integer n, define 

G[n] = [g e G : g n = 1}. 

Prove that G[n] = where d = (m, n), and conclude that G[n] = Ij. 

5.7 Prove that if B = B m = (x\) ® • • • © (xp m ) is a direct sum of b m cyclic groups of order p m , 
and if n < m, then the cosets p"x; + p" +1 B, for 1 < / < b m are a basis for p"6/p” +1 B. 
Conclude that d(p"B m ) = b m when n < in. [Recall that if G is a finite abelian group, then 
G/pG is a vector space over ¥ p and d(G) = dim(G/pG).] 

3 This terminology comes from algebraic topology. To each space X, a sequence of abelian groups is assigned, 
called homology groups, and if X is “twisted,” then there are elements of finite order in some of these groups. 
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5.8 (i) If G is a finite p-primary abelian group, where p is a prime, and if x e G has largest 

order, prove that (x) is a direct summand of G. 

(ii) Prove that if G is a finite abelian group and x e G has maximal order (that is, there is 
no element in G having larger order), then (x) is a direct summand of G. 

5.9 Prove that a subgroup of a finite abelian group is a direct summand if and only if it is a pure 
subgroup. 

Hint. Modify the proof of the basis theorem. Theorem 5.18. 

5.10 (i) If G and H are finite abelian groups, prove, for all primes p and all n > 0, that 

U p {n, G (B H) = U p {n, G ) + U p (n, H). 

(ii) If A, B, and C are finite abelian groups, prove that A © B = A © C implies B = C. 

(iii) If A and B are finite abelian groups, prove that A ® A = B ® B implies A = B. 

5.11 If n is a positive integer, then a partition of n is a sequence of positive integers i\ < i 2 < 
■ ■ ■ < ir with t’i + *2 + ■ ■ ■ + ir — n. If p is a prime, prove that the number of nonisomorphic 
abelian groups of order p n is equal to the number of partitions of n. 

5.12 Prove that there are, to isomorphism, exactly 14 abelian groups of order 288. 

5.13 Prove the uniqueness assertion in the fundamental theorem of arithmetic by applying the fun¬ 
damental theorem of finite abelian groups to G = I„. 

5.14 (i) If G is a finite abelian group, define 

V£(G) = the number of elements in G of order k. 

Prove that two finite abelian groups G and G' are isomorphic if and only if vj;(G) — 
VfciG') for all integers k. 

Hint. If B is a direct sum of k copies of a cyclic group of order p n , then how many 
elements of order p n are in 6? 

(ii) Give an example of two nonisomorphic not necessarily abelian finite groups G and G' 
for which ip(G) = v^iG') for all integers k. 

Hint. Take G of order /r\ 

5.15 Prove that the additive group Q is not a direct sum: Q ^ A ® B, where A and B are nonzero 
subgroups. 

Hint. If a,b e Q are not zero, then there is c e Q with a, b e (c). 

5.16 Let G = B\ ® B 2 ® • • • ® B t , where the B, are subgroups. 

(i) Prove that G\p\ = B { [p ] ® B 2 [p] ® • • • ® B t [p], 

(ii) Prove, for all n > 0 that 

p"G n G[p] = ( p"G n B^p]) ® (p n G n b 2 [ p ]) © • ■ • ® ( P ”G n b,[ p ]) 

= ( p"Bi n B { [pi) © ( p " b 2 n B 2 [p]) © ■ • • © (p n B, n b ,[ p ]). 


(iii) If G is a p-primary abelian group, prove, for all n > 0, that 


Upin, G) = dim 


/ p n G n G[p\ \ 
\p»+ l GnG[p]J 
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5.2 The Sylow Theorems 

We return to nonabelian groups, and so we revert to the multiplicative notation. The Sylow 
theorems are analogs, for finite nonabelian groups, of the primary components of finite 
abelian groups. 

Recall that a group G ^ {1} is called simple if it has no normal subgroups other than {1} 
and G itself. We saw, in Proposition 2.107, that the abelian simple groups are precisely the 
cyclic groups l p of prime order p, and we saw, in Theorem 2.112, that 4„ is a nonabelian 
simple group for all n > 5. In fact, A 5 is the nonabelian simple group of smallest order. 
How can we prove that a nonabelian group G of order less than 60 = |As| is not simple? 
Exercise 2.98 on page 114 states that if G is a group of order |G| = mp, where p is prime 
and 1 < m < p, then G is not simple. This exercise shows that many of the numbers less 
than 60 are not orders of simple groups. After throwing out all prime powers (p-groups 
are never nonabelian simple), the only remaining possibilities are 

12, 18, 24, 30, 36,40,45,48, 50,54, 56. 

The solution to the exercise uses Cauchy’s theorem, which says that G has a subgroup of 
order p. We shall see that if G has a subgroup of order p e instead of p, then Exercise 2.98 
can be generalized, and the list of candidates can be shortened. What proper subgroups of 
G do we know other than cyclic subgroups? The center Z(G) of a group G is a possible 
candidate, but this subgroup might not be proper or it might be trivial: if G is abelian, then 
Z(G) = G; if G = S 3 , then Z(G) = {1}. Hence, Z(G) cannot be used to generalize the 
exercise. 

Traite des Substitutions et des Equations Algebriques, by C. Jordan, published in 1870, 
was the first book on group theory (more than half of it is devoted to Galois theory, then 
called the theory of equations). At about the same time, but too late for publication in 
Jordan’s book, three fundamental theorems were discovered. In 1868, E. Schering proved 
the basis theorem: Every finite abelian group is a direct product of cyclic groups, each of 
prime power order; in 1870, L. Kronecker, unaware of Schering’s proof, also proved this 
result. In 1878, F. G. Frobenius and L. Stickelberger proved the fundamental theorem of fi¬ 
nite abelian groups. In 1872, L. Sylow showed, for every finite group G and every prime p. 
that if p e is the largest power of p dividing |G|, then G has a subgroup of order //, nowa¬ 
days called a Sylow subgroup ; we will use such subgroups to generalize Exercise 2.98. Our 
strategy for proving the Sylow theorems works best if we adopt the following definition. 

Definition. Let p be a prime. A Sylow p-subgroup of a finite group G is a maximal 
p-subgroup P . 

Maximality means that if Q is a p-subgroup of G and P < Q, then P = Q. 

It follows from Lagrange’s theorem that if p e is the largest power of p dividing |G|, 
then a subgroup of order p e , should it exist, is a maximal p-subgroup of G. One virtue 
of the present definition is that maximal p-subgroups always exist: indeed, we now show 
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that if S is any p-subgroup of G (perhaps S = {1}), then there exists a Sylow p-subgroup 
P containing S. If there is no p-subgroup strictly containing S, then S itself is a Sylow 
p-subgroup. Otherwise, there is a p-subgroup P\ with S < P\. If Pi is maximal, it is 
Sylow, and we are done. Otherwise, there is some p-subgroup P 2 with P\ < IP- This 
procedure of producing larger and larger p-subgroups P; must end after a finite number of 
steps, for 1^1 < |G| for all i ; the largest P, must, therefore, be a Sylow p-subgroup. 
Recall that a conjugate of a subgroup H < G is a subgroup of G of the form 

uZ/rG 1 = { aha -1 : h e ZZ}, 

where a e G. The normalizer of ZZ in G is the subgroup 

N g (H) = {a e G : aHa~ l = ZZ), 

and Proposition 2.101 states that if ZZ is a subgroup of a finite group G, then the number 
of conjugates of ZZ in G is [G : Ng(H)]. 

It is obvious that ZZ <] Nq(H ), and so the quotient group Nq (//)/// is defined. 

Lemma 5.33. Let P be a Sylow p-subgroup of a finite group G. 

(i) Every conjugate of P is also a Sylow p-subgroup of G. 

(ii) \N g (P)/P\ is prime to p. 

(iii) If a e G has order some power of p and ifaPa~ l — P, then a € P. 

Proof (i) If a e G, then aPa~ l is a p-subgroup of G; if it is not a maximal p-subgroup, 
then there is a p-subgroup Q with aPa~ l < Q. Hence, P < a~ l Qa, contradicting the 
maximality of P. 

(ii) If p divides \Ng(P)/P\, then Cauchy’s theorem shows that Ng(P)/P contains an 
element aP of order p, and hence Ng(P)/P contains a subgroup S* = ( aP) of order 
p. By the correspondence theorem (Theorem 2.76), there is a subgroup S with P < S < 
Ng(P ) such that S/P = S*. But S is a p-subgroup of Ng(P ) < G (by Exercise 2.75 on 
page 95) strictly larger than P, and this contradicts the maximality of P. We conclude that 
p does not divide \Ng(P)/P\- 

(iii) By the definition of normalizes the element a lies in AY; (P). If a f P, then the coset 
aP is a nontrivial element of Ng(P)/P having order some power of p; in light of part (ii), 
this contradicts Lagrange’s theorem. • 

Since every conjugate of a Sylow p-subgroup is a Sylow p-subgroup, it is reasonable 
to let G act by conjugation on the Sylow p-subgroups. 

Theorem 5.34 (Sylow). Let G be a finite group of order p^ 1 • • • p e> , and let P be a Sylow 
p-subgroup of G for some prime p — pj. 

(i) Every Sylow p-subgroup is conjugate to P. 
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(ii) If there are r / Sylow p/-subgroups, then r / is a divisor of | C7|/ pV and 

r/ = 1 mod pj. 

Proof. Let X — {P],..., P rj } he the set of all the conjugates of P, where we have 
denoted P by P\ . If Q is any Sylow p-subgroup of G, then Q acts on X by conjugation: 
If a e Q, then it sends 

Pi = giPgf 1 i-» a(giPg7 l )a~' = (agi)P(agi)~ x e X. 

By Corollary 2.99, the number of elements in any orbit is a divisor of | Q\\ that is, every 
orbit has size some power of p (because (lisa p-group). If there is an orbit of size 1, then 
there is some P\ with a P, a~ 1 = P, for all a e Q. By Lemma 5.33, we have a e Pj for 
all a e Q\ that is, Q < Pj. But Q, being a Sylow p-subgroup, is a maximal p-subgroup 
of G, and so Q = Pj. In particular, if Q — Pi, then there is only one orbit of size 1, 
namely, {Pi}, and all the other orbits have sizes that are honest powers of p. We conclude 
that |X| = rj = 1 mod p. 

Suppose now that there is some Sylow p-subgroup Q that is not a conjugate of P; thus, 
Q f Pj for any i. Again, we let Q act on X, and again, we ask if there is an orbit of 
size 1, say, {Pa-}. As in the previous paragraph, this implies Q = Pa, contrary to our 
present assumption that Q f X. Hence, there are no orbits of size 1, which says that each 
orbit has size an honest power of p. It follows that \X\ — r/ is a multiple of p; that is, 
r/ = 0 mod p, which contradicts the congruence r/ = 1 mod p. Therefore, no such Q 
can exist, and so all Sylow p-subgroups are conjugate to P. 

Finally, since all Sylow p-subgroups are conjugate, we have rj — [G : AV;(P)], and so 
r/ is a divisor of |G|. But rj = 1 mod p/ implies (rj, pV) — 1, so that Euclid’s lemma 
gives rj \ |G|/p^. • 

Corollary 5.35. A finite group G has a unique Sylow p-subgroup P, for some prime p, 
if and only if P <1 G. 

Proof. Assume that P, a Sylow p-subgroup of G, is unique. For each a e G, the conju¬ 
gate aPa~ l is also a Sylow p-subgroup; by uniqueness, aPa~ l = P for all a e G, and 
so P <1 G. 

Conversely, assume that P < G. If Q is any Sylow p-subgroup, then Q = aPa~ l for 
some a e G; but aPa~ l = P, by normality, and so Q — P. • 

The following result gives the order of a Sylow subgroup. 

Theorem 5.36 (Sylow). If G is a finite group of order p e m, where p is a prime and 
p { m, then every Sylow p-subgroup P of G has order p e . 

Proof. We first show that p \ [G : P], Now 

[G : P] = [G : A G (P)][A G (P) ; P], 
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The first factor, [ G : Ng(P)] — r, is the number of conjugates of P in G, and so p 
does not divide [G : Nq ( P )] because r = 1 mod p. The second factor, [ Nq( P) : P] = 
\Ng(P)/P\, is also not divisible by p, by Lemma 5.33. Therefore, p does not divide 
[G : P], by Euclid’s lemma. 

Now \P \ = p k for some k < e, and so 


[G : P] — \G\/\P\ = p e m/p k = p e ~ k m. 

Since p does not divide [G : P], we must have k — e; that is, |P| = p e . • 


Example 5.37. 

(i) Let G = S 4 . Now |S 4 1 = 24 = 2 3 3. Thus, a Sylow 2-subgroup of S 4 has order 8 . 
We have seen, in Exercise 2.83 on page 113, that .$4 contains a copy of the dihedral group 
Z >8 consisting of the symmetries of a square. The Sylow theorem says that all subgroups 
of order 8 are conjugate, hence isomorphic, to Z> 8 - Moreover, the number r of Sylow 2- 
subgroups is a divisor of 24 congruent to 1 mod 2; that is, r is an odd divisor of 24. Since 
c / 1 (see Exercise 5.17 on page 277), there are exactly three Sylow 2-subgroups. 

(ii) If G is a finite abelian group, then a Sylow p-subgroup is just its p-primary component 

(since G is abelian, every subgroup is normal, and so there is a unique Sylow p-subgroup 
for every prime p). -4 

Here is a second proof of the last Sylow theorem, due to H. Wielandt. 

Theorem 5.38. If G is a finite group of order p e m, where p is a prime and p j m, then 
G has a subgroup of order p e . 

Proof. If X is the family of all those subsets of G having exactly p e elements, then 
|.Xj = ( 7 y"); by Exercise 1.29 on page 14, p j |X|. Now G acts on X: define gB, for 
g e G and B e X, by 

gB = {gb :b e B}. 

If p divides \0(B)\ for every B e X, where O ( B) is the orbit of B, then p is a divisor 
of |XI, for X is the disjoint union of orbits, by Proposition 2.97. As p \ |X|, there exists 
a subset B with B — p e and with \0(B)\ not divisible by p. If Gb is the stabilizer of 
this subset B, then Theorem 2.98 gives [G : Gb] = \0{B)\, and so |G| = | Gb I ' \0{B)\. 
Since p e \ |G| and p \ 0(B)\, repeated application of Euclid’s lemma gives p e \ | G b I - 
Therefore, p e < \ G b I ■ 

For the reverse inequality, choose an element b e B and define a function r: Gb -> B 
by g !->■ gb. Note that r (g) = gb e gB = B , for g e Gb , the stabilizer of B. If g, 
h e Gb and h g , then r(7z) = hb gb — r(g); that is, r is an injection. We conclude 
that |Gb| < |Z?| = p e , and so Gb is a subgroup of G of order p e . • 



Sec. 5.2 The Sylow Theorems 


273 


Proposition 5.39. A finite group G all of whose Sylow subgroups are normal is the direct 
product of its Sylow subgroups. 

Proof. Let |G| = pj 1 ■ ■ ■ p e f and let G Pi be the Sylow p,-subgroup of G. We use Ex¬ 
ercise 5.1 on page 267, the generalization of Proposition 5.4 to nonabelian groups. The 
subgroup 5 generated by all the Sylow subgroups is G, for p e .‘ | .S' for all i. Finally, if 
x e G Pi n<U, ¥i G Pj >> then x = s i e G Pi and x = Y\jjti s j’ where sj e G P] . Now 
x p i = 1 for some n > 1. On the other hand, there is some power of pj, say qj, with 
.vj' = 1 for all j. Since the sj commute with each other, by Exercise 5.1 on page 267, we 
have 1 = x q — (Yljjti s j) q > where q = Y\jPi dj- Since ( p",q ) = 1, there are integers 
it and v with 1 = up" + vq, and so x = x 1 = x up > x vq — 1. Therefore, G is the direct 
product of its Sylow subgroups. • 

We can now generalize Exercise 2.98 on page 114 and its solution. 

Lemma 5.40. There is no nonabelian simple group G of order |G| = p e m, where p is 
prime, p \ m, and p e \ (m — 1)!. 

Proof. We claim that if p is a prime, then every p-group G with |G| > p is not sim¬ 
ple. Theorem 2.75 says that the center, Z(G), is nontrivial. But Z(G) <1 G, so that if 
Z(G) is a proper subgroup, then G is not simple. If Z(G) = G, then G is abelian, and 
Proposition 2.78 says that G is not simple unless |G| = p. 

Suppose that such a simple group G exists. By Sylow’s theorem, G contains a subgroup 
P of order //, hence of index m. We may assume that m > 1, for nonabelian p-groups are 
never simple. By Theorem 2.88, there exists a homomorphism tp: G -» S m with ker tp < 
P. Since G is simple, however, it has no proper normal subgroups; hence ker p — {1} and 
tp is an injection; that is, G = p(G) < S m . By Lagrange’s theorem, p e m \ ml, and so 
p e | (m — 1 )!, contrary to the hypothesis. • 

Proposition 5.41. There are no nonabelian simple groups of order less than 60. 

Proof. The reader may now check that the only integers n between 2 and 59, neither a 
prime power nor having a factorization of the form n = p e m as in the statement of the 
lemma, are n = 30,40, and 56. By the lemma, these three numbers are the only candidates 
for orders of nonabelian simple groups of order < 60. 

Assume there is a simple group G of order 30. Let P be a Sylow 5-subgroup of G, 
so that |P| —5. The number tp of conjugates of P is a divisor of 30 and tp = 1 mod 5. 
Now rs / 1 lest P <] G, so that tp = 6 . By Lagrange’s theorem, the intersection of 
any two of these is trivial (intersections of Sylow subgroups can be more complicated; 
see Exercise 5.18 on page 277). There are four nonidentity elements in each of these 
subgroups, and so there are 6 x 4 = 24 nonidentity elements in their union. Similarly, 
the number r 3 of Sylow 3-subgroups of G is 10 (for r 3 f 1, rj is a divisor of 30, and 
7-3 = 1 mod 3). There are two nonidentity elements in each of these subgroups, and so the 
union of these subgroups has 20 nonidentity elements. We have exceeded the number of 
elements in G, and so G cannot be simple. 
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Let G be a group of order 40, and let P be a Sylow 5-subgroup of G. If r is the number 
of conjugates of P, then r | 40 and r = 1 mod 5. These conditions force r — 1, so that 
P <3 G; therefore, no simple group of order 40 can exist. 

Finally, assume there is a simple group G of order 56. If P is a Sylow 7-subgroup of 
G, then P must have r-] — 8 conjugates (for rj | 56 and r-j = 1 mod 7). Since these 
groups are cyclic of prime order, the intersection of any pair of them is {1}, and so there 
are 48 nonidentity elements in their union. Thus, adding the identity, we have accounted 
for 49 elements of G. Now a Sylow 2-subgroup Q has order 8, and so it contributes seven 
more nonidentity elements, giving 56 elements. But there is a second Sylow 2-subgroup, 
lest Q < G, and we have exceeded our quota. Therefore, there is no simple group of 
order 56. • 

The “converse” of Lagrange’s theorem is false: If G is a finite group of order n, and 
if d | n, then G may not have a subgroup of order d. For example, we proved, in Propo¬ 
sition 2.64, that the alternating group A 4 is a group of order 12 having no subgroup of 
order 6. 

Proposition 5.42. Let G be a finite group. If p is a prime and if p k divides |G|, then G 
has a subgroup of order p k . 

Proof If |G| = p e m, where p \ m, then a Sylow p-subgroup P of G has order p e . 
Hence, if p k divides |G|, then p k divides \P\. By Proposition 2.106, P has a subgroup of 
order p k ; a fortiori, G has a subgroup of order p k . • 


What examples of p-groups have we seen? Of course, cyclic groups of order p" are 
p-groups, as is any direct product of copies of these. By the fundamental theorem, this 
describes all (finite) abelian p-groups. The only nonabelian examples we have seen so 
far are the dihedral groups ZL,, (which are 2-groups when n is a power of 2) and the 
quaternions Q of order 8 (of course, for every 2-group A, the direct products Z>8 x A and 
Q x A are also nonabelian 2-groups). Here are some new examples. 

Definition. A unitriangular matrix over a field k is an upper triangular matrix each of 
whose diagonal terms is 1. Define UT(n, k) to be the set of all n x n unitriangular matrices 
over k. 

Remark. We can generalize this definition by allowing k to be any commutative ring. 
For example, the group UT(n, Z) is an interesting group. < 

Proposition 5.43. Ifk is afield, then UT(n, k) is a subgroup of GL (n , k). 

Proof Of course, the identity I is unitriangular, so that I e UT(n, k). If A e UT(n, k ), 
then A — I + N, where N is strictly upper triangular; that is, N is an upper triangular 
matrix having only 0’s on its diagonal. Note that the sum and product of strictly upper 
triangular matrices is again strictly upper triangular. 
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Let e \,..., e„ be the standard basis of k n . If N is strictly upper triangular, define 
T: k n -* k n by T{ef) = Net. where et is regarded as a column matrix. Now T satisfies 
the equations, for all i, 

T(e i) = 0 and T(e i+ i) e {e \,..., a). 

It is easy to see, by induction on i, that 

T l (ej) = 0 for all j < 

It follows that T n — 0 and, hence, that N" — 0. Thus, if A e UT(n, k ), then A = I + N, 
where N n —0. 

To see that UT(n, k) is a subgroup of GL(n, k), first note that (7 + N)(I + M ) = 
I + (N + M + NM) is unitriangular. Second, we show that if A is unitriangular, then 
it is nonsingular and that its inverse is also unitriangular. In analogy to the power series 
expansion 1/(1 + x) = 1 — x + x 2 — x 3 + • • •, we define the inverse of A = I + N to be 
B — I — N + N 2 — IV 3 + • • • (note that this series stops after n — 1 terms because N" — 0), 
The reader may now check that BA = I = AB. so that B = A 1 . Moreover, N strictly 
upper triangular implies that — N + N 2 — A ' 3 H— • ± N"~ l is also strictly upper triangular, 
and so A~ l is unitriangular. (Alternatively, for readers familiar with linear algebra, we 
know that A is nonsingular, because its determinant is 1, and the formula for A -1 in terms 
of its adjoint [the matrix of cofactors] shows that A -1 is unitriangular.) Hence, UT(n, k) 
is a subgroup of GL(n, k). • 


Proposition 5.44. Let q = p e , where p is a prime. For each n > 2, UT(n, F^) is a 
p-group of order q( 2 ) = ^n(n-l)/2_ 

Proof. The number of entries in an n x n unitriangular matrix lying strictly above the 
diagonal is Q) = — 1 ) (throw away n diagonal entries from the total of n 2 entries; 

half of the remaining n 2 — n entries are above the diagonal). Since each of these entries 
can be any element of F ? , there are exactly q^ 2 ) n x n unitriangular matrices over F ? , and 
so this is the order of UT(«, F ? ). • 

Recall Exercise 2.26 on page 62: If G is a group and x 2 — 1 for all x e G, then G is 
abelian. We now ask whether a group G satisfying x p — 1 for all x e G, where p is an 
odd prime, must also be abelian. 

Proposition 5.45. If p is an odd prime, then there exists a nonabelian group G of order 
p 2 with x p = 1 for all x e G. 

Proof. If G = UT(3,F p ), then |G| = p 2 . Now G is not abelian; for example, the 
matrices 


"1 

1 

0 


"1 

0 

1 

0 

1 

1 

and 

0 

1 

1 

0 

0 

1 


0 

0 

1 
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do not commute. If A e G, then A — / + A' ; since p is an odd prime, p > 3, and so N p = 
0. The set of all matrices of the form gqI + a[N + ■ ■ ■ + a m N m , where a, e ¥ p , is easily 
seen to be a commutative ring in which pM — 0 for all M. But Proposition 3.2(vi) says 
that the binomial theorem holds in every commutative ring; since p | (?) when 1 < i < p, 
by Proposition 1.12, we have 

A p = (/ + N) p = I p + N p = I. • 

Theorem 5.46. Let denote the finite field with q elements. Then 


|GL(n, F,)| = (q n - 1 )(q" - q)(q n - q 2 ) ■ ■ ■ {q n - q n ~ l ). 

Proof. Let V be an n -dimensional vector space over F (/ . We show first that there is a 
bijection <l>: GL(n, F ? ) —> B. where B is the set of all bases of V. Choose, once for all, a 
basis e\, ..., e n of V. If T e GL(n, F ? ), define 

c t>(T) = Tei,...,Te n . 

By Lemma 3.103, <l>(7’) e B because T , being nonsingular, carries a basis into a basis. 
But <J> is a bijection, for given a basis v\,... ,v n , there is a unique linear transformation S , 
necessarily nonsingular (by Lemma 3.103), with Set — v, for all i (by Theorem 3.92). 

Our problem now is to count the number of bases tq,..., v n of V. There are q n vectors 
in V, and so there are q n — 1 candidates for rq (the zero vector is not a candidate). Having 
chosen iq, we see that the candidates for iq are those vectors not in (tq), the subspace 
spanned by tq; there are thus q" — q candidates for tq. More generally, having chosen a 
linearly independent list tq,..., tq, then u,-+i can be any vector not in {tq,..., t>;). Thus, 
there are q n — q' candidates for t>,+i. The result follows by induction on n. • 


Theorem 5.47. If p is a prime and q — p m , then the unitriangular group UT (n , F ? ) is a 
Sylow p-subgroup of GL(n, F ? ). 

Proof. Since q n — q' — q‘ iq n ~' — 1), the highest power of p dividing | GL (n, F ? )| is 

qq 2 q i ■■■q n ~ l — q( 2 \ 

But |UT(n, F ? )| = q(z\ and so it must be a Sylow /^-subgroup. • 


Corollary 5.48. If p is a prime and G is a finite p-group, then G is isomorphic to a 
subgroup of the unitriangular group UT(|G|, F^,). 

Proof. We show first, for every m > 1, that the symmetric group S m can be imbedded 
in GL (m,k), where k is a field. Let V be an m -dimensional vector space over k, and 
let tq,..., v m be a basis of V. Define a function tp: S m —> GL(Y) by o i-> T a . where 
T n : Vi Vrr(i) for all i. It is easy to see that <p is an injective homomorphism. By Cayley’s 
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theorem, G can be imbedded in Sc ; hence, G can be imbedded in GL(w,F p ), where 
m — |G|. Now G is contained in some Sylow ^-subgroup P of Gh(m,F p ), for every 
p-subgroup lies in some Sylow p-subgroup. Since all Sylow p-subgroups are conjugate, 
there is a e GL (in, ¥ p ) with P — a (UT(m, F p )) a -1 . Therefore, 

G = a~ l Ga < a~ l Pa < UT(»t, F p ). • 

A natural question is to find the Sylow subgroups of symmetric groups. This can be 
done, and the answer is in terms of a construction called wreath product (see Rotman, An 
Introduction to the Theory of Groups, page 176). 

Exercises 

5.17 How many Sylow 2-subgroups does S 4 have? 

5.18 Give an example of a finite group G having Sylow p-subgroups (for some prime p) P, Q and 
R such that P n Q = {1} and PflS / (1). 

Hint. Consider S 3 x 53 . 

5.19 A subgroup H of a group G is called characteristic if <p(H) < H for every isomorphism 
<p : G — > G. A subgroup S of a group G is called fully invariant if <p(S) < S for every 
homomorphism^): G —> G. 

(i) Prove that every fully invariant subgroup is a characteristic subgroup, and that every 
characteristic subgroup is a normal subgroup. 

(ii) Prove that the commutator subgroup, G', is a normal subgroup of G by showing that it 
is a fully invariant subgroup. 

(iii) Give an example of a group G having a normal subgroup H that is not a characteristic 
subgroup. 

(iv) Prove that Z(G), the center of a group G. is a characteristic subgroup (and so Z(G)<]G), 
but that it need not be a fully invariant subgroup. 

Hint. Let G = S 3 x IH. 

(v) For any group G, prove that if H <1 G, then Z(H) < G. 

5.20 If G is an abelian group, prove, for all positive integers m, that mG and G[/n] are fully invari¬ 
ant subgroups. 

5.21 (Frattini Argument). Let A be a normal subgroup of a finite group G. If P is a Sylow 
p-subgroup of K for some prime p, prove that 

G = KN C (P). 

where KNq(P) — {ab : a e K and b e Nq(P)}. 

Hint. If g e G, then gPg -1 is a Sylow p-subgroup of K, and so it is conjugate to P in K. 

5.22 Prove that UT(3, 2) = Hg, and conclude that D% is a Sylow 2-subgroup of GL(3, 2). 

Hint. You may use the fact that the only nonabelian groups of order 8 are Dg and Q. 

5.23 (i) Prove that if d is a positive divisor of 24, then S 4 has a subgroup of order d. 

(ii) If d ^ 4, prove that any two subgroups of 54 having order d are isomorphic. 



278 


Groups II Ch. 5 


5.24 (i) Find a Sylow 3-subgroup of 5g. 

Hint. (1, 2, 3, 4. 5, 6 ) = {1, 2, 3) U {4, 5, 6 }. 

(ii) Show that a Sylow 2-subgroup of Sg is isomorphic to Dg x I 2 . 

Hint. {1, 2, 3, 4. 5, 6 ) = {1, 2, 3, 4) U (5, 6 ). 

5.25 Let Q be a normal p-subgroup of a finite group G. Prove that Q < P for every Sylow 
p-subgroup P of G. 

Hint. Use the fact that any other Sylow p-subgroup of G is conjugate to P. 

5.26 (i) Let G be a finite group and let P be a Sylow p-subgroup of G. If H O G, prove that 

HP/H is a Sylow p-subgroup of G/H and H fl P is a Sylow p-subgroup of H. 

Hint. Show that [G/H : HP/H] and [H : H fl P] are prime to p. 

(ii) Let P be a Sylow p-subgroup of a finite group G. Give an example of a subgroup H of 
G with H fl P not a Sylow p-subgroup of H. 

Hint. Choose a subgroup H of S 4 with H = 53 , and find a Sylow 3-subgroup P of S 4 
with H HP = (1). 

5.27 Prove that a Sylow 2-subgroup of A 5 has exactly five conjugates. 

5.28 Prove that there are no simple groups of order 96, 120, 300, 312, or 1000. 

Hint. Some of these are not tricky. 

5.29 Let G be a group of order 90. 

(i) If a Sylow 5-subgroup P of G is not normal, prove that it has six conjugates. 

Hint. If P has 18 conjugates, there are 72 elements in G of order 5. Show that G has 
more than 18 other elements. 

(ii) Prove that G is not simple. 

Hint. Use Exercises 2.95(ii) and 2.96(ii) on page 114. 

5.30 Prove that there is no simple group of order 120. 

5.31 Prove that there is no simple group of order 150. 

5.32 If H is a proper subgroup of a finite group G, prove that G is not the union of all the conjugates 
of H: that is, G UxeG xHx~^. 


5.3 The Jordan-Holder Theorem 

Galois introduced groups to investigate polynomials in k [4 ], where k is a field of charac¬ 
teristic 0, and he saw that such a polynomial is solvable by radicals if and only if its Galois 
group is a solvable group. Solvable groups are an interesting family of groups in their own 
right, and we now examine them a bit more. 

Recall that a normal series of a group G is a finite sequence of subgroups, G = 
Go, Gi, G 2 ,..., G„ = {1}, with 

G = Go > Gi > G 2 > • • • > G„ = {1} 

and G,+i <1G, for all i. The factor groups of the series are the groups G 0 /G 1 , G 1 /G 2 , ■.., 
G n -\/G n , the length of the series is the number of strict inclusions (equivalently, the length 
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is the number of nontrivial factor groups), and G is solvable if it has a normal series whose 
factor groups are cyclic of prime order. 

We begin with a technical result that generalizes the second isomorphism theorem, for 
we will want to compare different normal series of a group. 


Lemma 5.49 (Zassenhaus Lemma). Given four subgroups ,4 <] A* and B <1 IT* of a 
group G, then A(A* fl B) <d A(A* ft B*), B(B* fl A) < B(B* fl A*), and there is an 
isomorphism 

A(A*(1B*) _ B(B*HA*) 

A(A* n B) ~ B(B*DA) ' 


Remark. The Zassenhaus lemma is sometimes called the butterfly lemma because of the 
following picture. I confess that I have never liked this picture; it doesn’t remind me of a 
butterfly, and it doesn’t help me understand or remember the proof. 


A(A* n B*) 


A(A* n B) 

I 

A 

I 

An b* 


A* n b* 


D — (A* fl B)(A n B*) 


B(A* n B*) 


B(A n B*) 
I 

B 
I 

A* n B 


The isomorphism is symmetric in the sense that the right side is obtained from the left 
by interchanging the symbols A and B. < 


Proof. We claim that (A D B*) <] (A* fl B*): that is, if c e A IT B* and x e A* fl B *, then 
xcx~ l e AC\ B*. Now xcx~ l e A because c e A, x e A*, and A < A*; but also xcx~ l e 
B*, because c, x e B*. Hence, (A n B*) <i (A* n B*); similarly, (A* n B) <3 (A* n B*). 
Therefore, the subset D , defined by D — (A fl B*)(A* fl B), is a normal subgroup of 
A* fl B*, because it is generated by two normal subgroups. 

Using the symmetry in the remark, it suffices to show that there is an isomorphism 

A(A* n B*) A* n B* 

~AfA*~CVB) D ' 

Define (p : A(A* fl B*) —> (A* fl B*)/D by q> : ax i-> xD. where a e A and x e A* fl B*. 
Now tp is well-defined: if ax — a!x', where a' e A and x' e A* fl B*. then ( a , )~ 1 a = 
x'x~ l e A n (A* n B*) — An B * < D\ also, ip is a homomorphism: axa'x' = a"xx ', 
where a" — a(xa'x~ l ) e A (because A <1 A*), and so iplaxa'x’) — (p(a”xx r ) — xx'D — 
<p(ax)ip(a'x'). It is routine to check that q> is surjective and that ker <p = A(A* fl B). The 
first isomorphism theorem completes the proof. • 

The reader should check that the Zassenhaus lemma implies the second isomorphism 
theorem: If S and T are subgroups of a group G with T <\ G. then TS/T = S/(S fl 7 ); 
set A* = G. A = T, B* — S. and B — S n T. 
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Definition. A composition series is a normal series all of whose nontrivial factor groups 
are simple. The nontrivial factor groups of a composition series are called composition 
factors of G. 

A group need not have a composition series; for example, the abelian group Z has no 
composition series. However, every finite group does have a composition series. 

Proposition 5.50. Every finite group G has a composition series. 

Proof. If the proposition is false, let G be a least criminal; that is, G is a finite group of 
smallest order that does not have a composition series. Now G is not simple, otherwise 
G > {1} is a composition series. Hence, G has a proper normal subgroup //; we may 
assume that H is a maximal normal subgroup, so that G/H is a simple group. But |//| < 
|G|, so that H does have a composition series; say, H — Hq > H\ > ■ ■ ■ > {1}, and 
G > Ho > H t > • • • > {1} is a composition series for G, a contradiction. • 

A group G is solvable if it has a normal series with factor groups cyclic of prime order. 
As cyclic groups of prime order are simple groups, a normal series as in the definition of 
solvable group is a composition series, and so composition factors of G are cyclic groups 
of prime order. 

Here are two composition series of G = (a), a cyclic group of order 30 (note that 
normality of subgroups is automatic because G is abelian). The first is 

G = (a) > {a 2 } > (a 10 > > {1}; 

the factor groups of this series are ( a)/(a 2 ) = I 2 , ( a 2 )/(a 10 ) = I 5 , and (a 10 }/{ 1} = 
(a 10 ) = I 3 . Another normal series is 

G — (a) > {a 5 ) > (fl 15 > > {1}; 

the factor groups of this series are ( a)/(a 5 ) = I 5 , ( a 5 )/(a 15 } = I 3 , and (a 15 )/{1} = 
(a 15 ) = I 2 . Notice that the same factor groups arise, although the order in which they arise 
is different. We will see that this phenomenon always occurs: Different composition series 
of the same group have the same factor groups. This is the Jordan-Holder theorem, and 
the next definition makes its statement more precise. 

Definition. Two normal series of a group G are equivalent if there is a bijection be¬ 
tween the sets of nontrivial factor groups of each so that corresponding factor groups are 
isomorphic. 

The Jordan-Holder theorem says that any two composition series of a group are equiv¬ 
alent. It will be more efficient to prove a more general theorem, due to Schreier. 

Definition. A refinement of a normal series is a normal series G = No, Ni ,..., Nk — {1} 
having the original series as a subsequence. 
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In other words, a refinement of a normal series is a new normal series obtained from the 
original by inserting more subgroups. 

Notice that a composition series admits only insignificant refinements; one can merely 
repeat terms (if G; /G, + i is simple, then it has no proper nontrivial normal subgroups and, 
hence, there is no intermediate subgroup L with G, > L > G l+ \ and L <\ Gi). Therefore, 
any refinement of a composition series is equivalent to the original composition series. 


Theorem 5.51 (Schreier Refinement Theorem). Any two normal series 

G — Go > Gi > • • • > G n — {1} 


and 


G = N 0 > Ni > • • • > N k = {1} 


of a group G have equivalent refinements. 


Proof. We insert a copy of the second series between each pair of adjacent terms in the 
first series. In more detail, for each i > 0, define 


Gij = G,+i (G, n Nj) 

(this is a subgroup because G /+1 < G;). Note that 

G,o = G, + i(G/ fl Nq) = G, + iG/ = Gi, 
because No = G, and that 


Gik — G/ + i(G/ n Nk) = G,_|_i, 

because Nk — {1}- Therefore, the series of G ;/ - is a subsequence of the series of G,-: 

■ ■ ■ > Gj — G,o > Gn > Gi2 > • • • > Gik = Gj +1 > • ■ • . 


Similarly, there is a subsequence of the second series arising from subgroups 


Npq = N p +\{N p fl G q ). 


Both subsequences have nk terms. For each i, j, the Zassenhaus lemma, for the four 
subgroups G,+i <1 Gj and N /+1 < Nj, says both subsequences are normal series, hence 
are refinements, and there is an isomorphism 


G;+i(G; rVNj) _ Nj+dNjnGj) 
Gi+iiGi n Nj+ 1 ) _ N i+ i(Nj n G i+ i)’ 


that is. 


Gij/Gij+i = Nj,i/Nj,i+ 1. 


The association Gij/G/j+i i-> Njj/Njj + \ is a bijection showing that the two refine¬ 
ments are equivalent. • 
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Theorem 5.52 (Jordan-Holder 4 Theorem). Any two composition series of a group G 
are equivalent. In particular, the length of a composition series, if one exists, is an invariant 

of G. 

Proof As we remarked earlier, any refinement of a composition series is equivalent to 
the original composition series. It now follows from Schreier’s theorem that any two com¬ 
position series are equivalent. • 

Here is a new proof of the fundamental theorem of arithmetic. 

Corollary 5.53. Every integer n > 2 has a factorization into primes, and the prime 
factors are uniquely determined by n. 

Proof Since the group I„ is finite, it has a composition series; let Si,..., S t be the factor 
groups. Now an abelian group is simple if and only if it is of prime order, by Proposi¬ 
tion 2.107; since n = |I„ | is the product of the orders of the factor groups (see Exercise 5.36 
on page 287), we have proved that n is a product of primes. Moreover, the Jordan-Holder 
theorem gives the uniqueness of the (prime) orders of the factor groups. • 


Example 5.54. 

(i) Nonisomorphic groups can have the same composition factors. For example, both I 4 
and V have composition series whose factor groups are I 2 , 12 . 

(ii) Let G — GL(2, F 4 ) be the general linear group of all 2 x 2 nonsingular matrices with 
entries in the field F 4 with four elements. Now det: G —> (F 4 ) x , where (F 4 ) x = I 3 is 
the multiplicative group of nonzero elements of F 4 . Since kerdet = SL(2, F 4 ), the special 
linear group consisting of those matrices of determinant 1 , there is a normal series 

G = GL(2, F 4 ) > SL(2, F 4 ) > (1). 

The factor groups of this normal series are I 3 and SL(2, F 4 ). It is true that SL(2, F 4 ) 
is a nonabelian simple group [in fact. Corollary 5.68 says that SL(2, F 4 ) = A 5 ], and so 
this series is a composition series. We cannot yet conclude that G is not solvable, for the 
definition of solvability requires that there be some composition series, not necessarily this 
one, having factor groups of prime order. However, the Jordan-Holder theorem says that 
if one composition series of G has all its factor groups of prime order, then so does every 
other composition series. We may now conclude that GL(2, F 4 ) is not a solvable group. 4 

We now discuss the importance of the Jordan-Holder theorem in group theory. 
Definition. If G is a group and K <\G, then G is called an extension of K by G/K. 

4 In 1868, C. Jordan proved that the orders of the factor groups of a composition series depend only on G and 
not upon the composition series; in 1889, O. Holder proved that the factor groups themselves, to isomorphism, 
do not depend upon the composition series. 
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With this terminology. Exercise 2.75 on page 95 says that an extension of one //-group 
by another //-group is itself a //-group, and Proposition 4.24 says that any extension of one 
solvable group by another is itself a solvable group. 

The study of extensions involves the inverse question: How much of G can be recovered 
from a normal subgroup K and the quotient Q — G j K ? For example, we do know that if 
K and Q are finite, then | G \ = \ K 11 Q \ . 

Example 5.55. 

(i) The direct product K x Q is an extension of K by Q (and K x Q is an extension of Q 
by K). 

(ii) Both S 3 and 16 are extensions of I 3 by I 2 . On the other hand, 16 is an extension of I 2 

by I 3 , but .S 3 is not, for .S 3 contains no normal subgroup of order 2. 4 

We have just seen, for any given pair of groups K and Q, that an extension of K by 
Q always exists (the direct product), but there may be nonisomorphic such extensions. 
Hence, if we view an extension of K by Q as a “product” of K and Q, then this product is 
not single-valued. The extension problem is to classify all possible extensions of a given 
pair of groups K and Q. 

Suppose that a group G has a normal series 

G = K 0 > Ki > K 2 > • • • > K„-i > K n = {1} 
with factor groups Q\,..., Q n , where 

Qi = Ki-i/Kj 

for all i > 1. Now K n — {1}, so that K n -\ — Q„, but something more interesting occurs 
next: K n - 2 /K n -\ — Q n ~\, so that K n -2 is an extension of K n -\ by Q n -\■ If we could 
solve the extension problem, then we could recapture K n _2 from K n _\ and Q n -i —that is, 
from Q n and Q n -\. Next, observe that K n _i,/K n _2 — Qn- 2 , s ° that K n _3 is an extension 
of K n -2 by 2 / 1 - 2 - If we could solve the extension problem, then we could recapture 
K n -3 from K n -2 and Q„- 2 ; that is, we could recapture K n -3 from Q n . Q n - 1 , and Q n -i- 
Climbing up the composition series in this way, we end with G = Kq being recaptured 
from Q n , Q n - 1 ,..., Q 1 • Thus, G is a “product” of the factor groups. If the normal 
series is a composition series, then the Jordan-Holder theorem says that the factors in this 
product (that is, the composition factors of G) are uniquely determined by G. Therefore, 
we could survey all finite groups if we knew the finite simple groups and if we could solve 
the extension problem. Now all the finite simple groups were classified in the 1980s; this 
theorem, one of the deepest theorems in mathematics, gives a complete list of all the finite 
simple groups, along with interesting properties of them. In a sense, the extension problem 
has also been solved. In Chapter 10, we will give a solution to the extension problem, due 
to Schreier, which describes all possible multiplication tables for extensions; this study 
leads to cohomology of groups and the Schur—Zassenhaus theorem. On the other hand, the 
extension problem is unsolved in that no one knows a way, given K and Q, to compute the 
exact number of nonisomorphic extensions of K by Q. 
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We now pass from general groups (whose composition factors are arbitrary simple 
groups) to solvable groups (whose composition factors are cyclic groups of prime order; 
cyclic groups of prime order are simple in every sense of the word). Even though solv¬ 
able groups arose in determining those polynomials that are solvable by radicals, there 
are purely group-theoretic theorems about solvable groups making no direct reference to 
Galois theory and polynomials. For example, a theorem of P. Hall generalizes the Sylow 
theorems as follows; If G is a solvable group of order ab , where a and b are relatively 
prime, then G contains a subgroup of order a; moreover, any two such subgroups are con¬ 
jugate. A theorem of W. Burnside says that if |G| = p m q n , where p and q are prime, 
then G is solvable. The remarkable Feit-Thompson theorem states that every group of odd 
order must be solvable. 

Solvability of a group is preserved by standard group-theoretic constructions. For ex¬ 
ample, we have seen, in Proposition 4.21, that every quotient G/N of a solvable group G 
is itself a solvable group, while Proposition 4.22 shows that every subgroup of a solvable 
group is itself solvable. Proposition 4.24 shows that an extension of one solvable group by 
another is itself solvable: If H <\G and both H and G/H are solvable, then G is solvable, 
and Corollary 4.25 shows that a direct product of solvable groups is itself solvable. 

Proposition 5.56. Every finite p-group G is solvable. 

Proof. If G is abelian, then G is solvable. Otherwise, its center, Z(G), is a proper non¬ 
trivial normal abelian subgroup, by Theorem 2.103. Now Z(G) is solvable, because it is 
abelian, and G/Z(G) is solvable, by induction on |G|, and so G is solvable, by Proposi¬ 
tion 4.24. • 

It follows, of course, that a direct product of finite p-groups is solvable. 

Definition. If G is a group and x, y e G, then their commutator [x, y] is the element 

[x, y] = xyx~ l y~ l . 

If X and Y are subgroups of a group G, then [ X , Y] is defined by 
[X, Y] = ([x, y]:x eX and y e Y). 

In particular, the commutator subgroup G' of a group G is 

G' = [G, G], 

the subgroup generated by all the commutators. 5 

It is clear that two elements x and y in a group G commute if and only if their commu¬ 
tator [x, y] is 1. The next proposition generalizes this observation. 

5 The subset consisting of all the commutators need not be closed under products, and so the set of all com- 
mutators may not be a subgroup. The smallest group in which a product of two commutators is not a commutator 
has order 96. Also, see Carmichael’s exercise on page 297. 
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Proposition 5.57. Let G be a group. 

(i) The commutator subgroup G' is a normal subgroup of G, and GIG' is abelian. 

(ii) If H <\ G and G/H is abelian, then G' < H. 

Proof, (i) The inverse of a commutator xyx _1 y _1 is itself a commutator: [x, v]~ 1 = 
yxy~ x x~ { — [v. x]. Therefore, each element of G' is a product of commutators. But any 
conjugate of a commutator (and hence, a product of commutators) is another such: 

a[x,y]a~ l — a{xyx~ l y~ l )a~ x 

— axa~ l aya~ l ax~ l a~ l ay~ l a~ l 
= [axa~ l , aya~ 1 ]. 

Therefore, G' < G. (Alternatively, G' is a fully invariant subgroup of G, for if <p : G —»• G 
is a homomorphism, then <p([x, y]) = [tp{x), tp{y)\ e G '.) 

IfaG',Z?G' e G/G', then 

aG'bG'(aG'r\bG')~ l = aba^b^G' = [a, b]G' = G', 
and so G/G ’ is abelian. 

(ii) Suppose that H <\ G and G/H is abelian. If a , b e G, then aHbH — bHaH\ that is, 
abH = haII, and so b~ l a~ l ba e H. As every commutator has the form b~ l a~ l ba, we 
have G' < H. • 


Example 5.58. 

(i) A group G is abelian if and only if G' = {1}. 

(ii) If G is a simple group, then G' = {1} or G' — G, for G' is a normal subgroup. The 
first case occurs when G has prime order; the second case occurs otherwise. In particular, 
(A,,)' = A„ for all n > 5. 

(iii) We show that ( S= A„ for all n > 5. Since S n /A n = I 2 is abelian. Proposition 5.57 
shows that ( S„Y < A„. For the reverse inclusion, note that ( S n )' IT A n < A n , so that the 
simplicity of A n gives this intersection trivial or A n . Clearly, ( S„Y fl A„ {(1)}, and so 

A n < ( s n y. <* 

Let us iterate the formation of the commutator subgroup. 

Definition. The derived series of G is 

G = G <0) > G (1) > G (2) > • • • > G (,) > G (!+1) > • • • , 

where G (0) = G, G (1) = G', and, more generally, G (, ' +1) = (G {i) Y = [G <;) , G (i) ] for all 
i > 0. 
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It is easy to prove by induction on i > 0, that G (,) is fully invariant, which implies that 
G <n <\ G; it follows that G (,+l 1 <1 G il K and so the derived series is a normal series. The 
derived series can be used to give a characterization of solvability: G is solvable if and 
only if the derived series reaches {1}. 

Proposition 5.59. 

(i) A finite group G is solvable if and only if it has a normal series with abelian factor 
groups. 

(ii) A finite group G is solvable if and only if there is some n with 

G (n) = {1}. 

Proof, (i) If G is solvable, then it has a normal series whose factor groups G,/G,+i are 
all cyclic of prime order, hence are abelian. 

Conversely, if G has a normal series with abelian factor groups, then the factor groups 
of any refinement are also abelian. In particular, the factor groups of a composition series 
of G, which exists because G is finite, are abelian simple groups; hence, they are cyclic of 
prime order, and so G is solvable. 

(ii) Assume that G is solvable, so there is a normal series 

G > G 1 > G2 > • • • > G;, = {1} 

whose factor groups G,-/G,-+ 1 are abelian. We show, by induction on i > 0, that G (l 1 < 
Gj. Since G (0;i — G — Go, the base step is obviously true. For the inductive step, since 
Gj/Gj+i is abelian. Proposition 5.57 gives (&’, )' < G, + 1 . On the other hand, the inductive 
hypothesis gives G ( '^ < G,-, which implies that 

G (m) = (G (,) y < {Gif < G i+ i. 

In particular, G (,,) < G n = {1}, which is what we wished to show. 

Conversely, if G (n> — {1}, then the derived series is a normal series (a normal series 
must end with {1}) with abelian factor groups, and so part (i) gives G solvable. • 

For example, the derived series of G = S 4 is easily seen to be 

s 4 > A 4 > V > {(1)}. 

Our earlier definition of solvability applies only to finite groups, whereas the character¬ 
ization in the proposition makes sense for all groups, possibly infinite. Nowadays, most 
authors define a group to be solvable if its derived series reaches {1} after a finite number 
of steps; with this new definition, every abelian group is solvable, whereas it is easy to see 
that abelian groups are solvable in the sense of the original definition if and only if they are 
finite. In Exercise 5.38 on page 287, the reader will be asked to prove, using the criterion 
in Proposition 5.59, that subgroups, quotient groups, and extensions of solvable groups are 
also solvable (in the new, generalized, sense). 

There are other interesting classes of groups defined in terms of normal series. One of 
the most interesting such consists of nilpotent groups. 
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Definition. The descending central series of a group G is 

G = yi(G) > 72 (G) > • •• , 

where y,+i(G) = [y, (G), G], A group G is called nilpotent if the lower central series 
reaches {1}; that is, if y„(G) = {1} for some n. 

Note that 72 (G) = G', but the derived series and the lower central series may differ 
afterward; for example, 73 (G) = [G\ G] > G l2 \ with strict inequality possible. 

Finite nilpotent groups can be characterized by Proposition 5.39; they are the groups 
that are direct products of their Sylow subgroups, and so one regards finite nilpotent groups 
as generalized p-groups. Examples of nilpotent groups are UT(n, F ? ), UT(n, Z) (unitrian- 
gular groups over Z), the Frattini subgroup 0(G) (defined in Exercise 5.46 on page 288) 
of a finite group G, and certain automorphism groups arising from a normal series of a 
group. We can prove results, for infinite nilpotent groups as well as for finite ones, such as 
those in Exercise 5.47 on page 288; Every subgroup and every quotient of a finite nilpotent 
group G is again nilpotent; if G/Z(G) is nilpotent, then so is G; every normal subgroup 
H intersects Z(G) nontrivially. 

Exercises 


5.33 Let p be a prime and let G be a nonabelian group of order p 3 . Prove that Z(G) = G'. 

Hint. Show first that both subgroups have order p. 

5.34 Prove that if H is a subgroup of a group G and G' < H, then H <1 G. 

Hint. Use the correspondence theorem. 

5.35 (i) Prove that (S n )' = A n for n = 2, 3. 4 [see Example 5.58(iii) for n >5], 

(ii) Prove that (GL(«, k)Y < SL(n, k). (The reverse inclusion is also true; see Exercise 5.56 
on page 296 for the case n = 2.) 

5.36 If G is a finite group and 

G = Gq > G\ > • • • > Gn = {1} 

is a normal series, prove that the order of G is the product of the orders of the factor groups: 

n —1 

|G|= Y[\Gi/G i+l \. 
i =0 

5.37 Prove that any two finite solvable groups of the same order have the same composition factors. 

5.38 Let G be an arbitrary, possibly infinite group. 

(i) Prove that if H < G, then H < G ( !) for all i. Conclude, using Proposition 5.59, that 
every subgroup of a solvable group is solvable. 

(ii) Prove that if /: G —> K is a surjective homomorphism, then 

/(G (0 ) = K (i) 

for all i. Conclude, using Proposition 5.59, that every quotient of a solvable group is 
also solvable. 
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(iii) For every group G, prove, by double induction, that 

Q(m+n) _ (Q(m)\(n) 

(iv) Prove, using Proposition 5.59, that if H <1 G and both H and G/H are solvable, then G 
is solvable. 

5.39 Let p and q be primes. 

(i) Prove that every group of order pq is solvable. 

Hint. If p = q, then G is abelian. If p < q , then a divisor r of pq for which 
r = 1 mod q must equal 1 . 

(ii) Prove that every group G of order p 2 q is solvable. 

Hint. If G is not simple, use Proposition 4.24. If p > q, then r = 1 mod p forces 
r = 1. If p < q, then r = p~ and there are more than p~q elements in G. 

5.40 Show that the Feit-Thompson theorem—“Every finite group of odd order is solvable,” is 
equivalent to "Every nonabelian finite simple group has even order.” 

Hint. For sufficiency, choose a “least criminal”: a nonsolvable group G of smallest odd order. 
By hypothesis, G is not simple, and so it has a proper nontrivial normal subgroup. 

5.41 (i) Prove that the infinite cyclic group Z does not have a composition series. 

(ii) Prove that an abelian group G has a composition series if and only if G is finite. 

5.42 Prove that if G is a finite group and H <1 G, then there is a composition series of G one of 
whose terms is H. 

Hint. Use Schreier’s theorem. 

5.43 (i) Prove that if S and T are solvable subgroups of a group G and 5 <1 G, then ST is a 

solvable subgroup of G. 

Hint. The subgroup ST is a homomorphic image of S x T. 

(ii) If G is a finite group, define 5(G) to be the subgroup of G generated by all normal solv¬ 
able subgroups of G. Prove that 5(G) is the unique maximal normal solvable subgroup 
of G and that G/5(G) has no nontrivial normal solvable subgroups. 

5.44 (i) Prove that the dihedral groups D 2 „ are solvable. 

(ii) Give a composition series for Di n ■ 

5.45 (Rosset). Let G be a group containing elements x and y such that the orders of x, y, and xy 
are pairwise relatively prime; prove that G is not solvable. 

5.46 (i) If G is a finite group, then its Frattini subgroup, denoted by <F(G), is defined to be 

the intersection of all the maximal subgroups of G. Prove that <t> (G) is a characteristic 
subgroup, and hence it is a normal subgroup of G. 

(ii) Prove that if p is a prime and G is a finite abelian p-group, then $(G) = pG. The 
Burnside basis theorem says that if G is any (not necessarily abelian) finite /;-group, 
then G/<t>(G) is a vector space over F p , and its dimension is the minimum number of 
generators of G (see Rotman, An Introduction to the Theory of Groups, page 124). 

5.47 (i) If G is a nilpotent group, prove that its center Z(G) 7 ^ {1}. 

(ii) If G is a group with G/Z(G) nilpotent, prove that G is nilpotent. 

(iii) If G is a nilpotent group, prove that every subgroup and every quotient group of G is 
also nilpotent. 
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(iv) Let G be a group and let H <1 G. Give an example in which both H and G/H are 
nilpotent and yet G is not nilpotent. 

(v) If G is a finite p-group and if H <1 G, prove that H fl Z(G) ^ (1). (The generalization 
of this result to finite nilpotent groups is true.) 

5.48 Let 21 denote the class of all abelian groups, (ft the class of all nilpotent groups, and © the 
class of all solvable groups. 

(i) Prove that 21 C (ft C ©. 

(ii) Show that each of the inclusions in part (i) is strict; that is, there is a nilpotent group that 
is not abelian, and there is a solvable group that is not nilpotent. 

5.49 If G is a group and g, x e G, write g x — xgx 1 . 

(i) Prove, for all x, y, z e G, that [x, yz] = [x, y][x, z] y and [xy, z] = [y, z] x [x, z], 

(ii) (Jacobi Identity) If x, y, z € G are elements in a group G, define 

[x, y, z] = [x, [y,z]]. 

Prove that 

[x, y -1 , z\ y [y, z _1 , x] z [z, x -1 , y] x = 1 . 

5.50 If H, K, L are subgroups of a group G, define 

[ H , K, L] = ({[h,k,l] \ h e H.k e K.l e L}). 


(i) Prove that if [H, K, L] = {1} = [A - , L, H], then [L, H, K] = {1). 

(ii) ( Three subgroups lemma) If N <1 G and [H. K , L \\K, L, H \ < ;V, prove that 

[L, H, K] < N. 

(iii) Prove that if G is a group with G = G / , then G/Z(G) is centerless. 

Hint. If jt : G -*■ G/Z(G) is the natural map, define f 2 (G) = n _1 (Z(G/Z(G))). 
Use the three subgroups lemma with L = f 2 (G) and H = K = G . 

(iv) Prove, for all i, j, that [y;(G), /j(G)] < y,+y(G). 


5.4 Projective Unimodular Groups 


The Jordan-Holder theorem associates a family of simple groups to every finite group, and 
it can be used to reduce many problems about finite groups to problems about finite simple 
groups. This empirical fact says that a knowledge of simple groups is very useful. The 
only simple groups we have seen so far are cyclic groups of prime order and the alternating 
groups A„ for u >5. We will now show that certain finite groups of matrices are simple, 
and we begin by considering some matrices that will play the same role for 2 x 2 linear 
groups as the 3-cycles played for the alternating groups. 
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Definition. A transvection 6 over a field k is a matrix of the form 
Bn(r)= q j or B 2l (r) = * ® , 

where r e k and r ^ 0. 

Let A be a 2 x 2 matrix. It is easy to see that B\ 2 (r )A is the matrix obtained from A by 
replacing Row(l) by Row(l) + rRow(2), and that B 2 \ (r)A is the matrix obtained from A 
by replacing Row(2) by Row(2) + rRow(l). 



det(A) = det(Z>) = d. 

As the inverse of a transvection is also a transvection, we have A = W~ l D, which is the 
factorization we seek. • 

Recall that SL(2, k ) is the subgroup of GL(2, k) consisting of all matrices of determi¬ 
nant l . 6 7 If k is a finite field, then k = F ? , where q — p n and p is a prime; we may denote 
GL(2, F ? ) by GL(2, q) and, similarly, we may denote SL(2, F^) by SL(2, q). 

6 Most group theorists define a 2 x 2 transvection as a matrix that is similar to B 12(f) or B 2 i(r) [that is, a 
conjugate of B 12(f) or B 2 i (r) in GL( 2 , k)\. The word transvection is a synonym for transporting, and its usage 
in this context is probably due to E. Artin, who gives the following definition in his book Geometric Algebra: “An 
element r € GL(V), where V is an //-dimensional vector space, is called a transvection if it keeps every vector 
of some hyperplane H fixed and moves any vector x € V by some vector of H: that is, z(x) — x € Hi' In our 
case, l>\ 2 (r) fixes the “x-axis” and B 2 [(r) fixes the “y-axis.” 

7 GL abbreviates general linear and SL abbreviates special linear. 
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Proposition 5.61. 

(i) Ifk is afield, then SL(2, k) is generated by transvections. 

(ii) Ifk is afield, then GL(2, A;)/SL(2, k) = k x , where k x is the multiplicative group of 
nonzero elements ofk. 

(iii) Ifk — F ? , then 

|SL(2, F 9 )| = (q + l)q{q — 1). 

Proof, (i) If A e SL(2, k), then Lemma 5.60 gives a factorization A — UD, where U is 
a product of transvections and D — diag{l, d\, where d = det(A). Since A e SL(2, k), 
we have det(A) = 1, and so A = U. 

(ii) If a e k x , then the matrix diag{ 1, a} has determinant a, hence is nonsingular, and so 
the map det: GL(2, k) —> k x is surjective. The definition of SL(2, k) shows that it is the 
kernel of det, and so the first isomorphism theorem gives the result. 

(iii) If H is a normal subgroup of a finite group G, then Lagrange’s theorem gives |//| = 

| G | /| G/H |. In particular, 

|SL(2,F,)| = |GL(2,F,)|/|F*|. 

But | GL(2, F ? )| = (q 2 — 1 )(q 2 — q), by Theorem 5.46, and |F*| = q — 1. Hence, 
|SL(2, F ? )| — (q + l)q(q — 1). . 

We now compute the center of these matrix groups. If V is a two-dimensional vector 
space over k, then we proved, in Proposition 3.108, that GL(2, k) = GL( V), the group of 
all nonsingular linear transformations on V. Moreover, Proposition 3.109(i) identifies the 
center with the scalar transformations. 


Proposition 5.62. The center of SL(2, k), denoted by SZ(2. k), consists of all scalar 
f a ol 


matrices 


0 a 


with a 2 — 1 . 


Remark. Here we see that SZ = SL fl Z(GL), but it is not true in general that if H < G, 
then Z(H) = 110 Z(G ) (indeed, this equality may not hold even when H is normal). We 
always have H fl Z(G) < Z(H), but the inclusion may be strict. For example, if G — .S';, 
and H = A 3 = I 3 , then Z(A 3 ) = A 3 while A 3 fl ZiSi) — {1}. ◄ 

Proof. It is more convenient here to use linear transformations than matrices. Assume that 
T e SL(2, k ) is not a scalar transformation. Therefore, there is a nonzero vector v e V 
with Tv not a scalar multiple of v. It follows that the list v. Tv is linearly independent 
and, since dim(V) = 2, that it is a basis of V. Define S : V —> f by S(r) = w and 
S(Tv ) = v + I'v. Notice, relative to the basis v, Tv, that S has matrix Il\ 2 i \), so that 
det(S) = 1. Now T and S do not commute, for TS(v ) = Tv while ST(v) — v + Tv. It 
follows that the center must consist of scalar transformations. In matrix terms, the center 
consists of scalar matrices A = diagja, a}, and a 2 = det(A) = 1. • 



292 


Groups II Ch. 5 


Definition. The projective unimodular 8 group is the quotient group 

PSL(2, k) = SL(2, k)/ SZ(2, k). 

Note that if c 2 — 1, where c is in a field k , then c = ±1. If k = ¥ q , where q is a power 
of 2, then F (/ has characteristic 2, so that c 2 — 1 implies c = 1. Therefore, in this case, 
SZ(2, F ? ) = {/} and so PSL(2, F 2 «) = SL(2, F 2 »). 


Proposition 5.63. 


|PSL(2,F,)| 


|\(q + 1 )q(q ~ D 

\{q + 1 )q(q - 1) 


ifq = p n and p is an odd prime', 

ifq = 2". 


Proof. Proposition 5.61(iii) gives |PSL(2, F ? )| = (q + 1) q(q — 1)/|SZ(2, F ? )| and. 
Proposition 5.62 gives 

|SZ(2,F,)| = \{a eF, : a 2 = 1}|. 

Now F* is a cyclic group of order q — 1, by Theorem 3.30. If q is odd, then q — 1 is 
even, and the cyclic group F* has a unique subgroup of order 2; if q is a power of 2, then 
we noted, just before the statement of this proposition, that SZ(2, ¥ q ) = {/}. Therefore, 
\SZ(2,q) \ = 2 if q is a power of an odd prime, and \ SZ(2, q)\ = 1 if q is a power of 2. • 

We are now going to prove that the groups PSL(2, F ? ) are simple for all prime pow¬ 
ers q > 4. As we said earlier, the transvections will play the role of the 3-cycles (see 
Exercise 2.91 on page 113). 


Lemma 5.64. If H is a normal subgroup of SL(2, F ? ) containing a transvection B\i{r) 
or B 2 \{r), then H — SL(2, F^). 

Proof. Note first that if 

£ ' = [? o 1 ]' 

then det((/) = 1 and U e SL(2, F ? ); since H is a normal subgroup, U Bi 2 (r )(/ -1 also lies 
in H. But UBi2(r)U~ l = B 2 i(— r), from which it follows that H contains a transvection 
of the form Bi 2 (r) if and only if it contains a transvection of the form fi 2 i(— r). Since 
SL is generated by the transvections, it suffices to show that every transvection B^ir) lies 
in H. 

The following conjugate of Bnir) lies in H because H is normal: 


a 

p ' 

'1 r 


~P 


'i 

ra 2 

_0 

a~ l 

0 1_ 

0 

a 


0 

1 


= B l 2 (ra 2 ). 


Define 

G = {0}U{u e¥ q : B n (u) e H }. 

8 A matrix is called unimodular if it has determinant 1. The adjective projective arises because this group turns 
out to consist of automorphisms of a projective plane. 
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We have just shown that ra 2 e G for all a e F ? . Tt is easy to check that G is a subgroup of 
the additive group of and, hence, it contains all the elements of the form u = r(a 2 — f J > 2 ), 
where a, f e k. We claim that G = ¥ q , which will complete the proof. 

If q is odd, then each w e F g is a difference of squares: 

w = [3(w + D] 2 -[i(w-i)] 2 . 

Hence, if it e ¥ q , there are a, ft e ¥ q with r~ l u — a 2 — f 2 . and so u — r(a 2 — /l 2 ) e G; 
therefore, G — ¥ q . If q — 2'", then the function u m* u 2 is an injection F g -> F g (for 
if u 2 — v 2 , then 0 — u 2 — v 2 = (u — i>) 2 , and u = v). It follows from Exercise 1.58 
on page 36 (an injection from a finite set to itself must be a bijection) that this function is 
surjective, and so every element u has a square root in F ? . In particular, there is a e F g 
with r~ l u = a 2 , and u — ra 2 e G. • 

We need a short technical lemma before giving the main result. 


Lemma 5.65. Let H be a normal subgroup of SL(2, F (/ ). If A e H is similar to 


R = 


a 

V 


P 

8 


where R e GL(2, F^), then there is u e¥ q so that H contains 

a u ~ 1 ft 

uy 8 


Proof. By hypothesis, there is a matrix P e GL(2, F (/ ) with R — PAP~ l . There is a 
matrix U € SL and a diagonal matrix D = diagjl, u) with P~ 1 = UD, by Lemma 5.60. 
Therefore, A — UDRD~ l U~ l \ since H <] SL, we have DRD~ l = U~ l AU e H. But 


DRD 


1 _ 

ri oi 

a f 


'1 0 ' 


a u 1 ft 


i 

3 

O 

Y 


0 u — 1 


uy 8 




The next theorem was proved by C. Jordan in 1870 for q prime. In 1893, after F. Cole 
had discovered a simple group of order 504, E. H. Moore recognized Cole’s group as 
PSL(2, F«), and he then proved the simplicity of PSL(2, F ? ) for all prime powers q > 4. 
We can define PSL(m, ¥ (l ) for all m > 3 as SL(m, ¥ q )/SZ(m, F ? ), and Jordan proved, for 
all m > 3, that PSL(ot, F ; ,) is simple for all primes p. In 1897, L. E. Dickson proved that 
PSL(m, F ? ) is simple for all prime powers q. 

We are going to use Corollary 3.101: Two n x n matrices A and B over a field k are 
similar (that is, there exists a nonsingular matrix P with B = PAP~ l ) if and only if they 
both arise from a single linear transformation <p: k" -» k" relative to two choices of bases 
of k". Of course, two nonsingular n x n matrices A and B over a field k are similar if and 
only if they are conjugate elements in the group GL(n, k). 
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Theorem 5.66 (Jordan-Moore). The groups PSL(2, ¥ q ) are simple for all prime pow¬ 
ers q > 4. 


Remark. By Proposition 5.63, |PSL(2, F 2 )I = 6 and |PSL(2, F 3 )| = 12, so that neither 
of these groups is simple. 

It is true that PSL(2, k) is a simple group for every infinite field k. ■* 


Proof. It suffices to prove that a normal subgroup H of SL(2, F (/ ) that contains a matrix 
not in the center SZ(2, F ? ) must be all of SL(2, F ? ). 

Suppose, first, that H contains a matrix 


where a f ±1; that is, or 1. If B = If] ( 1), then H contains the commutator 
BAB~ l A~ l — B 2 \G — a -2 ), which is a transvection because 1 — a ~ 2 0. Therefore, 

H — SL(2, F ? ), by Lemma 5.64. 

To complete the proof, we need only show that H contains a matrix whose top row is 
[a 0], where a f ±1. By hypothesis, there is some matrix M e H that is not a scalar 
matrix. Let q>\ k 2 —> k 2 be the linear transformation given by (p{v ) = Mv, where v is a 
2x1 column vector. If cp(v) = c v v for all v, where c v e k, then the matrix [qi] relative 
to any basis of k 2 is a diagonal matrix. In this case, M is similar to a diagonal matrix 
D — diag{a, f}, and Lemma 5.65 says that D e H. Since M f SZ(2, F^), we must have 
a f. But afi — det (M) — 1, and soa / ±1. Therefore, D is a matrix in H of the 
desired form. 

In the remaining case, there is a vector v with <fi(v) not a scalar multiple of v, and we 
saw in Example 3.96(ii) that M is similar to a matrix of the form 

'0 - 1 ' 

1 b 


D 


■ H. 


(the matrix has this form because it has determinant 1). Lemma 5.65 now says that there is 
some nek with 

0 —m _1 

u b 

If T — diag{a, a -1 } (where a will be chosen in a moment), then the commutator 

„2 0 


V = (TDT~ l )D~ l 


u 

2 


ub(a 1 — 1) a 


-2 


e H. 


We are done if a 2 f ±1; that is, if there is some nonzero a e k with « 4 f 1. If q >5, 
then such an element a exists, for the polynomial x 4 — 1 has at most four roots in a field. 
If q —4, then every a e F 4 is a root of the equation x 4 — x, and so a 1 implies o 4 / 1. 
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Only the case q — 5 remains. The entry b in D shows up in the lower left corner 
v — ub(e/- 2 — 1) of the commutator V. There are two subcases depending on whether 
b ^ 0 or b = 0. In the first subcase, choose a = 2 so that a~ 2 — 4 = or and v = 
(4 — 1 )ub — 3 ub ^ 0. Now H contains V 2 — B 21 (—2d), which is a transvection because 
—2v = — 6 ub — 4uh ^ 0. Finally, if b — 0, then D has the form 


Conjugating D by Biiiy) for y e F 5 gives a matrix B\ 2 (y)DB\ 2 _(—y) e H whose top row 
is 


[uy — uy 2 — u 1 ], 


If we choose y — 2u 1 , then the top row is [2 0], and the proof is complete. • 

Here are the first few orders of these simple groups: 

| PSL(2, F 4 )| = 60; 

| PSL(2, F 5 )| = 60; 

| PSL(2, F 7 )| = 168; 

|PSL(2,F 8 )| = 504; 

| PSL(2, F 9 )| = 360; 

| PSL(2, Fi 1 )| = 660. 

It can be shown that there is no nonabelian simple group whose order lies between 60 and 
168. Indeed, these are all the nonabelian simple groups of order less than 1000. 

Some of the orders in the table, namely, 60 and 360, coincide with orders of alternating 
groups. There do exist nonisomorphic simple groups of the same order; for example, ,4* 
and PSL(3, F 4 ) are nonisomorphic simple groups of order 78 ! = 20,160. The next result 
shows that any two simple groups of order 60 are isomorphic [Exercise 5.53 on page 296 
shows that PSL(2, Fg) = A 6 ]. 

Proposition 5.67. If G is a simple group of order 60, then G = 4> 

Proof. It suffices to show that G has a subgroup H of index 5, for then Theorem 2.88, the 
representation on the cosets of H , provides a homomorphism tp : G —> S 5 with ker cp < H. 
As G is simple, the proper normal subgroup ker tp is equal to {1}, and so G is isomorphic 
to a subgroup of S 5 of order 60. By Exercise 2.94(ii) on page 114, A 5 is the only subgroup 
of S 5 of order 60, and so G = A 5 . 

Suppose that P and Q are Sylow 2-subgroups of G with P( 1 Q f {1}; choose x e PDQ 
with x / 1. Now P has order 4, hence is abelian, and so 4 | \Cg(x)\, by Lagrange’s 
theorem. Indeed, since both P and Q are abelian, the subset P U Q is contained in Cg(x), 
so that \Cg(x)\ > \P U <21 >4. Therefore, |Cg(a)| is a proper multiple of 4 which is also 
a divisor of 60: either |Cg(4')| = 12, |Cg(x)| = 20, or |Cg(^')| = 60. The second case 
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cannot occur lest Co(x ) have index 3, and representing G on its cosets would show that 
G is isomorphic to a subgroup of S 3 ; the third case cannot occur lest x e Z( G) — {1}. 
Therefore, C'c (x) is a subgroup of G of index 5, and we are done in this case. We may 
now assume that every pair of Sylow 2-subgroups of G intersect in {1}. 

A Sylow 2-subgroup P of G has r — [G : Nc;(P)] conjugates, where r — 3, 5, or 
15. Now r / 3 (G has no subgroup of index 3). We show that r — 15 is not possible 
by counting elements. Each Sylow 2-subgroup contains three nonidentity elements. Since 
any two Sylow 2-subgroups intersect trivially (as we saw above), their union contains 
15 x 3 = 45 nonidentity elements. Now a Sylow 5-subgroup of G must have 6 conjugates 
(the number r$ of them is a divisor of 60 satisfying r$ = 1 mod 5). But Sylow 5-subgroups 
are cyclic of order 5, so that the intersection of any pair of them is {1}, and so the union of 
them contains 6 x 4 = 24 nonidentity elements. We have exceeded the number of elements 
in G, and so this case cannot occur. • 

Corollary 5.68. PSL(2, F 4 ) = A 5 = PSL(2, F 5 ). 

Proof. All three groups are simple and have order 60. • 

There are other infinite families of simple matrix groups (in addition to the cyclic groups 
of prime order, the alternating groups, and the projective unimodular groups), as well as 26 
sporadic simple groups belonging to no infinite family, the largest of which is the “mon¬ 
ster” of order approximately 8.08 x 10 53 . We refer the interested reader to the books by E. 
Artin, by R. Carter, and by J. Dieudonne. In fact, all finite simple groups were classified 
in the 1980’s, and an excellent description of this classification can be found in Conway et 
al, ATLAS of Finite Groups. 

Exercises 

5.51 Give a composition series for GL(2, F 5 ) and list its factor groups. 

5.52 (i) Prove that PSL(2, F 2 ) = S 3 . 

(ii) Prove that PSL(2, F 3 ) = A 4 . 

5.53 Prove that PSL(2, F 9 ) = Ag. 

Hint. Let A = [ j "p ] an d B = ^ 1_t j“ j, where u e F 9 satisfies u~ = — 1. If A and B 
represent elements a and b in PSL(2, F 9 ), prove that ab has order 5 and |(a, b) \ = 60. 

5.54 (i) Prove that SL(2, F 5 ) is not solvable. 

(ii) Show that a Sylow 2-subgroup of SL(2, F 5 ) is isomorphic to the quaternions Q. 

(iii) Prove that the Sylow p-subgroups of SL(2, F 5 ) are cyclic if p is an odd prime. Con¬ 
clude, for every prime p, that all the Sylow p-subgroups of SL(2, F 5 ) have a unique 
subgroup of order p. 

5.55 Prove that GL(2, F 7 ) is not solvable. 

5.56 (i) Prove that SL(2, F ? ) is the commutator subgroup of GL(2, F ? ) for all prime powers 

q > 4. 

(ii) What is the commutator subgroup of GL(2, F q) when q = 2 and when q = 3? 
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5.57 Let n be a primitive element of Fg. 

f n 0 


(i) What is the order of A = 

(ii) What is the order of A = 


1 n 

tc 0 0 

1 7T 0 

0 1 71 


considered as an element of GL(2, Fg)? 

considered as an element of GL(3, Fg)? 



'0 

0 

0 


'0 

0 

0 

Hint. Show that if N = 

1 

0 

0 

, then N 2 = 

0 

0 

0 


0 

1 

0 


1 

0 

0 


and N 2 = 0, and use 
the binomial theorem to show that A m = n m l + mn m ~ 2 N + (” ! )7r m ~ 2 N 2 . 


5.5 Presentations 

How can we describe a group? By Cayley’s theorem, a finite group G is isomorphic to a 
subgroup of the symmetric group S n , where n = |G|, and so G can always be defined as a 
subgroup of S„ generated by certain permutations. An example of this kind of construction 
occurs in the following exercise from Carmichael’s group theory book 9 : 

Let G be the subgroup of Sjg generated by the following permutations: 

(a c)(b d)\ (e g)(f h)\ 

(i k)(j t)\ (in o)(n p) 

(a c)(e g)(i k)\ (a b)(c d)(m o)\ 

(e f)(g h)(m n)(o p)\ (i j)(k l). 

Prove that |G| = 256, |G'| = 16, 

a = (i k)(j l)(m o)(n p) e G ', 

but a is not a commutator. 

A second way of describing a group is by replacing S n by GL(«, k) for some n > 2 
and some field k [remember that all the n x n permutation matrices form a subgroup of 
GL(«, k) isomorphic to S n , and so every group of order n can be imbedded in GL(n, k)]. 
We have already described some groups in terms of matrices; for example, we defined 
the quaternion group Q in this way. For relatively small groups, descriptions in terms of 
permutations or matrices are useful, but when n is large, such descriptions are cumbersome. 

We can also describe groups as being generated by elements subject to certain relations. 
For example, the dihedral group Z> 2 » could be described as a group of order 2 n that can be 
generated by two elements a and b , such that a" — 1 = b 2 and hah — a -1 . Consider the 
following definition. 

^Carmichael posed this exercise in the 1930s, before the era of high-speed computers, and he was able to 
solve it by hand. 



298 


Groups II Ch. 5 


Definition. The group of generalized quaternions Q„, where n > 3, is a group of order 
2 " that is generated by two elements a and b such that 

2" -1 i ; 7-1 -1 j 7 2 2"~ 2 

a = 1, bab = a , and b = a 

When n — 3, this is the group Q of order 8. An obvious defect in this definition is that 
the existence of such a group is left in doubt; for example, is there such a group of order 
16? Notice that it is not enough to find a group G = (a, b) in which a 8 = 1, bab~ l — a -1 , 
and b 2 — a 4 . For example, the group G — {a , b) in which a 2 = 1 and b — 1 (which is, of 
course, cyclic of order 2) satisfies all of the equations. 

It was W. von Dyck, in the 1880s, who invented free groups in order to make such 
descriptions rigorous. 

Here is a modern definition of a free group. 

Definition. If A is a subset of a group F, then F is a free group with basis X if, for every 
group G and every function /: X —> G, there exists a unique homomorphism <p\ F -» G 
with q>(x) — f(x) for all x e X. 

F 

t ■ ■ , <P 


A 



This definition is modeled on a fundamental result in linear algebra. Theorem 3.92, 
which is the reason why it is possible to describe linear transformations by matrices. 

Theorem. Let X — v \,..,, v n be a basis of a vector space V. If W is a vector space 

and mi . u n is a list in W, then there exists a unique linear transformation T : V —> W 

with T(i)j ) = Uj for all i. 

We may draw a diagram of this theorem after we note that giving a list u\,... ,u n of 
vectors in W is the same thing as giving a function /: X —> W, where /(v,) = m,-; after 
all, a function /: X —> W is determined by its values on V/ e X. 

V 


w 

If we knew that free groups exist, then we could define Q„ as follows. Let F be 
the free group with basis X = {v, y), let R be the normal subgroup of F generated by 
|x 2 " , yxy~ 1 x, y~ 2 x 2 " }, and define Q„ = F/R. It is clear that F/R is a group gener¬ 

ated by two elements a — xR and b — yR that satisfy the relations in the definition; what 
is not clear is that F/R has order 2", and this needs proof (see Proposition 5.80). 
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The first question, then, is whether free groups exist. The idea of the construction is 
simple and natural, but checking the details is a bit fussy. We begin by describing the 
ingredients of a free group. 

Let X be a nonempty set, and let X~ l be a disjoint replica of X\ that is, X and X -1 
are disjoint and there is a bijection X —> X , which we denote by x i-> x _l . Define the 
alphabet on X to be 

zur 1 . 

If n is a positive integer, we define a word on X of length n > 1 to be a function 
w: {1, 2-> X U X~ l . In practice, we shall write a word w of length n as fol¬ 
lows: if w(i) = x . 1 , then 

e l e n 

w = X\ ■■■x n , 

where x, e X and e, — ±1. The length n of a word w will be denoted by w |. For example, 
|xx _1 | —2. The empty word , denoted by 1, is a new symbol; the length of the empty word 
is defined to be 0 . 

The definition of equality of functions reads here as follows. If u — x[' • • ■ x e " and 
v = yf l ■ ■ ■ y^'j are words, where x,-, yj e X for all i, j. then u — v if and only if m = n, 
Xj = yj , and e; = dj for all i; thus, every word has a unique spelling. 

Definition. A subword of a word w = Xj 1 • • • x e " is either the empty word or a word of 
the form u — x e / ■ ■ ■ x e *, where 1 < r < s < n. The inverse of a word w = x[' • • • x e " is 

— 1 ~ e n —€\ 

w = X n •• -X x . 

It follows that (w -1 ) -1 = w for every word w. 

The most important words are reduced words. 


Definition. A word w on X is reduced if w — 1 or if w has no subwords of the form 
xx _1 orx _ 1 x, where x e X. 

Any two words on X can be multiplied. 


Definition. If u = X|' xj ' 2 • • • x e '' and i> = yf l ■ ■ ■ are words on X, then their juxta¬ 
position is the word 


u v 


e\ e n d 1 
:Xt • • • X n •••y 


If 1 is the empty word, then lv = v and u 1 = it. 


dm 

III ‘ 


Let us try to define a free group as the set of all words on X with operation juxtaposition, 
with the identity being the empty word 1 , and with the inverse of w = x[ ! • • • x' n " being 
w~ l — x n e " ■ ■ ■ Xj e ‘. There is a problem: If x e X. then we want x -1 x = 1, but this is not 
true; x -1 x has length 2, not length 0. We can try to remedy this by restricting the elements 
of F to be reduced words on X; but, even if u and v are reduced, their juxtaposition uv 
may not be reduced. Of course, we can do all the cancellation to convert uv into a reduced 
word, but now it is tricky to prove associativity. We solve this problem as follows. Since 
words such as zx _ 1 xyzx _1 and zyzx~ l . for example, must be identified, it is reasonable to 
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impose an equivalence relation on the set of all the words on X. If we define the elements 
of F to be the equivalence classes, then associativity can be proved without much difficulty, 
and it turns out that there is a unique reduced word in each equivalence class. Therefore, 
we can regard the elements of F as reduced words and the product of two elements as their 
juxtaposition followed by reduction. 

The casual reader may accept the existence of free groups as just described and proceed 
to Proposition 5.73 on page 304; here are the details for everyone else. 

Definition. Let A and B be words on X , possibly empty, and let w = AB. An elementary 
operation is either an insertion , changing w — AB to Aaa~ l B for some a e X U A -1 , or 
a deletion of a subword of w of the form aa~ l , changing w — Aaa~ l B to AB. 


Definition. We write 

w —»■ w' 

to denote w' arising from w by an elementary operation. Two words u and v on X are 
equivalent , denoted by u ~ v, if there are words u — w\, w 2 , ..., w n = v and elementary 
operations 

U — Wi —> W2 —»■••• —»■ W„ — V. 

Denote the equivalence class of a word w by [w]. 

Note that xx _1 ~ 1 andx _1 x ~ 1; that is, [xx _1 ] = [1] = [x _ 1 x]. 

We construct free groups in two stages. 

Definition. A semigroup is a set having an associative operation; a monoid is a semigroup 
S having an identity element 1; that is. Is = s — si for all s e S. If S and 5' are 
semigroups, then a homomorphism is a function /; S —> S' such that f(xy ) = /(x)/(>’); 
if S and S' are monoids, then a homomorphism f: S —> S' must also satisfy /(1) = 1. 

Of course, every group is a monoid, and a homomorphism between groups is a homo¬ 
morphism of them qua monoids. 

Example 5.69. 

(i) The set of natural numbers N is a commutative monoid under addition. 

(ii) A direct product of monoids is again a monoid (with cooordinatewise operation). In 
particular, the set N" of all n -tuples of natural numbers is a commutative additive monoid. 


Here is an example of a noncommutative monoid. 

Lemma 5.70. Let X be a set, and let W(X) be the set of all words on X [if X = 0, then 
W(X) consists of only the empty word]. 

(i) W(X) is a monoid under juxtaposition. 
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(ii) If u ~ if and v ~ v', then uv ~ if v'. 

(iii) If G is a group and f: X -> G is a function, then there is a homomorphism 
f: W(X) —> G extending f such that w ~ w' implies ffw) — f{w ') in G. 

Proof, (i) Associativity of juxtaposition is obvious once we note that there is no cancel¬ 
lation in W(X). 

(ii) The elementary operations that take u to if, when applied to the word uv, give a chain 
taking uv to u'v, the elementary operations that take v to v', when applied to the word if v, 
give a chain taking u'v to u'v'. Hence, uv ~ u'v'. 

(iii) If w = x • • • x e f, then define 

f(w) = f(x 1 ) ei f(x 2 ) e2 ---f(x n ) e ". 

That w has a unique spelling shows that / is a well-defined function, and it is obvious that 
/: W(X) -» G is a homomorphism. 

Let vu ~ uf. We prove, by induction on the number of elementary operations in a chain 
from xv to w', that f(w) = f(w' ) in G. Consider the deletion w = Aaa~ l B —> AB, 
where A and B are subwords of w . That / is a homomorphism gives 

f(Aaa~ l B) = f (A) f (a) f (a)~ l f (B). 

But 

f (A) f (a) f (a) -1 f (B) = f(A)f(B) in G, 

because there is cancellation in the group G, so that / (Aaa~ l B) — f(AB). A similar 
argument holds for insertions. • 

The next proposition will be used to prove that each element in a free group has a normal 
form. 

Proposition 5.71. Every word w on a set X is equivalent to a unique reduced word. 

Proof. If X = 0, then there is only one word on X, the empty word 1, and 1 is reduced. 

If X ^ 0, we show first that there exists a reduced word equivalent to w. If w has no 
subword of the form aa~ l , where a e XU X~ l , then w is reduced. Otherwise, delete the 
first such pair, producing a new word w\, which may be empty, with |wi| < |w|. Now 
repeat: If w i is reduced, stop; if there is a subword of w i of the form aa~ l , then delete it, 
producing a shorter word w 2 . Since the lengths are strictly decreasing, this process ends 
with a reduced word that is equivalent to w. 

To prove uniqueness, suppose, on the contrary, that u and v are distinct reduced words 
and there is a chain of elementary operations 

U = uq —> W2 ->•••—> w n — v; 

we may assume that n is minimal. Since u and v are both reduced, the first elementary 
operation is an insertion, while the last elementary operation is a deletion, and so there must 
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be a first deletion, say, Wi —> Wj+ \. Thus, the elementary operation w,_i -> w,- inserts 
aa~ l while the elementary operation Wj —»■ uv+i deletes /?/? -1 , where a,b e X U X~ l . 

There are three cases. If the subwords aa~ l and bb~ l of w; coincide, then w;_i = 
utj+i, for Wi+ 1 is obtained from w,_i by first inserting aa~ l and then deleting it; hence, 
the chain 

U = W\ —*■ W 2 —*■••• —*■ Wi -1 = Wj +1 —»■•••—»■ w n — v 

is shorter than the original shortest chain. The second case has aa~ l and bb~ l overlapping 
subwords of w,; this can happen in two ways. One way is 

wi = Aaa~ l b~ l C, 

where A, C are subwords of ?/;, and a~ l = b\ hence, a — h~ 1 and 

Wi — Aaa~ l aC. 

Therefore, w,_i = AaC, because we are inserting aa _1 , and = AaC , because we 
are deleting bb~ l = a~ x a. Thus, w,_i = Wi+ 1 , and removing Wi gives a shorter chain. 
The second way an overlap can happen is w, = Aa~ l aa~ l C, where b~ l = a. As in the 
first way, this leads to w,_ j = Wj+\. 

Finally, suppose that the subwords aa~ l and bb~ l do not overlap: 

Wi — A r aa~ [ A"bb~ l C and Wj + 1 = A'aa~ { A"C. 

Now bb~ l became a subword of Wj by an earlier insertion of either bb~ l or b~ l b to 
some word Wj-\ — XY with j < i ; that is, Wj-\ -> wj, where Wj — Xbb~ l Y or 
Wj — Xb~ l bY. In the first instance, the subchain w/_i ->•••-> Wi+i looks like 

XY -» Xbb~ l Y ->-» Abb~ l C -» A'aa~ x A"bb~ l C -> A'a a - { A"C, 

where A — A'A". But we can shorten this chain by not inserting bb~ l : 

XY -*- > AC A'aa~ l A"C. 

The only ways the deletion of bb~ l can occur in the second instance is if, in w/-\ — XY , 
we have X = X'b or Y = b~ l Y'. If X — X'b , then wj-\ = X'bY and Wj = X'bb~ l bY 
(and it will be the subword bb~ l that will be deleted by the elementary operation Wi —» 
w,-l_i). As with the first possibility, we do not need the insertion. In more detail, the chain 

X'bY -* X'bb~ l bY ->-* Abb~ l C A’aa~ l A"bb~ l C -* A'aa~ l A"C, 

where the processes X' —»■ A and bY —> C involve insertions only, can be shortened by 
removing the insertion of/? -1 /?: 

X'bY ->-» AC -> A'aa~ { A"C. 

The second case, Y = b~ 1 Y\ is treated in the same way. Therefore, in all cases, we are 
able to shorten the shortest chain, and so no such chain can exist. • 
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Theorem 5.72. If X is a set, then the set F of all equivalence classes of words on X 
with operation [m][i>] = [uv] is a free group with basis {[x] : x e X}. 

Moreover, every element in F has a normal form: For each [u ] e F, there is a unique 
reduced word w with [m] = [u>]. 

Proof. If X — 0, then W(0) consists only of the empty word 1, and so F = {1}. The 
reader may show that this is, indeed, a free group on 0 . 

Assume now that X 0 . We have already seen, in Lemma 5.70(h), that juxtaposition 
is compatible with the equivalence relation, and so the operation on F is well-defined. The 
operation is associative, because of associativity in W(X): 

[m]([u][w]) = [m][uw] 

= [m(vw)] 

= [(hv)w] 

= [uv][w] 

= ([«][«])[»]. 

The identity is the class [1], the inverse of [«;] is [u; -1 ], and so F is a group. 

If [w] e F, then 

[wi = [ x e l 1 ---x e n "] = [ x e l l ][ x e 2 2 ]---[x e n "l 

where e; = ±1 for all i, so that F is generated by X (if we identify each x e X with |x]). 
It follows from Proposition 5.71 that for every [w], there is a unique reduced word it with 
[w] = [m]. 

To prove that F is free with basis X, suppose that /: X —> G is a function, where G is 
a group. Define <p: F -> G by 

<P- [*?][*?] ' ' ' [-<"] ^ f(xff'f(X2) e2 ■ ■ ■ f(Xnf", 

where Xj 1 • • • x e " is reduced. Uniqueness of the reduced expression of a word shows that 
<p is a well-defined function (which obviously extends /). Take note of the relation of <p to 
the homomorphism /: W(X) —»• G in Lemma 5.70: When w is reduced, 

<p([w]) = f(w). 

It remains to prove that tp is a homomorphism (if so, it is the unique homomorphism 
extending /, because the subset X generates F). Let [u], [t>] e F, where u and v are 
reduced words, and let u v ~ w, where w is reduced. Now 

(p([u][v]) = tp{[w]) = f(w), 

because w is reduced, and 

<p([ui = /(«)/( V), 

because u and v are reduced. Finally, f(u)f(v) = f(w), by Lemma 5.70(iii), and so 
(p([u\[vf) = (p{[u\)(p{[vf). • 
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Remark. There is a less fussy proof of the existence of the free group F with basis a 
given set X , due to M. Barr (see Montgomery-Ralston, Selected Papers in Algebra). We 
have not given this proof here because it does not describe the elements of F, and this 
description is often needed when using free groups. -4 

We have proved, for every set X, that there exists a free group that is free with basis X. 
Moreover, the elements of a free group F on X may be regarded as reduced words and the 
operation may be regarded as juxtaposition followed by reduction; brackets are no longer 
used, and the elements [ w ] of F are written as w. 

The free group F with basis X that we have just constructed is generated by X. Are any 
two free groups with basis X isomorphic? 

Proposition 5.73. 

(i) Let X\ be a basis of a free group F\ and let Xi be a basis of a free group Fi. If there 
is a bijection f: X\ —>■ X 2 , then there is an isomorphism <p: F\ —> F 2 extending f. 

(ii) If F is a free group with basis X, then F is generated by X. 


Proof (i) The following diagram, in which the vertical arrows are inclusions, will help 
the reader follow the proof. 


n * 

/ 

Xi^^X 2 . 

r l 

We may regard / as having target F2, because A2 C Fr, since F\ is a free group with basis 
X 1 , there is a homomorphism <p \: F\ —> Fi extending /. Similarly, there exists a homo¬ 
morphism (p2 '• F2 —> F] extending f~ l . It follows that the composite <p2<pi : Fi —> F\ is 
a homomorphism extending lx- But the identity 1 also extends lx. so that uniqueness 
of the extension gives ^ 2^1 = 1 f, . In the same way, we see that the other composite 
cp\q>2 = If,, and so <p\ is an isomorphism. 

(ii) Let there be a bijection /: X\ —> X for some set X\ . If F\ is the free group with basis 
X\ constructed in Theorem 5.72, then X \ generates F\ . By part (i), there is an isomorphism 
(p\ F\ —> F with ip(X 1 ) = X. But if X[ generates F\ , then <p{X 1 ) generates im^>; that is, 
X generates F. • 

There is a notion of rank for free groups, but we must first check that all bases in a free 
group have the same number of elements. 


Lemma 5.74. If F is a free group with basis X — x \,..., x n , then F / F' is a free abelian 
group with basis X' — x\F', ..., x n F', where F' is the commutator subgroup of F. 
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Proof. We begin by noting that X' generates F/F'; this follows from Proposition 5.73(h), 
which says that X generates F. We prove that F/F’ is a free abelian group with basis X' 
by using the criterion in Proposition 5.12. Consider the following diagram. 


F --->■ F/F' 



Here, G is an arbitrary abelian group, p and p' are inclusions, n is the natural map, v: i h> 
xF', and y: X’ —> G is a function. Let g: F -> G be the unique homomorphism with 
gp — yv given by the definition of free group (for yv: X —»■ G is a function), and define 
g': F/F' -* G by wF' m* g(w) (g 1 is well-defined because G abelian forces F' < kerg). 
Now g'p' = y, for 

g'p'v = g'np = gp = yv; 

since v is a suijection, it follows that g'p' = y. Finally, g' is the unique such map, for if g" 
satisfies g"p' = y, then g' and g" agree on the generating set X', hence they are equal. • 


Proposition 5.75. Let F be the free group with basis X. If\X\ — n, then every basis of 
F has n elements. 

Proof. By the lemma, F/F' is a free abelian group of rank n. On the other hand, if 
yi,..., y, n is another basis of F, then F/F' is a free abelian group of rank m. By Propo¬ 
sition 5.9, we have m — n. • 

The following definition now makes sense. 

Definition. The rank of a free group F , denoted by rank(F), is the number of elements 
in a basis. 

Proposition 5.73(i) can now be restated: two free groups of finite rank are isomorphic 
if and only if they have the same rank. 

Proposition 5.76. Every group G is a quotient of a free group. 

Proof. Let X be a set for which there exists a bijection /: X -* G (for example, we 
could take X to be the underlying set of G and / = lp), and let F be the free group 
with basis X. There exists a homomorphism <p: F —»• G extending /, and <p is surjective 
because / is. Therefore, G = F/kerip. • 


Let us return to describing groups. 
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Definition. A presentation of a group G is an ordered pair 

G = (X | R), 

where A is a set, R is a set of words on A , and G — F/N, where F is the free group with 
basis X and N is the normal subgroup generated by R, that is, the subgroup generated by 
all conjugates of elements of R. We call the set X generators 10 and the set R relations. 

Proposition 5.76 says that every group has a presentation. 

Definition. A group G is finitely generated if it has a presentation {X \ R) with X finite. 
A group G is called finitely presented if it has a presentation (X \ R) in which both X and 
R are finite. 

It is easy to see that a group G is finitely generated if and only if there exists a finite 
subset A C G with G — (A). There do exist finitely generated groups that are not finitely 
presented (see my book. An Introduction to the Theory of Groups, page 417). 

Remark. There are interesting connections between group theory and algebraic topology. 
If A is a topological space, then its fundamental group 7 Ti(A) is defined to be the set of 
all homotopy classes of continuous functions .S’ 1 —> A, where S 1 is the unit circle. A finite 
simplicial complex is a topological space that can be triangulated in the sense that it is the 
union of finitely many vertices, edges, triangles, tetrahedra, and so forth. We can prove 
that a group G is finitely presented if and only if there is a finite simplicial complex A with 
G = u\ (A). 

Quite often, a group is known only by some presentation of it. For example, suppose 
that A is a simplicial complex containing subcomplexes Y\ and Yi_ such that Y\ U lo = A 
and Y\ fi Y 2 is connected. Then van Kampen’s theorem says that a presentation of 7T i (A) 
can be given if we know presentations of Tt\ (Y \) and tc\{Y f). < 

Example 5.77. 

(i) A group has many presentations. For example, G = If, has presentations 

(x | x 6 ) 

as well as 

(i a , b | a 3 , b 2 , aba~ { ft -1 ). 

A fundamental problem is how to determine whether two presentations give isomorphic 
groups. It can be proved that no algorithm can exist that solves this problem (see Rotman, 
An Introduction to the Theory of Groups, page 469). 

(ii) The free group with basis A has a presentation 

(A | 0). 

A free group is so called precisely because it has a presentation with no relations. -4 

10 The term generators is now being used in a generalized sense, for X is not a subset of G. The subset 
[x /V: x € X] of G = F/N does generate G in the usual sense. 
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A word on notation. Often, we write the relations in a presentation as equations. Thus, 
the relations 

a 3 , b 2 , cibci~\b~\ 

in the second presentation of Ig may also be written 

a 3 = 1 , b~ — 1 , ab — ba. 

If r is a word on i]. x n , we may write r = r(x\,..., x n ). If H is a group and 

hi,..., h n e H, then r(h\,..., h n ) denotes the element in H obtained from r by replacing 
each Xi by hi. 

The next, elementary, result is quite useful; we state only the finitely generated case 
of it. 

Theorem 5.78 (von Dyck’s Theorem). Let a group G have a presentation 

G = (x\, ...,x n | r j, j e J); 

that is, G = F/N, where F is free on {x\,..., x n ] and N is the normal subgroup 
of F generated by all rj = rj(x i, ..., x„). If H = {hi, ..., h n ) is a group and if 
r j (h i, ... ,h n ) = 1 in H for all j e J, then there is a surjective homomorphism G —> H 
with XjN i—>• hi for all i. 

Proof. If F is the free group with basis {xi,..., x n j, then there is a homomorphism 
ip: F -> H with <p (x,-) = h, for all i. Since r)(h i,..., h n ) = 1 in H for all j e J , 
we have r ; e ker cp for all j e J , which implies N < ker tp. Therefore, <p induces a 
(well-defined) homomorphism G — F/N —*■ H with XjN i->- h, for all i. • 

The next proposition will show how von Dyck’s theorem enters into the analysis of 
presentations, but we begin with the construction of a concrete group of matrices. 

Example 5.79. 

We are going to construct a group H n that is a good candidate to be the generalized quater¬ 
nion group Q„ for n > 3 defined on page 298. Consider the complex matrices 



where w is a primitive 2" 1 th root of unity, and let //„ = (A, B) < GL(2, C). We claim 
that A and B satisfy the relations in the definition of the generalized quaternion group. For 
all i > 1, 
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Notice that A and B do not commute; hence, B <£ (A), and so the cosets (A) and B(A) are 
distinct. Since A has order 2" _1 , it follows that 

\H„\ > |(A} U B{A)\ = 2 " _1 + 2 " _1 =2". 

The next theorem will show that | ZZ„ | = 2". a 

Proposition 5.80. For every n > 3, the generalized quaternion group Q„ exists. 

Proof. Let G n be the group defined by the presentation 

G„ = (a,b | a 2 — 1, bab~ l — a ~ l , b 2 — a 2 ). 

The group G„ satisfies all the requirements in the definition of the generalized quaternions 
with one possible exception: We do not yet know that its order is 2". By von Dyck’s 
theorem, there is a surjective homomorphism G„ —> ZZ„, where ZZ„ is the group just 
constructed in Example 5.79. Hence, \G n \ > 2". 

On the other hand, the cyclic subgroup (a) in G n has order at most 2" _1 , because 
a 2 " — 1. The relation bab~ l — a -1 implies that (a) <\ G n = {a , b), so that G n /(a) is 

generated by the image of b. Finally, the relation b 2 — a 2 " " shows that |G„/(a}| < 2. 
Hence, 

\G n \ < \{a)\\G n /(a)\ < 2" _l • 2 = 2". 

Therefore, |G„| = 2", and so G„ = Q„. • 

It now follows that the group ZZ„ in Example 5.79 is isomorphic to Q„. 

In Exercise 2.57 on page 81, we gave a concrete construction of the dihedral group D 2n , 
and we can use that group—as in the proof just given— to give a presentation. 

Proposition 5.81. The dihedral group D 2n has a presentation 

D 2n = (a, b | a n = 1, b 2 — 1, bab — a -1 ). 

Proof. Let Cf,, denote the group defined by the presentation, and let ZH,, be the group 
of order 2 n constructed in Exercise 2.57 on page 81. By von Dyck’s theorem, there is 
a surjective homomorphism /: Cin —■ y Ditu> an d so \Cin\ > 2n. To see that / is an 
isomorphism, we prove the reverse inequality. The cyclic subgroup (a) in C 21 , has order 
at most n , because a 11 — 1. The relation bab~ l = a~ l implies that (a) <1 C 2 « = (a, b), 
so that C 2n /(a) is generated by the image of h. Finally, the relation b 2 = 1 shows that 
\Cm/{a)\ < 2. Hence, 

\C 2n l < \{a)\\C 2n /{a)\ < 2n. 

Therefore, |C 2 h| = 2 n, and so C 2n L D 2n . • 

In Chapter 2, we classified the groups of order 7 or less. Since groups of prime order 
are cyclic, it was only a question of classifying the groups of orders 4 and 6 . The proof 
we gave, in Proposition 2.90, that every nonabelian group of order 6 is isomorphic to S 3 
was rather complicated, analyzing the representation of a group on the cosets of a cyclic 
subgroup. Here is a proof in the present spirit. 
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Proposition 5.82. If G is a nonabelian group of order 6 , then G = S 3 . 

Proof. As in the proof of Proposition 2.90, G must contain elements a and b of orders 
3 and 2, respectively. Now (a) <\ G , because it has index 2, and so either bab~ l — a or 
bab~ l — a~ l . The first possibility cannot occur, because G is not abelian. Therefore, G 
satisfies the conditions in the presentation of D(, = S 3 , and so von Dyck’s theorem gives 
a surjective homomorphism f >6 -> G. Since both groups have the same order, this map 
must be an isomorphism. • 

We can now classify the groups of order 8 . 

Theorem 5.83. Every group G of order 8 is isomorphic to 

D%, Q, Ig, I 4 © I 2 , or I 2 ©l 2 ©l 2 - 
Moreover, no two of the displayed groups are isomorphic. 

Proof. If G is abelian, then the basis theorem shows that G is a direct sum of cyclic 
groups, and the fundamental theorem shows that the only such groups are those listed. 
Therefore, we may assume that G is not abelian. 

Now G cannot have an element of order 8 , lest it be cyclic, hence abelian; moreover, 
not every nonidentity element can have order 2, lest G be abelian, by Exercise 2.26 on 
page 62. We conclude that G must have an element a of order 4; hence, (a) has index 2, 
and so (a) < G. Choose b e G with b f. (a); note that G — (a, b) because (a) has index 
2, hence is a maximal subgroup. Now b 2 e (a), because G/(a) is a group of order 2, and 
so b 2 — a 1 , where 0 < i < 3. We cannot have b 2 = a or b 2 — a 3 — a~ l lest b have order 
8 . Therefore, either 

b 2 = a 2 or b 2 — 1 . 

Furthermore, bab~ l e (a), by normality, and so bab~ l = a or bab~ l = a~ l (for bab~ l 
has the same order as a). Now bab~ x — a says that a and b commute, which implies that 
G is abelian. We conclude that bab~ l — a -1 . Therefore, there are only two possibilities: 

a 4 = 1 , b 2 = a 2 , and bab~ l — a~ l , 

or 

a 4 — 1 , b 2 — 1 , and bab~ { — a~ [ . 

By the lemma, the first equations give relations of a presentation for Q, while Proposi¬ 
tion 5.81 shows that the second equations give relations of a presentation of Z)g. By von 
Dyck’s theorem, there is a surjective homomorphism Q —> G or /fg —> G; as |G| = 8 , 
however, this homomorphism must be an isomorphism. 

Finally, Exercise 2.61 on page 82 shows that Q and D s are not isomorphic (for example, 
Q has a unique element of order 2 while Z >8 has several such elements). • 

The reader may continue this classification of the groups G of small order, say, | G \ < 
15. Here are the results. By Corollary 2.104, every group of order p 2 . where p is a prime, 
is abelian, and so every group of order 9 is abelian; by the fundamental theorem of finite 
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abelian groups, there are only two such groups: I9 and I3 x I3. If p is a prime, then every 
group of order 2 p is either cyclic or dihedral (see Exercise 5.63). Thus, there are only two 
groups of order 10 and only two groups of order 14. There are 5 groups of order 12, two of 
which are abelian. The nonabelian groups of order 12 are Z>i2 = S3 x IE. -44, and a group 
T having the presentation 

T = (a, b | a 6 = 1, b 2 = a 3 = ( ab) 2 )\ 

see Exercise 5.64, which realizes T as a group of matrices. The group 11 T is an example 
of a semidirect product, a construction that will be discussed in Chapter 10. A group of 
order pq, where p < q are primes and q ^ 1 mod p, must be cyclic, and so there is only 
one group of order 15 [see Exercise 10. ll(ii) on page 794], There are 14 nonisomorphic 
groups of order 16, and so this is a good place to stop. 

Exercises 

5.58 Let F be a free group with basis X and let A C X. Prove that if N is the normal subgroup of 
F generated by A, then F/N is a free group. 

5.59 Let F be a free group. 

(i) Prove that F has no elements (other than 1) of finite order. 

(ii) Prove that a free group F is abelian if and only if rank(F) < 1. 

Hint. Map a free group of rank > 2 onto a nonabelian group. 

(iii) Prove that if rank(F) > 2, then Z(F) = {1}, where Z(F) is the center of F, 

5.60 Prove that a free group is solvable if and only if it is infinite cyclic (see page 286). 

5.61 (i) If G is a finitely generated group and n is a positive integer, prove that G has only 

finitely many subgroups of index n. 

Hint. Consider homomorphisms G —> S„. 

(ii) If H and K are subgroups of finite index in a group G, prove that H Cl K also has finite 
index in G. 

5.62 (i) Prove that each of the generalized quaternion groups Q„ has a unique subgroup of order 

2, namely, ( b ~), and this subgroup is the center Z(Q„). 

(ii) Prove that Q„/Z(Q„) = D 2 „-i- 

5.63 If p is a prime, prove that every group G of order 2p is either cyclic or isomorphic to D 2p . 
Hint. By Cauchy’s theorem, G must contain an element a of order p, and (a) <1 G because 
it has index 2. 

5.64 Let G be the subgroup of GL(2, C) generated by 

co 0 1 [0 i 

0 arj |_t 0 

where co = e 2jr '/ 3 is a primitive cube root of unity. 

(i) Prove that G is a group of order 12 that is not isomorphic to A4 or to Di 2 . 

11 The group T is called a dicyclic group of type (2, 2, 3) in Coxeter and Moser, Generators and Relations for 
Discrete Groups, but this terminology in not generally accepted. 
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(ii) Prove that G is isomotphic to the group T on page 310. 

5.65 Prove that every finite group is finitely presented. 

5.66 Compute the order of the group G with the presentation 

G = (ci, b, c, d | bab~ l = ic, bdb ~ 1 = d 2 , c~ l ac — b~, dcd~ l = c 2 , bd = db ). 

5.67 If A is a nonempty set, define f2(A) to be the set of all positive words w on X : that is, £2 (A) 
is the subset of W( A) consisting of all x j 1 • • • x e f " with all e,- = 1. Define a free monoid, and 
prove that Q(X) is the free monoid with basis X. 

5.6 The Nielsen-Schreier Theorem 


We are now going to prove one of the most fundamental results about free groups: Every 
subgroup is also free. This theorem was first proved by J. Nielsen, in 1921, for finitely 
generated subgroups; the finiteness hypothesis was removed by O. Schreier, in 1926, and 
so the theorem is now called the Nielsen-Schreier theorem. Nielsen’s method actually 
provides an algorithm, analogous to Gaussian elimination in linear algebra, which replaces 
a generating set A of a free group F with a basis of (A). 12 In particular, if S is a finitely 
generated subgroup of a free group F, then Nielsen’s algorithm replaces any generating 
set of 5 with a basis of S, thereby proving that S is free. For an exposition of this proof, 
we refer the reader to the book of Lyndon and Schupp, pages 4-13. 

A second type of proof was found by R. Baer and F. Levi in 1933. It uses a connection, 
analogous to the correspondence between Galois groups and intermediate fields, between 
covering spaces A of a topological space X and subgroups of its fundamental group n\ (X ). 
In particular, if A is a graph (a space constructed of edges and vertices), then it can be 
shown that every covering space is also a graph. It turns out that Tt\ (A) is isomorphic to 
a subgroup of 7 Ti(A). Conversely, given any subgroup S < 7ri(A), there exists a covering 
space Ay of A for which jri (Ay) is isomorphic to S. Moreover, tc\(X) is a free group 
whenever A is a graph. Once all these facts are established, the proof proceed as follows. 
Given a free group F, there is a graph A (a “bouquet of circles”) with F = tc\ (A); given a 
subgroup S < F , we know that S = tt\ (Ay). But Ay is also a graph, so that 7Ti(Ay), and 
hence S, is free. There are versions of this proof that avoid topology; for example, there is 
an exposition of such a proof in my book. An Introduction to the Theory of Groups, pages 
377-384. Interesting variations of this idea are due to J.-R Serre, in his book Trees, who 
characterized free groups by their action on trees (trees arise as certain universal covering 
spaces of connected graphs), and by P. J. Higgins, who used groupoids. 

We give A. J. Weir’s proof [“The Reidemeister-Schreier and Kuros Subgroup Theo¬ 
rems,” Mathematika 3 (1956), 47-55] of the subgroup theorem because it requires less 
preparation than the others. The idea arises from a proof of the Reidemeister-Schreier 

'“This theoretical algorithm has evolved into the Schreier—Sims algorithm, an efficient way to compute the 
order of a subgroup H < S„ when a generating set of H is given; it also can determine whether a specific 
permutation lies in H. 
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theorem, which gives a presentation of a subgroup of a group G in terms of a given pre¬ 
sentation of G. 

Definition. Let S be a subgroup of a group G. A transversal l of S in G is a subset of G 
consisting of exactly one element £(Sb) e Sb from every coset Sb , and with £(S) = 1. 

Let F be a free group with basis X, and let S be a subgroup of F. Given a transversal i 
of 5 in F, then for each x e X, both l(Sb)x and I(Sbx) lie in the coset Sbx, and so 

t sb ,x = t(Sb)xl{Sbx)~ l 

lies in S. We are going to prove that if the transversal £ is chosen wisely, then the set of all 
tsb,x that are not 1 form a basis of S, so that S is free. 

Let i be a transversal of a subgroup S of a free group F, let the elements tsh.x be as 
above, and define Y to be the free group on symbols ysb,x so that ysb.x !-»■ tsb.x is a 
bijection. Define <p : Y —> S to he the homomorphism with 

(p: ysb,x >-> t S b,x — £(Sb)xl(Sbx)~ l . 

We begin by defining coset functions F —> Y, one for each coset Sb , which we denote by 
u i-^ u sh . These functions are not homomorphisms, and we define them all simultaneously 
by induction on \u\ >0, where u is a reduced word on X. For all x e X and all cosets Sb, 
define 

I s * =1, x sb — ysb.x, and {x~ l ) sb = {x sbx ~ l T 1 . 

If u — x £ v is a reduced word of length n + 1, where s = ±1 and |u| = n, define 

L, Sb = (; X £ ) Sb V SbX \ 


Lemma 5.84. 

(i) For all u, v e F, the coset functions satisfy (uv) sb — u sb v sbu . 

(ii) For all u e F, (; u~ 1 ) sb = (u shu ') ’. 

(iii) Iftp: Y —>■ S is the homomorphism (p : ysb.x ^ tsb.x = l(Sb)x£(Sbx)~ l , then, for 
all u e F, (p(u sb ) = £(Sb)ul(Sbu)-\ 

(iv) The function 9 : S —> Y, given by 9: u i —> u s , is a homomorphism, and q>9 = I 5 . 

Proof (i) The proof is by induction on \u\, where u is reduced. If \u\ = 0, then u = I 
and (uv) sb — v sb \ on the other hand, = v sb . 

For the inductive step, write u — x £ u>. Then 

(uv) sb = (. x e ) sb (wv) sbxS 

= ( x £ ) sb w sbxe v sbxew 
= {x e ) sb w sbxS v sbu 
= ( x £ w) sb v sbu 
= U sb v sbu . 


(definition of coset functions) 
(inductive hypothesis) 
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(ii) The result follows from 

l = l sb = (U~ 1 u) sb = (.U~ 1 ) sb u sbu ~ 1 . 

(iii) Note that <p does define a homomorphism because Y is the free group with basis 

all ysh.x ■ This proof is also an induction on \u\ >0. First, (p(l sb ) — (p{ 1) = 1, while 
€(5')1€(5'1)- 1 = 1. 

For the inductive step, write u — x £ v. where u is reduced. Then 

qo(u sb ) = ip((x £ v ) sb ) = (p((x £ ) sb v sbxe ) 

= <P«x £ ) sb )cp( v sbxS ) 

= (p{(x £ ) sb )l (Sbx s )v£ (Sbx e v)~ l , 

the last equation following from the inductive hypothesis. There are now two cases, de¬ 
pending on the sign e. If e = +1, then 

(p(u sb ) = l(Sb)xt{Sbx)- l l(Sbx)vt{Sbxv)~ l 
= l(Sb)xvUSbxv)- 1 
= l{Sb)ul{Sbu)~ l . 

If e = — 1, then 

(p{ii sb ) = (p((y sbx -i x )~ l )l{Sbx~ l )vl{Sbx~ l v)~ l 

— (^t(Sbx~ l )xl(Sbx~ l x)~ l ^j £(Sbx~ 1 )vi(Sbx~ l v)~ 1 
= l{Sb)x-H(Sbx- l )-H(Sbx- l )vl(Sbx- l v)- 1 
= e(Sb)x- l vl(Sbx~ l vr l 
= l{Sb)ul{Sbu)- { . 

(iv) For u e S, define 9 : S —>■ Y by 

9 : u i—> u s 

(of course, 9 is the restriction to S of the coset function u u sb when b = 1). Now, if 
u, v e S, then 

9(uv) = (uv) s = u s v Su = u s v s = 9(u)9(v), 

because Su — S when u e S. Therefore, 9 is a homomorphism. Moreover, if u e S, then 
part (iii) gives 

(p9(u) = (p(u s ) = l (S\)ui (Sin) -1 — u. • 
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Corollary 5.85. If S is a subgroup of a free group F and if I is a transversal of S in F, 
then the set of all tsb.x that are distinct from 1 generates S. 

Proof. Since the composite <p9 = l^, the function cp\ Y —> S is surjective; hence, the 
images tsb,x of the generators ysb.x of Y generate mup — S. Of course, we may delete any 
occurrences of 1 from a generating set. • 

The next lemma shows that we have a presentation of S, namely, 

S — ( ysb,x> all x e X. all cosets Sb \ £(Sb) s , all cosets Sb). 

Lemma 5.86. If l is a transversal of S in F, then ker q> is the normal subgroup of Y 
generated by all £(Sb) s . 

Proof. Let N be the normal subgroup of Y generated by all t(Sb) s , and let K — ker <p. By 
Lemma 5.84(iv), 9 : S -> Y is a homomorphism with <p6 — Is (where <p : ysb.x tsb.x 
and 9: u (->• u s ). It follows from Exercise 5.72(ii) on page 318 that K is the normal 
subgroup of Y generated by {y~^ p(y) : y e T}, where p = 9<p. By Lemma 5.84(i), 

yf^ x p(ysb.x) = yfb.x (^(Sb)xl(Sbx)~ l ^j 

= yfbJiSbfx 31 ’ (t(Sbxr l ) Sb '' 

= (ysbJ( Sb f ysb.x) (i(Sbxy i y bx , 


for x sb — ysb.x is part of the definition of the coset function u i-»- u sb . Therefore, 

ysb, x p(ysb,x) - (ysbJW s ysb,x) (nsbxfy 1 , d) 

because Lemma 5.84(ii) gives ( I(Sbx)~ l ) sbx — (t(Sbx) s )~ l . It follows from Eq. (1) that 

_1 o 

ysb <P(ysb.x) ^ N, and so K < N. For the reverse inclusion, Eq. (1) says that £(Sb ) e K 
if and only if I(Sbx) s e K. Therefore, the desired inclusion can be proved by induction 
on \£(Sb)\, and so K = N, as desired. • 

We now choose a special transversal. 

Definition. Let F be a free group with basis X and let S be a subgroup of F. A Schreier 
transversal is a transversal l with the property that if i'(Sb) = x^x^ 2 • ■ ■ x'f is a reduced 
word, then every initial segment x^x^ 2 ■ ■ ■ x^ : . for 1 < k < n, is also in the transversal. 

Lemma 5.87. A Schreier transversal exists for every subgroup S of F. 

Proof. Define the length of a coset Sb, denoted by \Sb\, to be the minimum length of 
the elements sb e Sb. We prove, by induction on | Sb\, that there is a representative 
i(Sb) e Sb such that all its initial segments are representatives of cosets of shorter length. 
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Begin by defining l(S) — 1. For the inductive step, let Sz .j = n + 1 and let ux £ e Sz, 
where e = ±1 and \ux e \ = n + 1. Now |Sm| = n, for if its length were m < n, it 
would have a representative v of length m, and then vx £ would be a representative of Sz 
of length < n + 1. By induction, b — l(Su) exists such that every initial segment is also a 
representative. Define KSz) — bx £ . • 

Here is the result we have been seeking. 

Theorem 5.88 (Nielsen-Schreier). Every subgroup S of a free group F is free. In fact, 
if X is a basis of F and if I is a Schreier transversal of S in F, then a basis for S consists 
of all tsb.x — £(Sb)x£(Sbx)~ 1 that are not 1. 

Proof. Recall that S = Y/ K, where Y is the free group with basis all symbols ysb.x and 
K = ker q>\ by Lemma 5.86, K is equal to the normal subgroup generated by all t(Sb) s . 
By Exercise 5.58 on page 310, it suffices to show that K is equal to the normal subgroup T 
of Y generated by all special ysb.x : that is, by those ysb,x for which <p (ysb.x) — tsb.x = L 
Clearly, T < K — ker <p, and so it suffices to prove the reverse inclusion. We prove, by 
induction on |£(Su)|, that £(Sv) s is a word on the special ysb.x■ If |£(Su)| = 0, then 
USv) — US) — 1, which is a word on the special ysb.x■ If |€(5n)| > 0, then £(Sv) — 
ux £ , where e = ±1 and \u\ < |£(Su)|. Since £ is a Schreier transversal, u is also a 
representative: u = £(Su). By Lemma 5.84(i), 

USv) s = u s (x £ ) s r 

By induction, u s is a word on the special ysb.x, an d hence u s e T. 

It remains to prove that ( x £ ) Su is a word on the special ysb.x■ If e = +1, then ( x e ) Su — 
x Su — ysu.x ■ But £{Sux) = ux , because v = ux and £ is a Schreier transversal, so that 

<p(ysu,x ) = tsu.x — £(Su)x£(Sux)~ 1 = ux(ux)~ l = 1. 

Therefore, ysu.x is special and x s " lies in T. If s = —1, then the definition of coset 
functions gives 

(x~ l ) Su = (X s "*- 1 )- 1 = (y Sux - 1 ,,)" 1 . 

Hence, 

^((x -1 ) 5 ") = = [USux-^xUSux^x)]- 1 = [USux^xUSu)]- 1 . 

Since £ is a Schreier transversal, we have £(Su) = u and £(Sux~ l ) — £(Sv) — v — ux _1 . 
Hence, 

(p((x~ l ) Su ) = [(m,v _i )xm _1 ] _I = 1. 

Therefore, y 5HX -i x is special, (x _1 ) s “ e T. and the proof is complete. • 

Here is a nice application of the Nielsen-Schreier theorem. 
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Corollary 5.89. Let F be a free group, and let u, v e F. Then u and v commute if and 
only if there is z € F with u, v e (z)■ 

Proof Sufficiency is obvious; if both u, v e (z), then they lie in an abelian subgroup, and 
hence they commute. 

Conversely, the Nielsen-Schreier theorem says that the subgroup (u, v) is free. On the 
other hand, the condition that u and v commute says that (u, v) is abelian. But an abelian 
free group is cyclic, by Exercise 5.59(ii) on page 310: therefore, (u, v) = Z, as desired. • 

The next result shows, in contrast to abelian groups, that a subgroup of a finitely gener¬ 
ated group need not be finitely generated. 

Corollary 5.90. If F is a free group o/rank2, then its commutator subgroup F' is a free 
group of infinite rank. 

Proof Let {x,y} be a basis of F. Since F/F' is free abelian with basis [xF 1 , yF'}, 
by Lemma 5.74, every coset F'b has a unique representative of the form x m y n , where 
m, n e Z; it follows that the transversal choosing £(F'V) — x m y n is a Schreier transversal, 
for every subword of x m y" is a word of the same form. If n > 0, then £(F'y") — y n , 
but £{F'y n x) — xy n ^ y n x. Therefore, there are infinitely many elements tsy,x — 
£(F'y n )x£(F'y n x)~ l 1, and so the result follows from the Nielsen-Schreier theorem. • 

Even though an arbitrary subgroup of a finitely generated free group need not be finitely 
generated, a subgroup of finite index must be finitely generated. 


Corollary 5.91. If F is a free group of finite rank n, then every subgroup S of F having 
finite index j is also finitely generated. In fact, rank (.S') = jn — j + 1. 

Proof. Let X = [xi,..., x n ] be a basis of F, and let l = {£(Sb)} be a Schreier transver¬ 
sal. By Theorem 5.88, a basis of 5 consists of all those elements tsb.x not equal to 1, where 
x e X. There are j choices for Sb and n choices for x, and so there are at most jn elements 
in a basis of S. Therefore, rank (.S’) < jn, and so S is finitely generated. 

Call an ordered pair (Sb, x) trivial if tsb,x = 1; that is, if £(Sb)x — £(Sbx). We will 
show that there is a bijection x[r between the family of cosets [Sb S} and the trivial 
ordered pairs, so that there are j — 1 trivial ordered pairs. It will then follow that 


rank (S') = jn - ( j - 1) = jn - j + 1. 

Since Sb S, we have £(Sb) — b = ux £ ; since l is a Schreier transversal, we have 
u e £. Define jr(Sb) as follows. 


i (r(Sux e ) 


| (Su, x) if s — +1; 
j (Sux~ l , x) if s — —1. 


Note that f(Sux £ ) is a trivial ordered pair. If s — +1, then £(Sux) = £(Sb) — b — ux, 
so that £(Su)x — ux and t$ u ,x = L If e = —1, then £(Sbx ) = £(Sux~ l x) — £(Su) = u, 
so that £(Sb)x — bx — ux~ l x — u and t$b.x = 1- 
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To see that i fr is injective, suppose that fr(Sb) — fr(Sc), where b — ux £ and c — vy' 1 ; 
we assume that x, y lie in the given basis of F and that s = ±1 and i] — ±1. There are 
four possibilities, depending on the signs of e and if. 

( Su,x ) = ( Sv , y); (Su,x) = ( Svy ~ l , y); (Sux~ l ,x) — (Sv, y); (Su,x) — ( Svy _1 , y). 

In every case, equality of ordered pairs gives x = y. If ( Su,x ) = (Sv. x), then Su — 
Sv, hence, Sb = Sux — Svx = Sc, as desired. If ( Su,x ) = (Svx~ , x), then Su — 
Svx~ l — Sc, and so £(Su) — £(Sc ) = c. But £(Su)x — l(Sux) = b, because ( Su,x ) is 
a trivial ordered pair. Hence, b = £(Su)x = cx = i>x _1 x, contradicting b (as any element 
of a Schreier transveral) being reduced. A similar contradiction shows that we cannot 
have (Sux~ l ,x) = ( Sv,x ). Finally, if ( Sux~ l ,x ) = S(ux _1 ,x), then Sb — Sux~ l = 
Svx~ l = Sc. 

To see that x[r is surjective, take a trivial ordered pair ( Svu , x); that is, £(Sw)x = vox = 
i(Swx). Now w = ux s , where u e £ and s = ±1. If s = +1, then w does not 
end with x -1 , and ^/(Swx) — ( Sw,x ). If e = —1, then w does end with x _1 , and so 
t/f (Su) — (Sux~ l , x) = (Sw, x). • 

Corollary 5.92. There exist nonisomorphic finitely generated groups G and H each of 
which is isomorphic to a subgroup of the other. 

Proof. If G is a free group of rank 2 and H is a free group of rank 3, then G H. 
Clearly, G is isomorphic to a subgroup of H. On the other hand, the commutator subgroup 
G' is free of infinite rank, and so G', hence G, contains a free subgroup of rank 3; that is, 
H is isomorphic to a subgroup of G. • 

We are at the very beginning of a rich subject called combinatorial group theory, which 
investigates how much can be said about a group given a presentation of it. One of the most 
remarkable results is the unsolvability of the word problem. A group G has a solvable word 
problem if it has a presentation G = (X \ R) for which there exists an algorithm to deter¬ 
mine whether an arbitrary word w on X is equal to the identity element in G (if X and R 
are finite, it can be proved that this property is independent of the choice of presentation). 
In the late 1950s, P. S. Novikov and W. W. Boone, independently, proved that there exists 
a finitely presented group G that does not have a solvable word problem (see Rotman, 
An Introduction to the Theory of Groups, Chapter 12). Other problems involve finding pre¬ 
sentations for known groups, as we have done for Q„ and Djn ; an excellent reference for 
such questions is Coxeter-Moser, Generators and Relations for Discrete Groups. Another 
problem is whether a group defined by a presentation is finite or infinite. For example, 
Burnside’s problem asks whether a finitely generated group G of finite exponent m, that 
is, x m — 1 for all x e G, must be finite [W. Burnside had proved that if such a group G 
happens to be a subgroup of GL(n, C) for some n, then G is finite]. The answer in general, 
however, is negative; such a group can be infinite. This was first proved, for m odd and 
large, by P. S. Novikov and S. I. Adyan, in a long and complicated paper. Using a ge¬ 
ometric technique involving van Kampen diagrams (see Lyndon-Schupp, Combinatorial 
Group Theory, for an introduction to this subject), A. Yu. Ol’shanskii gave a much shorter 
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and simpler proof. Finally, S. V. Ivanov was able to complete the solution by showing that 
the presented group can be infinite when m is even and large. Another geometric technique 
involves a Cayley graph of a finitely generated group G, which is a graph depending on 
a given finite generating set; it can be proved that G is free if and only if it has a Cay¬ 
ley graph that is a tree (see Serre, Trees). Finally, the interaction between presentations 
and algorithms is both theoretical and practical. A theorem of G. Higman (see Rotman, 
An Introduction to the Theory of Groups, Chapter 12) states that a finitely generated group 
G can be imbedded as a subgroup of a finitely presented group H (that is, H has a presen¬ 
tation with a finite number of generators and a finite number of relations) if and only if G 
is recursively presented: there is a presentation of G whose relations can be given by an 
algorithm. On the practical side, many efficient algorithms solving group-theoretic prob¬ 
lems have been implemented; see Sims, Computation with Finitely Presented Groups. The 
first such algorithm was coset enumeration (see Lyndon-Schupp, Combinatorial Group 
Theory, pages 163-167), which computes the order of a group G, defined by a presenta¬ 
tion, provided that |G| is finite (unfortunately, there can be no algorithm to determine, in 
advance, whether G is finite). 

Exercises 


5.68 Let G be a finitely generated group, and let H < G have finite index. Prove that H is finitely 
generated. 

5.69 Prove that if F is free of finite rank n > 2, then its commutator subgroup F' is free of infinite 
rank. 

5.70 Let G be a finite group that is not cyclic. If G = F/S, where F is a free group of finite rank, 
prove that rank(S) > rank(C). 

5.71 (i) Prove that if G is a finite group generated by two elements a, b having order 2, then 

G = Di n f° r some n > 2. 

(ii) Let G = (A, B) < GL(2, Q), where 



1 

o • 

o - 
1 _ 


"-1 f 

A = 

and B = 


L 1 !J 


0 1 


Show that A 2 = / = B , but that AB has infinite order. (Exercise prefextmodular 
group gives another example of a group in which the product of two elements of finite 
order has infinite order.) The group G is usually denoted by Doo, and it is called the 
infinite dihedral group. 

5.72 Let Y and S be groups, and let tp : Y —> S and 9 : S —> Y be homomorphisms with (p9 — 1$. 

(i) If p : Y —y Y is defined by p = 9<p, prove that pp = p and p(a) = a for every a e im 9. 
(The homomorphism p is called a retraction.) 

(ii) If K is the normal subgroup of Y generated by all p(y) f° r y e L, prove that 
K — kerip. 

Hint. Note that keri ip = kerp because 9 is an injection. Use the equation y = 
B(y)(p(y) _1 ).V for all y e Y. 
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Our main interest in this chapter is the study of polynomials in several variables. As usual, 
it is simpler to begin by looking at the more general setting—in this case, commutative 
rings—before getting involved with polynomial rings. It turns out that the nature of the 
ideals in a commutative ring is important: for example, we have already seen that gcds 
exist in PIDs and that they are linear combinations, while these properties may not be 
enjoyed by other commutative rings. Three special types of ideals—prime ideals, max¬ 
imal ideals, and finitely generated ideals—are the most interesting. A commutative ring 
is called noetherian if every ideal is finitely generated, and Hilbert’s basis theorem shows 
that k[x i,..., x n ], where k is a field, is noetherian. Next, we collect several interesting 
applications of Zorn’s lemma (which is discussed in the Appendix), such as the existence 
of maximal ideals, a theorem of I. S. Cohen saying that a commutative ring is noetherian if 
and only if every prime ideal is finitely generated, the existence and uniqueness of the alge¬ 
braic closures of fields, the existence of transcendence bases (as well as Liiroth’s theorem), 
and the existence of maximal separable extensions. The next step introduces a geometric 
viewpoint in which ideals correspond to certain affine subsets called varieties', this dis¬ 
cussion involves the Nullstellensatz as well as primary decompositions. Finally, the last 
section introduces the idea of Grobner bases, which extends the division algorithm from 
k[x~\ to k[x i, ..., x„] and which yields a practical algorithm for deciding many problems 
that can be encoded in terms of polynomials in several variables. 


6.1 Prime Ideals and Maximal Ideals 

A great deal of the number theory we have presented involves divisibility: Given two 
integers a and b, when does a \ b ; that is, when is a a divisor of bl This question 
translates into a question about principal ideals, for a \ b if and only if (b) C (a). We 
now introduce two especially interesting types of ideals: prime ideals, which are related to 
Euclid’s lemma, and maximal ideals. 

Let us begin with the analog of Theorem 2.76, the correspondence theorem for groups. 
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Proposition 6.1 (Correspondence Theorem for Rings). If I is a proper ideal in a 
commutative ring R, then there is an inclusion-preserving Injection cp from the set of all 
intermediate ideals J containing 7, that is, I C J C R, to the set of all the ideals in R/I, 
given by 

cp: J i->- 7t(J) = J // = [a + /: a e J], 
where jt : R —> R/I is the natural map. 



Proof. If we forget its multiplication, the commutative ring R is merely an additive 
abelian group and its ideal I is a (normal) subgroup. The correspondence theorem for 
groups. Theorem 2.76, now applies, and it gives an inclusion-preserving bijection 

<1> : {all subgroups of R containing 1} —>• {all subgroups of R/I}, 

where 0(7) = tt(J) — J/I. 

If 7 is an ideal, then 0(7) is also an ideal, for if r e R and a e 7, then ra e 7, and so 
(r -\- I}}a -\- I ) — ra -\- I £ 7 ] I . 

Let q> be the restriction of O to the set of intermediate ideals; <p is an injection because O 
is a bijection. To see that <p is surjective, let 7* be an ideal in R/I. Now 7r -1 (7*) is an 
intermediate ideal in R [it contains I — tr _1 ({0})], and (p(7t~ [ (J*)) — (J*)) = 7*, 

by Proposition 1.50(h). • 

In practice, the correspondence theorem is invoked, tacitly, by saying that every ideal in 
the quotient ring R/I has the form 7/7 for some unique ideal 7 with I C 7 C R. 

Example 6.2. 

Let I — (m) be a nonzero ideal in Z. If 7 is an ideal in Z containing 7, then 7 = (a) for 
some a e Z, because Z is a PID, and (m) c (a) if and only if a \ m. The correspondence 
theorem now shows that every ideal in the ring I„, has the form ([a]) for some divisor a of 
m, for 7/7 = ([a]). ◄ 
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Definition. An ideal I in a commutative ring R is called a prime ideal if it is a proper 
ideal, that is, I R, and cib e I implies a e I or b e I. 

Example 6.3. 

(i) Recall that a nonzero commutative ring R is a domain if and only if ah = 0 in R 
implies a — 0 or b = 0. Thus, the ideal (0) = {0} in R is a prime ideal if and only if R is a 
domain. 

(ii) We claim that the prime ideals in Z are precisely the ideals ( p ), where either p — 0 
or p is a prime. Since m and —m generate the same principal ideal, we may restrict our 
attention to nonnegative generators. If p — 0, then the result follows from item (i), for Z is 
a domain. If p > 0, we show first that (p ) is a proper ideal; otherwise, 1 e (p), and there 
would be an integer a with ap — 1, a contradiction. Next, if ab e ip), then p \ ab. By 
Euclid’s lemma, either p \ a or p | Ir, that is, either a e (p) or h e (p). Therefore, ( p) is 
a prime ideal. 

Conversely, if m > 1 is not a prime, then it has a factorization m — ab with 0 < a < m 
and 0 < b < m; thus, neither a nor h is a multiple of m, and so neither lies in (m). But 
ab — m e (in), and so ( m ) is not a prime ideal. ◄ 

Proposition 6.4. An ideal I in a commutative ring R is a prime ideal if and only if R/I 
is a domain. 

Proof. Let I be a prime ideal. Since I is a proper ideal, we have 1 f. I and so 1+/ f 0+7 
in R/I. If 0 = (a + I)(b+1) — ab + 7, then ab e I. Since 7 is a prime ideal, either a e I 
or b e 7; that is, either a + I = 0 or b + I = 0. Hence, R/I is a domain. The converse is 
just as easy. • 

The characterization of prime numbers in Example 6.3(h) extends to polynomials with 
coefficients in a field. 

Proposition 6.5. Ifk is a field, then a nonzero polynomial p(x) e k[x] is irreducible if 
and only if (p(x)) is a prime ideal. 

Proof. Suppose that p(x) is irreducible. First, ( p) is a proper ideal; otherwise, k[x] = (p) 
and hence 1 e (p), so there is a polynomial f(x) with 1 = p(x)f(x). But p(x) has degree 
at least 1, whereas 

0 = deg(l) = deg (pf) = deg (p) + deg(/) > deg(p) > 1. 

This contradiction shows that (p) is a proper ideal. Second, if ab e (p), then p \ ab, and 
so Euclid’s lemma in k[. r] gives p \ a or p \ b. Thus, a e (p) or b e (p). It follows that 
(p) is a prime ideal. 

Conversely, if (p(x)) is a prime ideal, then fg e (p) implies / e (p) or g e (p); that 
is, p | / or p | g. Therefore, Euclid’s lemma holds for p, and Exercise 3.31 on page 142 
shows that p is irreducible. • 
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If I is an ideal in a commutative ring R, we may write / C R if / is a proper ideal. 
More generally, if I and J are ideals, we may write I C / if / C / and I J. 

Here is a second interesting type of ideal. 

Definition. An ideal I in a commutative ring R is a maximal ideal if it is a proper ideal 
and there is no ideal J with I C / C R. 

Thus, if I is a maximal ideal in a commutative ring R and if J is a proper ideal with 
I C /, then I = J. Does every commutative ring R contain a maximal ideal? The (posi¬ 
tive) answer to this question involves Zorn’s lemma , which we will discuss in Section 6.4. 

Example 6.6. 

The ideal {0} is a maximal ideal in a commutative ring R if and only if R is a field. It is 
shown in Example 3.5l(ii) that every nonzero ideal I in R is equal to R itself if and only 
if every nonzero element in R is a unit. That is, {0} is a maximal ideal if and only if R is a 
field. ◄ 

Proposition 6.7. A proper ideal I in a nonzero commutative ring R is a maximal ideal if 
and only if R/I is afield. 

Proof The correspondence theorem for rings shows that I is a maximal ideal if and only 
if R/I has no ideals other than {0} and R/I itself; Example 6.6 shows that this property 
holds if and only if R/I is a field. (Note that since 1 0 in a field, I must be a proper 

ideal.) • 

Corollary 6.8. Every maximal ideal I in a commutative ring R is a prime ideal. 

Proof. If I is a maximal ideal, then R/I is a field. Since every field is a domain, R/I is 
a domain, and so I is a prime ideal. • 

The prime ideals in the polynomial ring k[x i,..., x n ] can be quite complicated, but 
when k is an algebraically closed field. Theorem 6.101 shows that every maximal ideal 
has the form (xi — a \,... ,x n — a„) for some point (a\,.... a n ) e k n \ that is, when k is 
algebraically closed, there is a bijection between k n and the set of all maximal ideals in 
k[x i, x n ]. 

Example 6.9. 

The converse of Corollary 6.8 is false. For example, consider the principal ideal (x) in 
Z[x], By Exercise 3.83 on page 196, we have 

Z[x]/(x) = Z; 

since Z is a domain, (x) is a prime ideal; since Z is not a field, (x) is not a maximal ideal. 
It is not difficult to exhibit a proper ideal J strictly containing (x); let 

J — {/(x) e Z[x]: fix) has even constant term}. 

Since Z [xi]/J = F 2 is a field, it follows that / is a maximal ideal containing (x). ◄ 
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Example 6.10. 

Let k be a field, and let a = (ai ,..., a n ) e k n . Define the evaluation map 

e a \ k[x i, ...,x n ] -+ k 
by 

e a : fix i,... ,x n ) i-> / (a) = /(ai-- a„). 

We have seen, in Example 3.46(iv), that e a is a surjective ring homomorphism, and so 
kere fl is a maximal ideal. Now (xi — ai,..., x„ — a„) C kere fl . In Exercise 6.6(i) on 
page 325, however, we shall see that (x\ — ai,..., x„ — a„) is a maximal ideal, and so it 
must be equal to ker e a . < 

The converse of Corollary 6.8 is true when R is a PID. 

Theorem 6.11. If R is a principal ideal domain, then every nonzero prime ideal I is a 
maximal ideal. 

Proof. Assume that there is a proper ideal J with I C J. Since R is a PID, I — (a) and 
J — (h) for some a, b e R. Now a e J implies that a — rb for some r e R, and so 
rb e /; but I is a prime ideal, so that r e / or b € /. If r e I, then r = sa for some s e R, 
and so a = rb — sab. Since R is a domain, 1 = sb , and Exercise 3.18 on page 125 gives 
J = (b) — R, contradicting the hypothesis that / is a proper ideal. If b e /, then / c /, 
and so J = I. Therefore, I is a maximal ideal. • 

We can now give a second proof of Proposition 3.116. 

Corollary 6.12. If k is a field and p(x) e k[x] is irreducible, then the quotient ring 
k[x]/(p(x)) is afield. 

Proof. Since p(x) is irreducible, the principal ideal I — (p(x)) is a nonzero prime ideal; 
since k[x] is a PID, I is a maximal ideal, and so k[x]/I is a field. • 

Here are some ways that prime ideals can be used. 

Proposition 6.13. Let P be a prime ideal in a commutative ring R. If I and J are ideals 
with IJ C P, then I C P or J C P. 

Proof. Suppose, on the contrary, that I £ P and ./ g P\ thus, there are a e I and b e J 
with a, b £ P. But ab 6 //CP, contradicting P being prime. • 

The next result is taken from Kaplansky, Commutative Rings. 

Proposition 6.14. Let B be a subset of a commutative ring R which is closed under 
addition and multiplication. 

(i) Let J\, .... J n be ideals in R, at least n — 2 of which are prime. If B C /j U • • • U J n , 
then B is contained in some Jj. 



324 


Commutative Rings II Ch. 6 


(ii) Let I be an ideal in R with I C B. If there are prime ideals P\, , P n such that 

B — I C Pi U • • • U P n (where B — I is the set-theoretic complement of I in B ), then 
B C Pj for some i. 

Proof, (i) The proof is by induction on n > 2. For the base step n = 2, neither of the 
ideals J\ or J 2 need be prime. If B g J 2 , then there is h\ e B with h\ f Jp, since 
B C. J 1 U J 2 , we must have b\ e J\. Similarly, if B £ J 1 , there is b 2 € B with bi f J 1 
and /?2 € J 2 - However, if y = b\ + b 2 , then y f J 1 : otherwise, b 2 — y — b\ e /1 (because 
both y and b\ are in / 1 ), a contradiction. Similarly, y f J 2 , contradicting B C J | \J J 2 . 

For the inductive step, assume that B C /jU-• - U 1 , where at least n — 1 = (n + 1)— 2 
of the Jj are prime ideals. Let 

Di = Ji U ... U Z U • • • U J n+ 1 . 


Since Z), is a union of n ideals at least (n — 1) — 1 — n— 2 of which are prime, the inductive 
hypothesis allows us to assume that B f Di for all i. Hence, for all i, there exists bj e B 
with bj Dp, since B C If U J,, we must have bj e Jj. Now n > 3, so that at least one 
of the Jj is a prime ideal; for notation, assume that J\ is prime. Consider the element 


y = b\ +^ 2^3 ' --K+l- 

Since all bj e B and B is closed under addition and multiplication, y e B. Now y f J 1 ; 
otherwise, £> 2^3 • • • b n +i — y — b 1 e J\. Since J\ is prime, some bj e J\. This is a 
contradiction, for bj f. If f J\. If / > 1 and y e Jj, then Iplp ■ ■ ■ b n+ i e Jj, because 
Jj is an ideal, and so b\ — y — Ipb^ ■ ■ ■ b n+ \ e Jj. This cannot be, for b\ ^ D\ f Jj. 

Therefore, y f Jj for any ;, contradicting B C J\ U • • • U J n+ \. 

(ii) The hypothesis gives B C I U P\ U • • • U P n , so that part (i) gives B C I or B C Pj. 

Since / is a proper subset of B, the first possibility cannot occur. • 


Exercises 

6.1 (i) Find all the maximal ideals in Z. 

(ii) Find all the maximal ideals in R[.v]; that is, describe those g(.v) G R[x] for which (g) is 
a maximal ideal. 

(iii) Find all the maximal ideals in C[x]. 

6.2 Let I be an ideal in a commutative ring R. If J* and L* are ideals in R/I, prove that there exist 
ideals J and L in R containing / such that J /I = J*, L/1 = L* , and ( J H L)/1 = J* H L*. 
Conclude that if J* n L* = {0}, then J C\L = I. 

Hint. Use the correspondence theorem. 

6.3 (i) Give an example of a commutative ring containing two prime ideals P and Q for which 

P fl Q is not a prime ideal. 

(ii) If P\ 5 Pi — ''' Pi — Zh +1 5 • • • is a decreasing sequence of prime ideals in a 
commutative ring R, prove that n«>l Zn is a prime ideal. 
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6.4 Let /: A —> R be a ring homomorphism, where A and R are commutative nonzero rings. 
Give an example of a prime ideal P in A with f(P) not a prime ideal in R. 

6.5 Let f:A—>R be a ring homomorphism. If Q is a prime ideal in R, prove that / -1 (Q) is a 
prime ideal in A. Conclude that if J/I is a prime ideal in R/I, where / C J C R, then / is a 
prime ideal in R. 

6.6 (i) Let k be a field, and let a \, ..., a„ e k. Prove that (x\ — a\ . x n — a n ) is a maximal 

ideal in k[x\, ..., x,,]. 

(ii) Prove that if Xj — b e (x\ — a i, ..., x n — a n ) for some i, where b e k, then b = a;. 

(iii) Prove that fi: k n —> {maximal ideals in k[x \, ..., x n ]], given by 

pt: (ai,..., a n ) i-> {x\ - a \,..., x n - a n ), 

is an injection, and give an example of a field k for which p. is not a surjection. 

6.7 Prove that if P is a prime ideal in a commutative ring R and if r n e P for some r e R and 
n > 1 , then r e P . 

6.8 Prove that the ideal (x 2 — 2, y 2 + 1, z) in Q[x, y, z] is a proper ideal. 

6.9 (i) Call a nonempty subset S of a commutative ring R multiplicatively closed i (’ 0 f_ S and, 

if s, s' e S , then .s'/ e S. Prove that an ideal J which is maximal with the property that 
J fl 5 = 0 is a prime ideal. (The existence of such an ideal J is proved, using Zorn's 
lemma, in Exercise 6.48 on page 374.) 

(ii) Let 5 be a multiplicatively closed subset of a commutative ring R, and suppose that 
there is an ideal / with I fl S = 0. If P is an ideal maximal such that /CP and 
P fl S = 0, prove that P is a prime ideal. 

6.10 (i) If I and J are ideals in a commutative ring R. define 

IJ = {all finite sums : ag e I and b^ e i}. 

i 

Prove that / J is an ideal in R and that f/C/nl. 

(ii) If / = (2) is the ideal of even integers in Z, prove that /“ = tf C in t = /. 

(iii) Let P be a prime ideal and let Q \, ... Q r be ideals. Prove that if Q\ fl • • • fl Q r C P. 
then Qj C P for some i. 

6.11 Let I and J be ideals in a commutative ring R. 

(i) Prove that the map R/(I fl J) -*■ R/I x R/J, given by <p: r (r + I, r + J), is an 
injection. 

(ii) Call / and J coprime if I + J — R. Prove that if I and J are coprime, then the ring 
homomorphism < -p : R/{1 fl J) —>- R/I x R/J in part (i) is a surjection. 

Hint. If / and J are coprime, there are a e I and b e J with 1 = a + b. If r, r' e R , 
prove that(rl + I.d + J) = (r + I,r' + 7) e R/I x R/J, where d = r'a + rb. 

(iii) Generalize the Chinese remainder theorem as follows. Let R be a commutative ring 
and let /i, ..., I n be pairwise coprime ideals; that is, /, and Ij are coprime for all i / j. 
Prove that if a\, ... % a n e R , then there exists r e R with r + /,- = a ,• + /,• for all i. 

6.12 If I and J are coprime ideals in a commutative ring R, prove that 


/ n J = IJ. 



326 


Commutative Rings II Ch. 6 


6.13 If I is an ideal in a commutative ring R and if S is a subset of R, define the colon ideal 1 

(7 : S) = {r e R : rs e 7 for all 5 e S}. 


(i) Prove that (7 : 5) is an ideal. 

(ii) If J = (5) is the ideal generated by S, prove that (7 : S) = (7 : J). 

(iii) Let R be a domain and let a, b e R, where b ^ 0. If 7 = ( ab ) and J = (b), prove that 

(7:7) = (a). 

6.14 (i) Let 7 and J be ideals in a commutative ring R. Prove that 7 C (7 : J) and that 

7(7 : J) C 7. 

(ii) Prove that if 7 = (2l n • • • fl Q r , then 

(7 : J) = (Qr- 7)n...n(Gr : 7). 


(Hi) 


If 7 is an ideal in a commutative ring R, and if 7 = J\ + ■ ■ ■ + J n is a sum of ideals, 
prove that 

(7:7) = (7:7 1 )n...n(7:7„). 


6.15 A Boolean ring is a commutative ring R in which cr = a for all a e R. Prove that every 
prime ideal in a Boolean ring is a maximal ideal. (See Exercise 8.21 on page 533.) 

Hint. When is a Boolean ring a domain? 

6.16 A commutative ring R is called a local ring if it has a unique maximal ideal. 

(i) If p is a prime, prove that the ring of p-adic fractions. 


%(p) = {“lb e Q: p\b]. 


is a local ring. 

(ii) If R is a local ring with unique maximal ideal m, prove that a e R is a unit if and only 
if a f m. 

Hint. You may assume that every nonunit in a commutative ring lies in some maximal 
ideal (this result is proved using Zorn's lemma). 


6.2 Unique Factorization Domains 


We have proved unique factorization theorems in Z and in k[x], where k is a field. We 
are now going to prove a common generalization: Every PID has a unique factorization 
theorem. We will then prove a theorem of Gauss: If R has unique factorization, then so 
does R[x], A corollary is that there is unique factorization in the ring k[x \,..., x n ] of all 
polynomials in several variables over a field k. One immediate consequence is that any 
two polynomials in several variables have a gcd. 

We begin by generalizing some earlier definitions. 

1 This ideal is also called the ideal quotient. 
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Definition. Elements a and b in a commutative ring R are associates if there exists a unit 
a e R with b = ua. 

For example, in Z, the units are ±1, and so the only associates of an integer m are ±m; 
in k[. r], where k is a field, the units are the nonzero constants, and so the only associates 
of a polynomial f(x) e k[x~\ are the polynomials uf(x), where nek and u ^ 0. The 
only units in Z[x] are ±1 (see Exercise 6.19 on page 339), and so the only associates of a 
polynomial f(x) e Z[x\ are ±/(x). 

In any commutative ring R, associates a and b generate the same principal ideal; the 
converse may be false if R is not a domain. 

Proposition 6.15. Let R be a domain and let a, b e R. 

(i) a | b and b \ a if and only if a and b are associates. 

(ii) The principal ideals (a) and ( b ) are equal if and only if a and b are associates. 

Proof, (i) If a \ h and h \ a , there are r, ,y e R with b — ra and a — sb , and so 
b — ra — rsb. If b = 0, then a = 0 (because b | a); if b f 0, then we may cancel it (R 
is a domain) to obtain 1 = rs. Hence, r and ,v are units, and a and b are associates. The 
converse is obvious. 

(ii) If (a) = (b), then a e (/?); hence, a = rb for some r e R, and so h a. Similarly, 
b e (a) implies a \ b. and so (i) shows that a and b are associates. 

Conversely, if a — ub , where u is a unit, then a e ( b ) and (a) C ( b ). Similarly, 
b — u~ l a implies (b) C (a), and so (a) — (b). • 

Recall that an element p in a domain R is irreducible if it is neither 0 nor a unit and 
if its only factors are units or associates of p. For example, the irreducibles in Z are 
the numbers ±p, where p is a prime, and the irreducibles in k[x], where A: is a field, 
are the irreducible polynomials p(x); that is, deg(p) > 1 and p(x) has no factorization 
p{x) = f(x)g(x) where deg(/) < deg(p) and deg(g) < deg(p). This characterization of 
irreducible polynomials does not persist in rings R[x] when R is not a field. For example, 
in Z[x], the polynomial f(x) — 2x + 2 cannot be factored into two polynomials, each 
having degree smaller than deg(/) = 1, yet f(x) is not irreducible, for in the factorization 
2x + 2 = 2(x + 1), neither 2 nor x + 1 is a unit. 

Corollary 6.16. If R is a PID and p e R is irreducible, then (p) is a prime ideal. 

Proof. Fet I be an ideal with (p) C I. Since R is a PID, there is q e R with I — (q). 
Hence, p e (q), and so p — rq for some r e R. Irreducibility of p says that q is either 
an associate of p or a unit. In the first case, (p) — (q), by Proposition 6.15; in the second 
case, (q) = R. It follows that ( p) is a maximal ideal, and hence it is a prime ideal, by 
Corollary 6.8. • 

Here is the definition we have been seeking. 
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Definition. A domain R is a unique factorization domain (UFD) if 

(i) every r e R, neither 0 nor a unit, is a product 2 of irreducibles; 

(ii) if up i • • • p m — vq\ ■ ■ ■ q„ , where u and v are units and all /?, and qj are irreducible, 
then in — n and there is a permutation o e S„ with p, and q a (i) associates for all i. 

When we proved that Z and k[x\, for k a field, have unique factorization into irre¬ 
ducibles, we did not mention associates because, in each case, irreducible elements were 
always replaced by favorite choices of associates: In Z, positive irreducibles (i.e., primes) 
are chosen; in k[x ], monic irreducible polynomials are chosen. The reader should see, for 
example, that the statement: “Z is a UFD” is just a restatement of the fundamental theorem 
of arithmetic. 

Proposition 6.17. Let R be a domain in which every r e R, neither 0 nor a unit, is a 
product of irreducibles. Then R is a UFD if and only if (p) is a prime ideal in R for every 
irrreducible element p e R. 3 

Proof. Assume that R is a UFD. If a, b e R and ab e (p), then there is r e R with 

ab = rp. 

Factor each of a,b, and r into irreducibles; by unique factorization, the left side of the 
equation must involve an associate of p. This associate arose as a factor of a or b, and 
hence a e (p) or b e (p). 

The proof of the converse is merely an adaptation of the proof of the fundamental theo¬ 
rem of arithmetic. Assume that 


upi---p m — vqi---q n , 

where p, and qj are irreducible elements and u , v are units. We prove, by induction on 
max{m, n) > 1, that n = in and the q’s can be reindexed so that qi and p, are associates 
for all i. If ma x{m,n] = 1, then up j = vq \, up\ — v, or u — vq\. The latter two 
cases cannot occur, for irreducible elements are not units, and so the base step is true. For 
the inductive step, the given equation shows that p\ \ q\ ■ ■ -q n . By hypothesis, (pi) is a 
prime ideal (which is the analog of Euclid’s lemma), and so there is some qj with p\ \ qj. 
But qj, being irreducible, has no divisors other than units and associates, so that qj and 
pi are associates: qj = upi for some unit u. Canceling pi from both sides, we have 
P 2 - ■ ■ Pm — uq i • • • qj ■ ■ ■ q n . By the inductive hypothesis, m — 1 = n — 1 (so that m = n), 
and, after possible reindexing, qj and pi are associates for all i. • 

The proofs we have given that Z and fc[x], where A: is a field, are UFDs involve the 
division algorithm; as a consequence, it is not difficult to generalize them to prove that 
every euclidean ring is a UFD. We now show that every PID is, in fact, a UFD; the proof 
uses a new idea: chains of ideals. 

“To avoid long phrases, we allow a product of irreducibles to have only one factor; that is, an irreducible 
element is regarded as a product of irreducibles. 

3 An element p for which ( p ) is a nonzero prime ideal is often called a prime element. Such elements have 
the property that p \ ab implies p \ a or p \ b. 
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Lemma 6.18. 

(i) If R is a commutative ring and 

h C I 2 C • • • C I n C /„+, C . . . 

is an ascending chain of ideals in R, then J — U«>i In is an ideal in R. 

(ii) If R is a PID, then it has no infinite strictly ascending chain of ideals 


h c h c ■ • • c /„ c i n+l c • • • . 


(iii) Let R be a PID. If r e R is neither 0 nor a unit, then r is a product of irreducibles. 

Proof, (i) We claim that J is an ideal. If a € J, then a e /„ for some n; if r e R, then 
ra e because I n is an ideal; hence, ra e J. If a, b e J , then there are ideals /„ and I m 
with a e I„ and b e since the chain is ascending, we may assume that /„ C /„,, and so 
a, b e /,„. As /„, is an ideal, a + b e I m and, hence, a + b e J. Therefore, J is an ideal. 

(ii) If, on the contrary, an infinite strictly ascending chain exists, then define J — [_Jh>i Ai- 
By (i), J is an ideal; since R is a PID, we have J — (d) for some d e J. Now d got into J 
by being in /„ for some n. Hence 


J — id) ^ In ^ In+l ^ J > 


and this is a contradiction. 

(iii) A divisor r of an element a e R is called a proper divisor of a if r is neither a unit 
nor an associate of a. If r is a divisor of a, then (a) c (r)\ if r is a proper divisor, then 
(i a ) C (r), for if the inequality is not strict, then (a) = (r), and this forces a and r to be 
associates, by Proposition 6.15. 

Call a nonzero nonunit a e R good if it is a product of irreducibles; otherwise, call a 
bad. We must show that there are no bad elements. If a is bad, it is not irreducible, and so 
a — rs, where both r and .v are proper divisors. But the product of good elements is good, 
and so at least one of the factors, say r, is bad. The first paragraph shows that (a) C (r ). 
It follows, by induction, that there exists a sequence a\ — a, as = r,ai,... ,a n ,... of 
bad elements with each a n + 1 a proper divisor of a n , and this sequence yields a strictly 
ascending chain 

(at) C {as) C ••• C ( a„) C (a n + 1 ) C • • • , 
contradicting part (i) of this lemma. • 


Theorem 6.19. If R is a PID, then R is a UFD. In particular, every euclidean ring is a 
UFD. 

Proof. In view of the last two results, it suffices to prove that (p) is a prime ideal whenever 
p is irreducible. Since R is a PID, Proposition 6.16 shows that ( p) is a prime ideal. • 
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The notion of gcd can be defined in any commutative ring R. Example 6.21 shows that 
there exist domains R containing a pair of elements having no gcd. If a\,... ,a n e R, 
then a common divisor of a \,..., a n is an element c e R with c | a,- for all i. A greatest 
common divisor or gcd of a\,... ,a n is a common divisor d with c | d for every common 
divisor c. Even in the familiar examples of Z and k[x}, gcd’s are not unique unless an extra 
condition is imposed. For example, in k[x ], where k is a field, we impose the condition 
that nonzero gcd’s are monic polynomials. In a general PID, however, elements may not 
have favorite associates. 

If R is a domain, then it is easy to see that if d and d' are gcd’s of elements a i, ..., a n , 
then d \ d' and d' \ d. It follows from Proposition 6.15 that d and d' are associates and, 
hence, that ( d ) = id'). Thus, gcd’s are not unique, but they all generate the same principal 
ideal. 

The idea in Proposition 1.17 carries over to show that gcd’s do exist in UFDs. 

Proposition 6.20. If R is a UFD, then a gcd of any finite set of elements a\,..., a n in R 
exists. 

Proof It suffices to prove that a gcd of two elements a and b exists; the general result 
follows by induction on the number of elements. 

There are units u and v and distinct irreducibles p \, with 

e\ eo et 

a = up l p 2 • • • p t 
and 

b = vp{ i p( 2 ---pf‘, 

where et > 0 and fi > 0 for all i. It is easy to see that if c \ a, then the factorization of c 
into irreducibles is c — wpf 1 pf 2 • • • p f , where w is a unit and 0 < gj < e; for all i. Thus, 
c is a common divisor of a and b if and only if gi < m, for all;, where 

m, = min{e,-, fi}. 

It is now clear that p™ 1 p'" 2 ■ ■ ■ p™' is a gcd of a and b. • 

It is not difficult to see that if a, = Ujp^' 1 p e f 2 ■ ■ • p e t n , where > 0, m/ are units, and 
i = l,... ,n, then 

, fll fin n-l 

d = P\P2 ■'' Pt 

is a gcd of «i,..., a„, where p.j = min{ei j, eij ,..., e„j}. 

We caution the reader that we have not proved that a gcd of elements a\,... ,a n is a 
linear combination of them; indeed, this may not be true (see Exercise 6.21 on page 339). 

Example 6.21. 

Let k be a field and let R be the subring of k[x} consisting of all polynomials f(x) e k[. r] 
having no linear term; that is, f(x) — ao+a 2 X 2 + - ■ ■+a n x n . In Exercise 3.60 on page 158, 
we showed that x 5 and x ( have no gcd in R. It now follows from Proposition 6.20 that R 
is not a UFD. [Another example of a domain that is not a UFD is given in Exercise 6.3 l(ii) 
on page 340.] ◄ 
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Definition. Elements a \,..., a n in a UFD R are called relatively prime if their gcd is a 
unit; that is, if every common divisor of a \,..., a n is a unit. 

We are now going to prove that if R is a UFD, then so is R[x], Recall Exercise 3.21 on 
page 130: If R is a domain, then the units in R[x] are the units in R. 

Definition. A polynomial f(x ) = a„x n + ■ ■ ■ + a i x + c/q e R\x], where R is a UFD, is 
called primitive if its coefficients are relatively prime; that is, the only common divisors of 
a„,... ,a i, ao are units. 

Of course, every monic polynomial is primitive. Observe that if fix) is not primitive, 
then there exists an irreducible q e R that divides each of its coefficients: If the gcd is a 
nonunit cl. then take for q any irreducible factor of d. 

Example 6.22. 

We now show, for a UFD R , that every irreducible p (x ) e R[x] of positive degree is 
primitive. If not, then there is an irreducible q e R with p (x ) = qg(x ); note that deg (c/) = 
0 because q e R. Since p(x ) is irreducible, its only factors are units and associates, and so 
q must be an associate of p(x). But every unit in R [x | has degree 0 [i.e., is a constant (for 
uv = 1 implies deg(n) + deg(i>) = deg(l) = 0)]; hence, associates in R[x] have the same 
degree. Therefore, q is not an associate of p(x), because the latter has positive degree, and 
so p(x) is primitive. < 

We begin with a technical lemma. 

Lemma 6.23 (Gauss’s Lemma). If R is a UFD and fix), g(x) e R[x] are both primi¬ 
tive, then their product f(x)g(x) is also primitive. 

Proof. If n : R —> R/ip) is the natural map tt : a a + ip), then Proposition 3.48 
shows that the function Jr: R[x] —*■ iR/ip)) [x], which replaces each coefficient c of 
a polynomial by 7 r(c), is a ring homomorphism. If a polynomial h (x) e R[x] is not 
primitive, there is some irreducible p such that all the coefficients of Jcih) are 0 in R/(p): 
that is, JIih) — 0 in iR/ip)) [x]. Thus, if the product fix)gix) is not primitive, there is 
some irreducible p with 0 = Jr ifg) = Ttif)Jcig) in (R/ip)) [x]. Since (p) is a prime 
ideal, R/ip) is a domain, and hence iR/ip)) [x] is also a domain. But, neither Jr if) nor 
Jr(g) is 0 in (R/ip)) [x], because / and g are primitive, and this contradicts (R/ip)) [x] 
being a domain. • 


Lemma 6.24. Let R be a UFD, let Q = Frac( R), and let f(x) e Q[x] be nonzero. 

(i) There is a factorization 

f(x)=c(f)f*(x), 

where c(f) e Q and f*(x) e R[x] is primitive. This factorization is unique in the 
sense that if fix) = qg*(x), where q e Q and g*(x) e /? [x] is primitive, then there 
is a unit w e R with q — wc(f) and g*(x) = w~ l f*(x). 



332 


Commutative Rings II Ch. 6 


(ii) If fix), g(x) e /?[x], then cifg) and c(f)c(g) are associates in R and ifg)* and 
f*g* are associates in f?[x]. 

(iii) Let fix) e Q[x] have a factorization fix) = qg*ix), where q e Q and g*(x) G 

[jc] is primitive. Then fix) e /?[x] if and only ifq e R. 

(iv) Let g*ix), fix) G /? [-v]. If g*ix) is primitive and g*(x) | bfix), where b e R and 
b 0, then g*(x) | fix). 

Proof, (i) Clearing denominators, there is b e R with bfix) e R [x ]. If d is the gcd of 
the coefficients of bfix), then ib/d)fix) e /?[x] is a primitive polynomial. If we define 
cif) — d/b and /*(x) = c(/) fix), then /*(x) is primitive and fix) = c(/) f*ix). 

To prove uniqueness, suppose that c(/)/*(x) = fix) = qg*ix), where cif), q e Q 
and f*ix), g*ix) e /?[x] are primitive. Exercise 6.17 on page 339 allows us to write 
q/cif) in lowest terms: q/cif) — u/v, where u and v are relatively prime elements of R. 
The equation i >/*(x) = ug*ix) holds in /?[x]; equating like coefficients, v is a common 
divisor of each coefficient of ug*ix). Since u and v are relatively prime. Exercise 6.18(i) 
on page 339 gives i> a common divisor of the coefficients of g*ix). But g*(x) is primitive, 
and so v is a unit. A similar argument shows that u is a unit. Therefore, q/cif) — u/v is 
a unit in R, call it w; we have wc( f) = q and /*(x) = wg*ix), as desired. 

(ii) There are two factorizations of /(x)g(x) in /?[x]: fix)g(x) — ci fg)ifix)gix))* and 
/(x)g(x) = cif) f*ix)cig)g*ix) = cif)cig) f*ix)g*ix). Since the product of primitive 
polynomials is primitive, each of these is a factorization as in part (i), and the uniqueness 
assertion there gives cifg) an associate of c(/)c(g) and ifg)* an associate of f*g*. 

(iii) If q e R, then it is obvious that fix) = qg*ix) e /? [x]. Conversely, if fix) e /? [x], 
then there is no need to clear denominators, and so cif) — d e R, where d is the gcd of 
the coefficients of fix). Thus, fix) = df*ix). By uniqueness, there is a unit w e R with 
q — wd e R. 

(iv) Since bf — hg*, we have bcif)f* = cih)h*g* = cih)ihg)*. By uniqueness, /*, 

ihg)*, and h*g* are associates, and so g* | /*. But / = c(/)/*, and so g* \ f. • 


Definition. Let R be a UFD with Q = Frac( R ). If fix) e Q[x], there is a factorization 
fix) = cif)f*ix), where cif) e Q and /*(x) e /?[x] is primitive. We call cif) the 
content of fix) and /*(x) the associated primitive polynomial. 

In light of Lemma 6.24(i), both cif) and /*(x) are essentially unique, differing only 
by a unit in R. 

Theorem 6.25 iGauss). If R is a UFD, then /?[x] is also a UFD. 

Proof. We show first, by induction on deg(/), that every fix) e /?[x], neither zero 
nor a unit, is a product of irreducibles. If deg(/) = 0, then fix) is a constant, hence 
lies in R. Since R is a UFD, / is a product of irreducibles. If deg(/) > 0, then fix) = 
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c(/)/*(x), where c(/) e R and f*(x) is primitive. Now c(/) is either a unit or a product 
of irreducibles, by the base step. If f*(x) is irreducible, we are done. Otherwise, f*(x) = 
g(x)hix), where neither g nor h is a unit. Since f*(x) is primitive, however, neither g nor 
h is a constant; therefore, each of these has degree less than deg(/*) = deg(/), and so 
each is a product of irreducibles, by the inductive hypothesis. 

Proposition 6.17 now applies: /?[x] is a UFD if ( p(pc )) is a prime ideal for every ir¬ 
reducible pix) e /?[x]; that is, if p \ fg, then p \ f or p \ g. Let us assume that 
P(x) t /(*)• 

Case (i). Suppose that dcg( p) = 0. Write 

fix) = c(f)f*(x) and g(x) = c(g)g*(x), 
where c(/), c(g) e R , and f*(x), g*(x) are primitive. Now p \ fg, so that 

P I c{f)c(g)f*(x)g*(x). 

Since f*(x)g*(x) is primitive. Lemma 6.24(ii) says that c(f)c(g) an associate of c(fg). 
However, if p \ f(x)g(x), then p divides each coefficient of fg; that is, p is a common 
divisor of all the coefficients of fg, and hence in R, which is a UFD, p divides the asso¬ 
ciates c(fg) and c( f)c(g). But Proposition 6.17 says that ( p) is a prime ideal in R, and 
so p | c(/) or p | c(g). If p | c(/), then p divides c(/)/*(x) — fix), a contradiction. 
Therefore, p \ c(g) and, hence, p \ g(x), as desired. 

Case (ii). Suppose that deg(p) > 0. Let 

ip, f) = {six)p{x) + t(x)f(x): s(x),t(x) e /?[x]}; 

of course, ( p , /) is an ideal containing p(x) and fix). Choose m{x) e ip, f) of minimal 
degree. If Q — Frac( R) is the fraction field of R, then the division algorithm in Q[x] gives 
polynomials q'ix), r'(x) e Q[x] with 

fix) — m(x)q'(x) + r(x), 

where either r'{x) — 0 or deg(r ') < deg(m). Clearing denominators, there are polynomials 
q(x), r(x) e /?[x] and a constant b e R with 

bfix) — qix)m(x) + r(x), 

wherer(x) = Oordeg(r) < deg(m). Since m e ip, /), there are polynomials s(x), t{x) e 
/?[x] with mix) — six)pix) + f(x) fix); hence r — bf — qm e ip, /). Since m has 
minimal degree in ip, /), we must have r — 0; that is, bfix) = m{x)q{x), and so bfix) = 
cim)m*ix)qix). But m*(x) is primitive, and m*{x) \ bfix), so that m*{x) \ fix), by 
Lemma 6.24(iv). A similar argument, replacing f(x) by pix) (that is, beginning with an 
equation b"pix) = q"ix)mix) + r"{x) for some constant b"), gives m*(x) | pix). Since 
pix) is irreducible, its only factors are units and associates. If m*(x) were an associate of 
pix), then pix) \ fix) (because pix) \ m*fx) \ fix)), contrary to the hypothesis. Hence, 
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m*(x ) must be a unit; that is, mix) = elm) e R, and so ( p, f) contains the nonzero 
constant c(m). Now c(m) = sp + tf, and so 

c(m)gix) = s(x)p(x)g(pc) +t(x)f(x)g(x). 

Since pix) \ f(x)gix), we have p(x) \ c(m)g(x). But pix) is primitive, because it is 
irreducible, by Example 6.22, and so Lemma 6.24(iv) gives p(x ) | g(x). • 


Corollary 6.26. Ifk is afield, then k[x i, ..., x n ] is a UFD. 

Proof. The proof is by induction on n > 1. We proved, in Chapter 3, that the polynomial 
ring &[xi] in one variable is a UFD. For the inductive step, recall that k[x i ,... ,x„, x n +\ ] = 
R[x n + 1 ], where R — k[x i,..., x n ]. By induction, R is a UFD, and by Corollary 6.25, so 
is tf[x„ + i]. • 

Proposition 6.20 shows that if k is a field, then ged’s exist in k[x i, ..., x n ]. 

Corollary 6.27 (Gauss). Let R be a UFD, let Q — Frac (R), and let fix) e R[x]. If 

fix) = G(x)H(x) in Q[x ], 

then there is a factorization 

fix) = g(x)h(x) in /?[*], 

where deg(g) = deg(G) and deg(7i) = deg (H); in fact, G(x) is a constant multiple of 
g(x) and H{x) is a constant multiple of h(x). Therefore, if fix) does not factor into 
polynomials of smaller degree in /?[x], then fix) is irreducible in Q[x]. 

Proof By Lemma 6.24(i), the factorization fix) = Gix)H (x) in Q[x] gives q, q' e Q 
with 

fix) = qG*ix)q'H*ix) in g[x], 

where G*(x), H*ix) e /?[x] are primitive. But G*(x)Il*ix) is primitive, by Gauss’s 
lemma. Since fix) e /?[x], Lemma 6.24(iii) applies to say that the equation fix) = 
qq' [G*(x) H* (x)] forces qq' e R. Therefore, qq'G*ix) e /?[x], and a factorization of 
fix) in /?[x] is fix) — [qq f G*ix)]H*ix). • 

The special case R = Z and Q — Q is, of course, important. 

Example 6.28. 

We claim that fix, y) = x 2 + y 2 — 1 e k[x, y] is irreducible, where k is a field. Write 
Q = kiy) — Frac(T[y]), and view fix, y) e Q[x], Now the quadratic g(x) = x 2 + 
(y 2 — 1) is irreducible in Q[x] if and only if it has no roots in Q — kiy), and this is so, by 
Exercise 3.34 on page 142. 

It follows from Proposition 6.17 that (x 2 +y 2 — 1) is a prime ideal because it is generated 
by an irreducible polynomial. ■* 

Recall that a complex number is an algebraic integer if it is a root of a monic polynomial 
in Z[x]. Each algebraic integer has an irreducible polynomial associated with it. 
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Corollary 6.29. If a is an algebraic integer, then irr(o!, Q) lies in Z[x], 

Proof. Let p (x ) e Z[x] be the monic polynomial of least degree having a as a root. If 
p(x) — G(x)H(x) in Q[x], where deg(G) < deg (p) and deg (H) < deg(/?), then a is a 
root of either G(x) or H (x). By Gauss’s theorem, there is a factorization p(x) — g(x)h(x) 
in Z[x] with deg(g) = deg(G) and deg(/i) = deg(//); in fact, there are rationals c and d 
with gix) = cG(x) and h{x) = dH(x). If a is the leading coefficient of gix) and b is the 
leading coefficient of h(x), then ab — 1, for p(x ) is monic. Therefore, we may assume 
that a = 1 = b, for a, b e Z; that is, we may assume that both gix) and h(x) are monic. 
Since a is a root of gix) or h (x), we have contradicted p(x) being a monic polynomial in 
Z[x] of least degree having a as a root. It follows that p(x) = irr(a, Q), for the latter is 
the unique monic irreducible polynomial in Q[x] having a as a root. • 

Definition. If a is an algebraic integer, then its minimal polynomial is the monic poly¬ 
nomial in Z[x] of least degree having a as a root. 

Corollary 6.29 shows that every algebraic integer a has a unique minimal polynomial 
m(x) e Z[x ], namely, mix) — irr(a, Q), and mix) is irreducible in Q[x]. 

Remark. We define the (algebraic) conjugates of a to be the roots of irr(«, Q), and we 
define the norm of a to be the absolute value of the product of the conjugates of a. Of 
course, the norm of a is just the absolute value of the constant term of irr(oi, Q), and so it is 
an (ordinary) integer. Norms are very useful in algebraic number theory, as we have seen in 
the proof of Theorem 3.66: Fermat’s two-squares theorem. We have also considered them 
in the proof of Hilbert’s Theorem 90, which was used to prove that if the Galois group of a 
polynomial fix) e k[x~\ is solvable, where k has characteristic 0, then fix) is solvable by 
radicals. ■* 

The next criterion uses the integers mod p. 

Theorem 6.30. Let fix) = ao + a\x + ajx 2 + •••-)- x n e Z[x] be monic, and let p be 
a prime. If fix) is irreducible modp, that is, if 

fix) — [ao] + [«t]x + [a 2 ]x 2 H-h x n e F p [x], 

is irreducible, then fix) is irreducible in Q[x], 

Proof. By Proposition 3.48, the natural map ip: Z —> F ; , defines a homomorphism 
ip: Z[x] —> F p [x] by 

< p(bo + b\x + b 2 X 2 H-) = [£>o] + [b\\x + [b 2 \x 2 H-; 

that is, just reduce all the coefficients mod p. If gix) e Z[x], denote its image (p(gix)) e 
F p [x] by gix). Suppose that fix) factors in Z[x]; say, fix) — gix)hix), where deg(g) < 
deg(/) and deg(/z) < deg(/); of course, deg(/) = deg(g) + deg (h). Now fix) = 
gix)h)x), because ip is a ring homomorphism, so that deg(/) = deg(g) + deg(/;). Since 
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fix) is monic, fix) is also monic, and so deg(/) = deg (/). Thus, both gix) and hix) 
have degrees less than deg(/), contradicting the irreducibility of fix). Therefore, fix) 
is irreducible in Z[x], and, by Gauss’s theorem. Corollary 6.27, fix) is irreducible in 

QM. • 

Example 6.31. 

The converse of Theorem 6.30 is false. It is easy to find an irreducible polynomial fix) e 
Z[x] C Q[x] with f(x) factoring mod p for some prime p , but we now show that fix) — 
x 4 + 1 is an irreducible polynomial in Z[x] that factors mod p for every prime p. 

First, fix) is irreducible in Q[jc], By Corollary 3.44, fix) has no rational roots, and so 
the only possible factorization in Q[x] has the form 

x 4 + 1 = ix 1 + ax + b)ix 2 — ax + c). 

Multiplying out and equating like coefficients gives 

c + b — a 2 — 0 
aic — b) = 0 
be — 1 . 

The second equation forces a = 0 or c — b. and it is quickly seen that either possibility 
leads to a contradiction. 

We now show, for all primes p, that x 4 + 1 is not irreducible in F p [x], If p — 2, 
then x 4 + 1 = ix + 1) , and so we may assume that p is an odd prime. As we saw in 
Example 1.21 (i), every square is congruent to 0, 1, or 4 mod 8 ; since p is odd, we must 
have p 2 = 1 mod 8 . Therefore, |(F p 2 ) x | = p 2 — 1 is divisible by 8 . But (F p 2 ) x is a cyclic 
group, and so it has a (cyclic) subgroup of order 8 , by Lemma 2.85. It follows that ¥ p i 
contains all the 8 th roots of unity; in particular, F /; 2 contains all the roots of x 4 + 1. Hence, 
the splitting field E p of x 4 + 1 over F^ is F p 2 , and so [E p : F^,] = 2. But if x 4 + 1 were 
irreducible in F^IA], then 4 | [E p : F p ], by Corollary 4.9. Therefore, x 4 + 1 factors in 
F p [x] for every prime p. ◄ 

Theorem 6.30 says that if we can find a prime p with fix) irreducible in F p [x], then 
fix) is irreducible in Q[x]. Until now, the finite fields ¥ p have been oddities; F p has 
appeared only as a curious artificial construct. Now the finiteness of F p is a genuine 
advantage, for there are only a finite number of polynomials in F p [x] of any given degree. 
In Examples 3.35(i) and 3.35(ii), we displayed all the monic irreducible polynomials over 
F 2 and over F 3 of degree < 3. In principle, then, we can test whether a polynomial of 
degree n in F p [x] is irreducible by just looking at all the possible factorizations of it. 

Example 6.32. 

(i) We show that fix) — x 4 — 5x 3 + 2x + 3 is an irreducible polynomial in Q[x], 

By Corollary 3.44, the only candidates for rational roots of fix) are 1, —1, 3, —3, and 
the reader may check that none of these is a root. Since fix) is a quartic, we cannot yet 
conclude that fix) is irreducible, for it might be a product of (irreducible) quadratics. 
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Let us try the criterion of Theorem 6.30. Since fix) = x 4 + x 3 + 1 in F 2 M is ir¬ 
reducible, by Example 3.35(i), it follows that fix) is irreducible in Q[x], [It was not 
necessary to check that fix) has no rational roots; the irreducibility of fix) is enough to 
conclude the irreducibility of fix).] 

(ii) Let 3>5 (x) — x 4 + x 3 + x 2 + x + 1 e Q[x], 

In Example 3.35(i), we saw that <J> 5 (x) = x 4 + x 3 + x 2 + x + 1 is irreducible in F 2 U'], 
and so Osfx) is irreducible in Q[x], ◄ 

Recall that if n is a positive integer, then the nth cyclotomic polynomial is 

<*>«(*) = ]""[(* - f)> 

where f ranges over all the primitive nth roots of unity. 

By Proposition 1.37, for every integer n > 1, 

x n - 1 = Y\ 
d\n 


where d ranges over all the divisors d of n. Now <f> | (x) = x — 1. When p is prime, then 
x p — 1 = <t>i(x)<t> p (x) = (x — l)4>p(x), 


and so 

<J> p (x) = (x p — l)/(x — 1) = x p ~ l + x p ~ 2 + • • • + X + 1. 

As any linear polynomial, <l> 2 (x) = x + 1 is irreducible in Q[x]; the cyclotomic poly¬ 
nomial <f> 3 (x) = x 2 + x + 1 is irreducible in Q[x] because it has no rational roots; we have 
just seen that ^(x) is irreducible in Q[x]. Let us introduce another irreducibility criterion 
in order to prove that O p (x) is irreducible in Q[x] for all primes p. 

Lemma 6.33. Let g{x) e Z[x], If there is c e Z with g(x + c ) irreducible in Z[x], then 
g(x) is irreducible in Q[x], 

Proof. By Exercise 3.43 on page 149, the function <p: Z[x] -> Z[x], given by fix) i-> 
fix + c ), is an isomorphism. If g(x) = s(x)f (x), then g(x + c) = <pigix)) — <pist) = 
(pis)(pit) is a forbidden factorization of g(x + c). Therefore, g(x) is irreducible in Z[x] 
and hence, by Gauss’s theorem. Corollary 6.27, g(x) is irreducible in Q[x], • 

The next result was found by G. Eisenstein. The following elegant proof of Eisenstein’s 
criterion is in a 1969 paper of R. Singer; see Montgomery-Ralston, Selected Papers in 
Algebra. 

Theorem 6.34 (Eisenstein Criterion). Let R be a UFD with Q = Frac( R), and let 
fix) — ao + a\x + ■ ■ ■ + a n x n € R[x]. If there is an irreducible element p e R with p \ ai 
for all i < n but with p \ a n and p 2 \ ao, then fix) is irreducible in Q[x], 
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Proof. Let <p : Z[x] —>• F p [x] be the ring homomorphism that reduces coefficients mod p, 
and let f(x) denote <^(/(x)). If fix) is not irreducible in Q[x], then Gauss’s theorem. 
Corollary 6.27, gives polynomials g(x),h(x) e Z[x] with f(x) — g(x)h(x), where 
g(x) — b o + b\x + • • • + b m x m and h(x) = Co + C\x + • • • + c^x k . There is thus an 
equation fix) = g(x)h)x) in F p [x]. 

Since p does not divide a„, we have fix) f 0; in fact, fix) — ux n for some unit u e 
F p , because all its coefficients aside from its leading coefficient are 0. By Theorem 3.42, 
unique factorization in F p [x], we must have glx) — vx m , where v is a unit in F p , for 
any irreducible factor of g(x) is an irreducible factor of fix); similarly, li(x) — wx k , 
where w is a unit in F p . It follows that each of glx) and hlx) has constant term 0; that is, 
[fro] = 0 = [co] in ¥ p ; equivalently, p \ bo and p | co- But do — froco, and so p 2 \ cio , a 
contradiction. Therefore, fix) is irreducible in Q[x], • 

Of course, Eisenstein’s criterion holds for polynomials in Z[x], The generalization from 
Z to PIDs is instantaneous. 

Corollary 6.35 (Gauss). For every prime p, the pth cyclotomic polynomial <t> / ,(x) is 
irreducible in Q[x]. 

Proof Since Op(x) = (x p — l)/(x — 1), we have 
<S> p Ix + 1) = [(x + l) p - l]/x 



Since p is prime. Proposition 1.12 shows that Eisenstein’s criterion applies; we conclude 
that O p (x + 1) is irreducible in Q[x], By Lemma 6.33, ‘I’pCx) is irreducible in Q[x], • 

We do not say that x" _1 + x" -2 + • • • + x + 1 is irreducible when n is not prime. For 
example, x 3 + x 2 + x + 1 = (x + l)(x 2 + 1). 

Irreducibility of a polynomial in several variables is more difficult to determine than 
irreducibility of a polynomial of one variable, but here is one criterion. 

Proposition 6.36. Let k be a field and let fix i, ..., x„) be a primitive polynomial in 
R[x n ], where R — k[x i, ..., x„_ i ]. If f cannot be factored into two polynomials of lower 
degree in /?[x n ], then f is irreducible in k[x i,..., x„]. 

Proof Let us write /(xj, ..., x n ) — F(x„) if we wish to view / as a polynomial in 
R[x n ] (of course, the coefficients of F are polynomials in k[x i,..., x„_i]). Suppose that 
Fix,,) — G(x n )II(x n ); by hypothesis, the degrees of G and H (in x n ) cannot both be 
less than deg(E), and so one of them, say, G, has degree 0. It follows, because F is 
primitive, that G is a unit in k[x i,..., x„_i]. Therefore, fix i, ..., x„) is irreducible in 
R[x n ] = k[x i,...,x„]. • 

Of course, the proposition applies to any variable x;, not just to x„ . 
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Corollary 6.37. If k is a field and g(x i,..., x n ), h(x\, ..., x „) e k[x \...., x n ] are 
relatively prime, then fixi,..., x„, y ) — ygix, x„) + h(x i, ..., x„) is irreducible in 
k[x i,- x n ,y]. 

Proof. Let R — k[x i, • • • ,x n ]. Note that / is primitive in /?[y], because (g, h) = 1 
forces any divisor of its coefficients g, h to be a unit. Since / is linear in y, it is not the 
product of two polynomials in R[y] of smaller degree, and hence Proposition 6.36 shows 
that / is irreducible in /? [y] = k[x i ,..., x n , y], • 

For example, xy 2 + z is an irreducible polynomial in k[x,y,z\ because it is a primitive 
polynomial that is linear in x. 

Exercises 

6.17 Let R be a UFD and let Q — Frac(f?) be its fraction field. Prove that each nonzero a/b e Q 
has an expression in lowest terms; that is, a and b are relatively prime. 

6.18 Let R be a UFD. 

(i) If a, b, c G R and a and b are relatively prime, prove that a \ be implies a \ c. 

(ii) If a,c\,... ,c n € R and c,- [ ci for all i, prove that c \ a, where c = lcm{cj ,... ,c n }. 

6.19 If R is a domain, prove that the only units in R[x\ . x n ] are units in R. On the other hand, 

prove that 2x + 1 is a unit in I 4 |x]. 

6.20 Prove that a UFD R is a PID if and only if every nonzero prime ideal is a maximal ideal. 

6.21 (i) Prove that x and y are relatively prime in k[x. y\, where k is a field. 

(ii) Prove that 1 is not a linear combination of x and y in k[x , y], 

6.22 (i) Prove that Z[xj..... x«] is a UFD for all n > 1. 

(ii) If R is a field, prove that the ring of polynomials in infinitely many variables, R — 
k[x 1 , X 2 i ■ ■ ., x n ... ■], is also a UFD. 

Hint. We have not given a formal definition of R (it will be given in Chapter 8 ). 
but, for the purposes of this exercise, regard R as the union of the ascending chain 
k[x 1 ] C k[x 1 , xfi C • • ■ C k\x\ , X 2 . x n ] C ■ • • . 

6.23 Determine whether the following polynomials are irreducible in Q[x]. 

(i) f(x) = 3x 2 — lx — 5. 

(ii) fix) = 2 x 3 — x — 6. 

(iii) fix) = 8 x 3 — 6 x — 1 . 

(iv) fix) — x 3 + 6 x 2 + 5x + 25. 

(v) fix) = x 4 + 8 x + 12 . 

Hint. In F 5 L:], fix) = (x 4- l)g(x), where g(x) is irreducible. 

(vi) fix) - x 5 - 4x + 2. 

(vii) fix) = x 4 + x 2 + x + 1 . 

Hint. Show that fix) has no roots in F 3 and that a factorization of fix) as a product 
of quadratics would force impossible restrictions on the coefficients. 

(viii) fix) = x 4 — 10 x 2 + 1 . 

Hint. Show that fix) has no rational roots and that a factorization of fix) as a product 
of quadratics would force impossible restrictions on the coefficients. 
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6.24 Is x 5 + x + 1 irreducible in F 2 [je ] ? 

Hint. Use Example 3.35(i). 

6.25 Let /(x) = (x p — l)/(x — 1 ), where p is prime. Using the identity 

fix + 1 ) = x p ~ l + pq(x). 


where q{x) e Z[x] has constant term 1, prove that x p i p ^ + • • ■ 4- x p + 1 is irreducible in 
Q[x] for all n > 0 . 

6.26 (i) If a is a squarefree integer, prove that x n — a is irreducible in Q[x] for every n > 1. 

Conclude that there are irreducible polynomials in Q[x] of every degree n > 1. 

Hint. Use the Eisenstein criterion. 

(ii) If a is a squarefree integer, prove that !lfa is irrational for all n > 2. 

6.27 Let k be a field, and let f(x) — oq + a\x + • • • + a n x n e k[x] have degree n and nonzero 
constant term ag. Prove that if fix) is irreducible, then so is a n + a n _\x + • • • + flo x "- 

6.28 Let k be a field and let f{x\ , .... x n ) e k[x\ . x n \ be a primitive polynomial in R\x n \, 

where R = k[x\, ..., x n _ j ]. If / is either quadratic or cubic in x n , prove that / is irreducible 
in k[x 1 , .... x n ] if and only if / has no roots in k(x 1 , ..., x n _\). 

6.29 Let R be a UFD with Q = Frac(^). If f(x) e F[x], prove that f(x) is irreducible in i?[x] if 
and only if f{x) is primitive and fix) is irreducible in Q[x], 

6.30 Prove that 

fix, y) = xy 3 + x 2 y 2 - x 5 y + x 2 + 1 


is an irreducible polynomial in R[x, y]. 


6.31 Let D = det 


so that D lies in the polynomial ring Z[x, y, z, w]. 


x y 
_z w\ 

(i) Prove that ( D) is a prime ideal in Z[x, y, z, w]. 

Hint. Prove first that D is an irreducible element. 

(ii) Prove that Z[x, y, z, w]/(D) is not a UFD. Another example of a domain which is not 
a UFD is given in Example 6.21. 


6.3 Noetherian Rings 

One of the most important properties of k[x \, ..., x n ], when A: is a field, is that every ideal 
in it can be generated by a finite number of elements. This property is intimately related to 
chains of ideals, which we have already seen in the course of proving that PIDs are UFDs 
(I apologize for so many acronyms, but here comes another one!). 

Definition. A commutative ring R satisfies the ACC, the ascending chain condition , if 
every ascending chain of ideals 


h c I 2 c • • • c /„ c ■ ■ ■ 


stops', that is, the sequence is constant from some point on: there is an integer N with 

In — In+ 1 = In+2 — ■ ■ ■ ■ 
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Lemma 6.18(ii) shows that every PID satisfies the ACC. 

Here is an important type of ideal. 

Definition. If X is a subset of a commutative ring R, then the ideal generated by X is 
the set of all finite linear combinations 

I — ( X ) = | r i a i ■ e R and Xj e X J. 

finite 

We say that I is finitely generated , often abbreviated to f.g., if X = {a \,..., a n }; that is, 
every element in I is an /^-linear combination of the a,. We write 

I = (a i,..., a n ), 

and we call I the ideal generated by a\,... ,a n . 

A set of generators a \,..., a n of an ideal I is sometimes called a basis of I (even 
though this is a weaker notion than that of a basis of a vector space, for we do not assume 
that the coefficients r,- in the expression c — r,a, are uniquely determined by c). 

Of course, every ideal I in a PID is finitely generated, for it can be generated by one 
element. 

Proposition 6.38. The following conditions are equivalent for a commutative ring R. 

(i) R has the ACC. 

(ii) R satisfies the maximum condition : Every nonempty family T of ideals in R has a 
maximal element; that is, there is some Iq e T for which there is no I € IF with 
/() C I. 

(iii) Every ideal in R is finitely generated. 

Proof, (i) => (ii): Let T be a family of ideals in R, and assume that T has no maximal 
element. Choose l\ e T. Since I\ is not a maximal element, there is h e T with 

/1 C / 2 . Now li is not a maximal element in T, and so there is It, e T with h_ C f. 

Continuing in this way, we can construct an ascending chain of ideals in R that does not 
stop, contradicting the ACC. 

(ii) =£> (iii): Let I be an ideal in R, and define T to be the family of all the finitely 
generated ideals contained in I ; of course, F f 0 (for {0} e T). By hypothesis, there 
exists a maximal element M e T. Now M C I because M e T. If M C /, then there is 
a e I with a f M. The ideal 

J — [m + ra: m e M and r e R] c I 

is finitely generated, and so / e T\ but M C J, and this contradicts the maximality of M. 

Therefore, M = I , and so I is finitely generated. 

(iii) => (i): Assume that every ideal in R is finitely generated, and let 


h C I 2 C • • • C I n C • • • 
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be an ascending chain of ideals in R. By Lemma 6.18(i), the ascending union J — (J >j /„ 
is an ideal. 

By hypothesis, there are elements a,- e J with J = (a\, .... a q ). Now a; got into J by 
being in I nj for some If N is the largest n ,, then I, u C I N for all i; hence, a,- e /y for 
all i , and so 

J = (a\,, a q ) c /jv c 7. 

It follows that if n > N, then J = /,v C /„ c J, so that /„ = /; therefore, the chain stops, 
and R has the ACC. • 

We now give a name to a commutative ring that satisfies any of the three equivalent 
conditions in the proposition. 

Definition. A commutative ring R is called noetherian 4 if every ideal in R is finitely 
generated. 

We shall soon see that k[x i,..., x n ] is noetherian whenever A; is a field. On the other 
hand, here is an example of a commutative ring that is not noetherian. 

Example 6.39. 

Let R — iF(R) be the ring of all real-valued functions on the reals, under pointwise opera¬ 
tions (see Example 3.7). It is easy to see, for every positive integer n, that 

/„ = {/: R -> R : /(x) = 0 for all x > n} 

is an ideal and that I n C I n+ i for all n. Therefore, R does not satisfy the ACC, and so R is 
not noetherian. < 

Here is an application of the maximum condition. 

Corollary 6.40. If I is a proper ideal in a noetherian ring R, then there exists a maximal 
ideal M in R containing I. In particular, every noetherian ring has maximal ideals . 5 

Proof. Let T be the family of all those proper ideals in R which contain /; note that 
T ^ 0 because led 7 . Since R is noetherian, the maximum condition gives a maximal 
element M in IF. We must still show that M is a maximal ideal in R (that is, that M is a 
maximal element in the larger family T' consisting of all the proper ideals in IF). Suppose 
there is a proper ideal J with M C. J. Then / C J, and so J e IF', therefore, maximality 
of M gives M — J , and so M is a maximal ideal in R. • 

Here is one way to construct a new noetherian ring from an old one. 

l Tho name honors Emmy Noether (1882-1935). who introduced chain conditions in 1921. 

-This corollary is true without assuming R is noetherian, but the proof of the general result needs Zorn's 
lemma; see Theorem 6.46. 



Sec. 6.3 Noetherian Rings 


343 


Corollary 6.41. If R is a noetherian ring and I is an ideal in R, then R/I is also noethe¬ 
rian. 

Proof. If A is an ideal in R/I, then the correspondence theorem provides an ideal J 
in R with J/I = A. Since R is noetherian, the ideal J is finitely generated, say, J = 

{b \,..., b n ), and so A — J/I is also finitely generated (by the cosets b\ + I . b n + I). 

Therefore, R // is noetherian. • 

The following anecdote is well known. Around 1890, Hilbert proved the famous Hilbert 
basis theorem, showing that every ideal in C[xi,..., x„] is finitely generated. As we 
will see, the proof is nonconstructive in the sense that it does not give an explicit set of 
generators of an ideal. It is reported that when P. Gordan, one of the leading algebraists 
of the time, first saw Hilbert’s proof, he said, “This is not mathematics, but theology!” On 
the other hand, Gordan said, in 1899 when he published a simplified proof of Hilbert’s 
theorem, “I have convinced myself that theology also has its advantages.” 

The proof of the Hilbert basis theorem given next is due to H. Sarges (1976). 

Theorem 6.42 (Hilbert Basis Theorem). If R is a commutative noetherian ring, then 
/?[x] is also noetherian. 

Proof. Assume that I is an ideal in /?[x] that is not finitely generated; of course, I f {0}. 
Define fo(x) to be a polynomial in I of minimal degree and define, inductively, f„ + \ (x) 
to be a polynomial of minimal degree in I — (/o, ...,/„). It is clear that 

deg(/o) < deg(/i) < deg(/ 2 ) < • • • . 

Let a n denote the leading coefficient of /„ (x). Since R is noetherian. Exercise 6.32 on 
page 344 applies to give an integer m with a m +\ e (ao,..., a m )\ that is, there are r ,• e R 
with a m + j = roao + ■ ■ ■ + r m a m . Define 


/*(*) = fm+lix) - J^X^^nfiix), 
i= 0 

where d t — deg(/,). Now f*(x) el- (fo(x) - - otherwise f n +\(x) e 

ifo(x), ..., f m (x )). It suffices to show that deg(/*) < deg(/ m+ i), for this contradicts 
/„,+i(x) having minimal degree among polynomials in I that are not in (/o,..., /,„). If 
fi (x) = aix di + lower terms, then 

m 

f*{x) = fm+l(x) - x d m+l -di n fj(x) 

i=0 

m 

= (a m +ix dm+l + lower terms) — x d " ,+l ~ dj r, ( aix di + lower terms). 

1=0 

The leading term being subtracted is thus Yl'iLo r i a iX dm+] — a m +\x dm + l . • 
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Corollary 6.43. 

(i) Ifk is afield, then k[x i, ..., x n ] is noetherian. 

(ii) The ring Z[xi, ..., x n ] is noetherian. 

(iii) For any ideal I in k[x\, ..., x n ], where k = Z or k is a field, the quotient ring 
k[x i , ..., x n ]/I is noetherian. 

Proof. The proofs of the first two items are by induction on n > 1, using the theorem, 
while the proof of item (iii) follows from Corollary 6.41. • 


Exercises 


6.32 Let R be a commutative ring. Prove that R is noetherian if and only if, for every sequence 
a\,ci 2 ,.of elements in R. there is an integer m > 1 with a m +\ an R -linear combina¬ 
tion of its predecessors; that is, there are r\ ,..., r m e f?withfl,„_|_i =r\a \ + • ■ • + r m a m . 

6.33 (i) Give an example of a noetherian ring R containing a subring that is not noetherian. 

(ii) Give an example of a commutative ring R containing proper ideals I C J C R with J 
finitely generated but with I not finitely generated. 


6.34 Let R be a noetherian domain such that every a, b e R has a gcd that is an .R-linear combi¬ 
nation of a and b. Prove that R is a PID. (The noetherian hypothesis is necessary, for there 
exist non-noetherian domains, called Bezout rings, in which every finitely generated ideal is 
principal.) 

Hint. Use induction on the number of generators of an ideal. 

6.35 Give a proof that every nonempty family T of ideals in a PID R has a maximal element 
without using Proposition 6.38. 


6.36 Example 6.39 shows that R = JF(R), the ring of all functions on R under pointwise operations, 
does not satisfy the ACC. 

(i) Show that the family of ideals {/„ : n > 1) in that example does not have a maximal 
element. 


(ii) Define 


fn(x) = 


if x < n 
if x > n , 


and define J n — (/), ..., /„)• Prove that J* — (J«>l fi I s an ideal that is not finitely 
generated. 


6.37 


If R is a commutative ring, define the ring of formal power series in several variables induc¬ 
tively: 

*[[*!, ...,*„+!]] = A \\x„+\ II- 


where A = f?[[xi,..., x n ]]. 

Prove that if R is a noetherian ring, then i?[[xi. ..., x n ]], is also a noetherian ring. 

Hint. Use Exercise 3.54(i) on page 151 if n — 1; use the proof of the Hilbert basis theorem 
when n > 1 , but replace the degree of a polynomial by the order of a power series (where the 
order of a nonzero power series CjX 1 is n if n is the smallest i with Cj 0). 
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6.38 Let 


S 2 = {{a, b, c) 6 R 3 : a 2 + b 2 + c 2 = 1} 


be the unit sphere in R , and let 


/ = {/(x, z) e R[x, y, z ]: f {a, b,c) = 0 for all (a, b, c) e 5“}. 


Prove that / is a finitely generated ideal in R[x, y, z]. 

6.39 If R and S are noetherian, prove that their direct product R x 5 is also noetherian. 

6.40 If R is a commutative ring that is also a vector space over a field k, then R is called a commu¬ 
tative k-algebra if 

(au) v = a(uv ) = u(av) 

for all a e k and u.v e R. Prove that every commutative k-algebra that is finite-dimensional 
over k is noetherian. 


6.4 Applications of Zorn’s Lemma 

Dealing with infinite sets may require some appropriate tools of set theory. 

Definition. If A is a set, let V(A )* denote the family of all its nonempty subsets. The 
axiom of choice states that if A is a nonempty set, then there exists a function ft : V(A ) # —» 
A with ft(S) e S for every nonempty subset S of A. Such a function fi is called a choice 
function. 

Informally, the axiom of choice is a harmless looking statement; it says that we can 
simultaneously choose one element from each nonempty subset of a set. 

The axiom of choice is easy to accept, and it is one of the standard axioms of set theory. 
Indeed, the axiom of choice is equivalent to the statement that the cartesian product of 
nonempty sets is itself nonempty (see Proposition A.l in the Appendix). However, the 
axiom is not convenient to use as it stands. There are various equivalent forms of it that are 
more useful, the most popular of which are Zorn’s lemma and the well-ordering principle. 

Recall that a set A is a partially ordered set if there is a relation x < y defined on X 
that is reflexive, antisymmetric, and transitive. 

We introduce some definitions to enable us to state the well-ordering principle. 

Definition. A partially ordered set X is well-ordered if every nonempty subset S of X 
contains a smallest element ; that is, there is sq e S with 

so < s for all s e S. 

The set of natural numbers N is well-ordered (this is precisely what the least integer 
axiom in Chapter 1 states), but the set Z of all integers is not well-ordered because Z itself 
is a subset having no smallest element. 

Well-ordering principle. Every set X has some well-ordering of its elements. 
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If A happens to be a partially ordered set, then a well-ordering, whose existence is 
asserted by the well-ordering principle, may have nothing to do with the original partial 
ordering. For example, Z can be well-ordered: 

We will be able to state Zorn’s lemma after the following definitions. 

Definition. An element m in a partially ordered set A is a maximal element if there is no 
x e X for which m < x; that is. 


if m < x, then m = x. 

Recall that an upper bound of a nonempty subset Y of a partially ordered set X is an 
element xo e X, not necessarily in Y, with y < xq for every y e Y. 

Example 6.44. 

(i) A partially ordered set may have no maximal elements. For example, R, with its usual 
ordering, has no maximal elements. 

(ii) A partially ordered set may have many maximal elements. For example, if X is the 
partially ordered set of all the proper subsets of a set U , then a subset S is a maximal 
element if and only if S — U — {«} for some it e U\ that is, S is the complement of a 
point. 

(iii) If X is the family of all the proper ideals in a commutative ring R , partially ordered by 
inclusion, then a maximal element in A is a maximal ideal. < 

Zorn’s lemma gives a condition that guarantees the existence of maximal elements. 

Definition. A partially ordered set A is a chain if, for all x,y e A, either x < y or 

y < x. 

The set of real numbers R is a chain if one takes x < y to be the usual inequality x < y. 

Zorn’s lemma. If X is a nonempty partially ordered set in which every chain has an 
upper bound in X, then X has a maximal element. 

Theorem. The following statements are equivalent. 

(i) Zorn’s lemma. 

(ii) The well-ordering principle. 

(iii) The axiom of choice. 

Proof. See the Appendix. • 

Henceforth, we shall assume, unashamedly, that all these statements are true, and we 
will use any of them whenever convenient. 

The next proposition is frequently used when verifying that the hypothesis of Zorn’s 
lemma does hold. 
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Proposition 6.45. If C is a chain and S — {x \, ..., x n } C C, then there exists some x,-, 
for 1 < i < n, with xj < X; for all xj e S. 

Proof. The proof is by induction on n > 1. The base step is trivially true. Let S = 
{xi,..., x„+i}, and define S' = {xi,..., x„j. The inductive hypothesis provides x,, for 
1 < i < n, with Xj < x; for all xj e S'. Since C is a chain, either x, ^ x n +\ or x n+ \ < x/. 
Either case provides a largest element of S. • 

Here is our first application of Zorn’s lemma. 

Theorem 6.46. If R is a nonzero commutative ring, then R has a maximal ideal. Indeed, 
every proper ideal I in R is contained in a maximal ideal. 

Proof. The second statement implies the first, for if R is a nonzero ring, then the ideal 
(0) is a proper ideal, and so there exists a maximal ideal in R containing it. 

Let X be the family of all the proper ideals containing I (note that X 0 because 
I e X), and partially order X by inclusion. It is easy to see that a maximal element of X 
is a maximal ideal in R: There is no proper ideal strictly containing it. 

Let C be a chain of X\ thus, given I, J e C, either I C J or J C /. We claim that 
I* — |J /sC I is an upper bound of C. Clearly, I C I* for all I e C, so that it remains 
to prove that I* is a proper ideal. The argument that I* is an ideal is, by now, familiar. 
Finally, we show that I* is a proper ideal. If I* = R, then 1 e /*; now 1 got into I* 
because 1 e / for some I e C, and this contradicts I being a proper ideal. 

We have verified that every chain of X has an upper bound. Hence, Zorn’s lemma 
provides a maximal element, as desired. • 

Remark. Theorem 6.46 would be false if the definition of ring R did not insist on R 
containing 1. An example of such a “ring without unit” is any additive abelian group G 
with multiplication defined by ah — 0 for all a. h e G. The usual definition of ideal 
makes sense, and it is easy to see that the ideals in G are its subgroups. Thus, a maximal 
ideal I is just a maximal subgroup, which means that G/I has no proper subgroups, by 
the correspondence theorem. Thus, G/I is a simple abelian group; that is, G/7 is a finite 
group of prime order. In particular, take G — Q as an additive abelian group, and equip 
it with the zero multiplication. The reader can show that Q has no nonzero finite quotient 
groups, so that it has no maximal subgroups. Therefore, this “ring without unit” has no 
maximal ideals. -4 

We emphasize the necessity of checking, when applying Zorn’s lemma to a partially 
ordered set X, that X be nonempty. For example, a careless person might claim that Zorn’s 
lemma can be used to prove that there is a maximal uncountable subset of Z. Define X to 
be the set of all the uncountable subsets of Z, and partially order X by inclusion. If C is a 
chain in X, then it is clear that the uncountable subset S* = Uscc ' s an upper bound of 
C, for S C .S'* for every S e C. Therefore, Zorn’s lemma provides a maximal element of 
X, which must be a maximal uncountable subset of Z. The flaw, of course, is that X — 0 
(for every subset of a countable set is itself countable). 
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Here is our second application of Zorn’s lemma. We begin by generalizing the usual def¬ 
inition of a basis of a vector space so that it applies to all, not necessarily finite-dimensional, 
vector spaces. 

Definition. Let V be a vector space over some field k, and let Y C V be an infinite 
subset. 6 

(i) Y is linearly independent if every finite subset of Y is linearly independent. 

(ii) Y spans V if each v e V is a linear combination of finitely 7 many elements of Y. We 
write V = (Y) when V is spanned by Y. 

(iii) A basis of a vector space V is a linearly independent subset that spans V. 

Thus, an infinite subset Y — {y, : i e 1} is linearly independent if, whenever «/V; = 
0 (where only finitely many a, ^ 0), then a, = 0 for all i. 

Example 6.47. 

Let k be a field, and let V = k[x] regarded as a vector space over k. We claim that 

Y = {1, x, x 2 ,..., x n ,.. .} 

is a basis of V. Now Y spans V , for any polynomial of degree cl is a ^-linear combi¬ 
nation of 1, x, x 2 ,x d . Also, Y is linearly independent, because there are no scalars 
ao,ai,..., a n , not all 0, with = 0 (a polynomial is the zero polynomial pre¬ 

cisely if all its coefficients are 0). Therefore, Y is a basis of V. ◄ 


Theorem 6.48. Every vector space V over afield F has a basis. Indeed, every linearly 
independent subset B of V is contained in a basis ofV\ that is, there is a subset B' so that 
BUB' is a basis ofV. 

Proof. Note that the first statement follows from the second, for B = 0 is a linearly 
independent subset contained in a basis. 

Let X be the family of all the linearly independent subsets of V that contain B. The 
family X is nonempty, for B e X. Partially order X by inclusion. We use Zorn’s lemma 
to prove the existence of a maximal element in X. Let B — {B j : j e J] be a chain of 
X. Thus, each B/ is a linearly independent subset containing B and, for all i , j e J , either 
Bj C If or If C Bj. It follows from Proposition 6.45 that if Bj ] ,..., Bj n is any finite 
family of Bj’ s, then one contains all of the others. 

Let B* — U/s 7 Bj. Clearly, B* contains B and Bj C B* for all j e J. Thus, B* is 
an upper bound of B if it belongs to X\ that is, if B* is a linearly independent subset of 
V. If B* is not linearly independent, then it has a finite subset y (| ,..., y, m that is linearly 
dependent. How did y, ( . get into B*1 Answer: y lk e Bj k for some index jk ■ Since there are 
only finitely many yj k , there exists B /0 containing all the If ,; that is, y,-.. y, m e Bj 0 . 

^ When dealing with infinite bases, it is more convenient to work with subsets instead of lists. 

7 Only finite sums of elements in V are allowed. Without limits, convergence of infinite series does not make 
sense, and so a sum with infinitely many nonzero terms is not defined. 
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But B j 0 is linearly independent, by hypothesis, and this is a contradiction. Therefore, B* 
is an upper bound of the simply ordered subset B. We have verified that every chain of X 
has an upper bound. Hence, Zorn’s lemma applies to say that there is a maximal element 
in X. 

Let M be a maximal element in X. Since M is linearly independent, it suffices to 
show that it spans V (for then M is a basis of V containing B ). If M does not span 
V. then there is i>o € V with r>o f (M>, the subspace spanned by M. Consider the subset 
M* = MU{i>o}. Clearly, M C M*. Now M* is linearly independent: if aovo+jfaiyi — 0, 
where y,- e M and ciq, ai e F are not all 0, then ao / 0 (otherwise the collection of y,- 
appearing in the equation would be linearly dependent, a contradiction). But if ciq f 0, 
then vo — — aL 1 yfaiyt, contradicting vo f (M). Therefore, M is a basis of V. The last 
statement follows if we define B' = M — B. • 

Recall that a subspace W of a vector space V is a direct summand if there is a subspace 
W' of V with {0} = W C\W' and V = W + W’ (i.e., each v e V can be written v — w + w', 
where w e W and w' e W'). We say that V is the direct sum of W and W\ and we write 
V = W © W’. 


Corollary 6.49. Every subspace W of a vector space V is a direct summand. 

Proof. Let B be a basis of IV. By the theorem, there is a subset B’ with B U /(' a basis 
of V. It is straightforward to check that V — W © (B 1 ), where (fi') denotes the subspace 
spanned by B’. • 

The ring of real numbers K is a vector space over Q; a basis is usually called a Hamel 
basis, and it is useful in constructing analytic counterexamples. For example, we may use 
a Hamel basis to prove the existence of a discontinuous function / : R —»• M that satisfies 
the functional equation f(x + y) = fix) + /(y). 8 

Example 6.50. 

An inner product on a vector space V over a field k is a function 

V x V -» k, 

whose values are denoted by (v, w), such that 

(i) (v + v',w) — ( v, w) + (v', vo) for all v,v',we V; 

8 Here is a sketch of a proof, using infinite cardinal numbers, that such discontinuous functions / exist. As 
in the finite-dimensional case, if B is a basis of a vector space V, then any function / : B —> V extends to a 
linear transformation F : V —> V (see Proposition 7.49); namely, F(J2 = 5Zr, /(^,). A Hamel basis has 
cardinal c = |R|, and so there are c c = 2 C > c functions / : R —> R satisfying the functional equation, for 
every linear transformation is additive. On the other hand, every continuous function on R is determined by its 
values on Q, which is countable. It follows that there are only c continuous functions on R. Therefore, there exist 
discontinuous functions / : R —> R satisfying the functional equation f(x + y) = fix ) + f{y). 

We have just proved that there exists a discontinuous / : R —> R such that f(x-\-y) = fix) + fiy) for all 
x, y e R; that is, there is some a e R with / discontinuous at a. Thus, there is some e > 0 such that, for every 
<5 > 0, there is a b e R with \b—a \ < 8 and | fib)—fid) \ > e. Let us show that / is discontinuous at every cel. 
The identity b — a = ib-\-c — a) — c gives \ib+c — a) — c\ < 8, and the identity fib+c—a) — /(c) = fib) — f(a) 
gives | fib + c - a) - /(c) | > €. 
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(ii) (oiv, w) — ot(v, w ) for all v, w e V and a e k; 

(iii) ( v, w ) = ( w , v) for all v, w e V. 

We say that the inner product is definite if (v, v) 0 whenever v 0. 

We are now going to use a Hamel basis to give a definite inner product on R all of 
whose values are rational. Regard R as a vector space over Q, and let Y he a basis. Using 
0 coefficients if necessary, for each v, w e R, there are y; e Y and rationals a,- and bj with 
v = ai y, and w — bj y; (the nonzero a,- and nonzero bj are uniquely determined by 
v and w, respectively). Define 

(v, w) — y^ajbr, 

note that the sum has only finitely many nonzero terms. It is routine to check that we have 
defined a definite inner product. -4 

There is a notion of dimension for infinite-dimensional vector spaces; of course, dimen¬ 
sion will now be a cardinal number. In the following proof, we shall cite and use several 
facts about cardinals. We denote the cardinal number of a set X by |X|. 

Fact I. Let X and Y be sets, and let f: X —> Y be a function. If is finite for 

every y e Y, then |X| < Rol^l/ hence, ifY is infinite, then |X| < |7|. 

See Kaplansky, Set Theory and Metric Spaces; since X is the disjoint union X = 
UveF / -1 (>0. this result follows from Theorem 16 on page 43. 

Fact II. If X is an infinite set and Fin(X) is the family of all its finite subsets, then 
|Fin(X)| = |X|. 

See Kaplansky, Set Theory and Metric Spaces; this result also follows from Theorem 16 
on page 43. 

Fact III (Schroeder-Bernstein Theorem). If X and Y are sets with \X\ < Y\ and 

\Y\ < \X\, then \X\ = \Y\. 

See Birkhoff-Mac Lane, A Survey of Modern Algebra, page 387. 

Theorem 6.51. Let k be afield and let V be a vector space over k. 

(i) Any two bases ofV have the same number of elements (that is, they have the same 
cardinal number ); this cardinal is called the dimension of V and it is denoted by 
dim(V). 

(ii) Vector spaces V and V' over k are isomorphic if and only if dim( V) = dim(y / ). 

Proof (i) Let B and B' be bases of V. If B is finite, then V is finite-dimensional, and 
hence B' is also finite (Corollary 3.90); moreover, we have proved, in Theorem 3.85, that 
\B\ = \B'\. Therefore, we may assume that both B and B' are infinite. 

Each v e V has a unique expression of the form v = ^ otbh, where a/, e k and 

almost all ab = 0. Define the support of v (with respect to B) by 


supp(u) = {k B : ffij / 0}; 
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thus, supp(i>) is a finite subset of B for every v e V. Define f: B' Fin(Z?) by f(b') = 
supp(£>'). Note that if supp(£/) = {b i, ..., b n ], then b' e (b 1 ,..., b n ) = (supp(//)), the 
subspace spanned by supp(/r'). Since (supp((/)] has dimension n, it contains at most n 
elements of B ', because B' is independent (Corollary 3.88). Therefore, f~ l (T ) is finite 
for every finite subset T of B [of course, f~ l (T) = 0 is possible]. By Fact I, we have 
\B'\ < Fin(/i)|, and by Fact II, we have \B'\ < \B\. Interchanging the roles of B and B' 
gives the reverse inequality |B| < \B'\, and so Fact III gives B = \B'\. 

(ii) Adapt the proof of Corollary 3.105, the finite-dimensional version. • 

The next application is a characterization of noetherian rings in terms of their prime 
ideals. 

Lemma 6.52. Let R be a commutative ring and let T be the family of all those ideals in 
R that are not finitely generated. If T 0, then T has a maximal element. 

Proof. Partially order T by inclusion. It suffices, by Zorn’s lemma, to prove that if 
C is a chain in T, then I* — U/sC ^ ' s not finitely generated. If, on the contrary, 
I* — (ai,..., a„), then aj e Ij for some Ij e C. But C is a chain, and so one of 
the ideals I \,..., call it Iq, contains the others, by Proposition 6.45. It follows that 
I* — (ai,..., a n ) c /o. The reverse inclusion is clear, for I C I* for all I e C. There¬ 
fore, Iq — I* is finitely generated, contradicting /q € T. • 

Theorem 6.53 (I. S. Cohen). A commutative ring R is noetherian if and only if every 
prime ideal in R is finitely generated. 

Proof. Only sufficiency needs proof. Assume that every prime ideal is finitely generated. 
Let T be the family of all ideals in R that are not finitely generated. If T ^ 0, then the 
lemma provides an ideal I that is not finitely generated and that is maximal such. We will 
show that I is a prime ideal; with the hypothesis that every prime ideal is finitely generated, 
this contradiction will show that T — 0; that is, that R is noetherian. 

Suppose that ab e I but a f. I and b ^ I. Since a f. /, the ideal I + Ra is strictly 
larger than /, and so I + Ra is finitely generated; indeed, we may assume that 


I + Ra — (i\ + rya, ...,/„+ r n a), 

where 4 e I and rj e R for all k. Consider J — (I : a) — {x e R : xa e I}. 
Now I + Rb c /; since b f. /, we have I C /, and so J is finitely generated. We 
claim that I = (i i,..., i n , Ja). Clearly, (i i,..., i n , Ja) C /, for every 4 e I and 
J a c /, For the reverse inclusion, if z e I C I + Ra, there are m- e R with z = 
J2k u k(lk + rka). Then {J2 k u k r k )a — z - Jf k u k i k e /, so that ff k u k r k e J. Hence, 
z = "}2 k u k i k + Cff k u k r k )a e (i i,..., i n , Ja). It follows that I = (i i, ..., i n , Ja) is 
finitely generated, a contradiction, and so I is a prime ideal. • 

W. Krull has proved that every noetherian ring has the DCC on prime ideals (see Corol¬ 
lary 11.163). 
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Our next application involves algebraic closures of fields. Recall that a field extension 
K/k is algebraic if every a e K is a root of some nonzero polynomial fix) e k[x\, that 
is, K/ k is an algebraic extension if every element a e K is algebraic over k. 

We have already discussed algebraic extensions in Proposition 3.117 on page 185, and 
the following proposition will add a bit more. 

Proposition 6.54. Let K/k be an extension. 

(i) If z £ K, then z is algebraic over k if and only ifk(z)/k is finite. 

(ii) If z i, Z2, • ■ •, Zn £ K are algebraic over k, then k(zi, Z 2 , ■ ■ •, z n )/k is a finite exten¬ 
sion. 

(iii) If y,z € K are algebraic over k, then y + z, yz, and y -1 (for y 0) are also 
algebraic over k. 

(iv) Define 

K a ig = [z e K : z is algebraic over k}. 

Then K-^g is a subfield of K. 

Proof, (i) If k(z)/k is finite, then Proposition 3.117(i) shows that z is algebraic over k. 
Conversely, if z is algebraic over k, then Proposition 3.117(v) shows that k(z)/k is finite. 

(ii) We prove this by induction on n > 1; the base step is part (i). For the inductive step, 
there is a tower of fields 

k c k(zi ) c k(z\,zi) c ••• c k(zt,.. -,z n ) c k(zi,. ■ -,z n + 1 ). 

Now [£(z„+i) : k] is finite, and we have [k(z \,..., z n ) : k] finite, by the inductive hypoth¬ 
esis. Indeed, [A'(z„+i) : k] — d, where d is the degree of the monic irreducible polynomial 
in £[jc] having z n +1 as a root (by Proposition 3.117). But if ’, 1+ i satisfies a polynomial 
of degree d over k , then it satisfies a polynomial of degree d' < d over the larger field 
F = k(z,\,... ,z n )- We conclude that 

[k(z\ -- z n + 1 ) : k(z\,... ,z n )} = [F(z n + 1 ) : F] < [^(z„+i) : k]. 


Therefore, 


[k(zu ..., z„+ 1 ) : k] = [F(zn+ 1 ) : k] = [F(z n +i ) : F][F : k] 

is finite. 

(iii) Now k(y,z)/k is finite, by part (ii). Therefore, k(y + z) C k(y,z ) and k(yz) C 
k(y,z) are also finite, for any subspace of a finite-dimensional vector space is itself finite¬ 
dimensional [Corollary 3.90(i)]. By part (i), y + Z, yz, and C 1 are algebraic 
over k. 

(iv) This follows at once from part (iii). • 
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Definition. Given the extension C/Q, define the algebraic numbers by 

A = C a i g . 

Thus, A consists of all those complex numbers that are roots of nonzero polynomials in 
Q[x], and the proposition shows that A is a subfield of C that is algebraic over Q. 

Example 6.55. 

We claim that A/Q is an algebraic extension that is not finite. Suppose, on the contrary, that 
[A : Q] = n, for some integer n. Now there exists an irreducible polynomial p{x ) e Q[x] 
of degree n + 1; for example, take p(x) = x" +1 — 2. If a is a root of p(x), then a e A, 
and so Q(a) C A. Thus, A is an n -dimensional vector space over Q containing an (n + 1)- 
dimensional subspace, and this is a contradiction. -4 

Lemma 6.56. 

(i) Ifkz_ K C E is a tower of fields with E/K and K/k algebraic, then E/k is also 
algebraic. 

(ii) Let 

Kq c K x c • • • c K n c K n+1 c • • • 

be an ascending tower of fields; if K n+ \/K n is algebraic for all n > 0, then K* = 
U„>o K„ is a field that is algebraic over Kq. 

(iii) Let K = k{A); that is, K is obtained from k by adjoining the elements in a set A. If 
each element a e A is algebraic over k, then K/k is an algebraic extension. 

Proof, (i) Let e e E; since E/K is algebraic, there is some f(x) — ^ ; A (I aix 1 e 
A[x] having e as a root. If F — k(ao, ..., a„), then e is algebraic over F, and so 
k(ao, ..., a n , e) — F(e) is a finite extension of F\ that is, [ F(e ) : F] is finite. Since 
K/k is an algebraic extension, each a, is algebraic over k, and Corollary 3.90 on page 170 
shows that the intermediate field F is finite-dimensional over k\ that is, [A : k] is finite. 

[k(a 0 , ...,a n ,e):k] = [F(e) : k] = [F(e) : F][F : k] 

is finite, and so e is algebraic over k, by Proposition 6.54(i). We conclude that E/k is 
algebraic. 

(ii) If v, z e K*, then they are there because y e K m and z e K n \ we may assume that 
m < n, so that both y, z e K„ C K* . Since K„ is a field, it contains y + z, yz , and y -1 if 
y 0. Therefore, K* is a field. 

If z € K*, then z must lie in K n for some n. But K„/Kq is algebraic, by an obvious 
inductive generalization of part (i), and so z is algebraic over Kq. Since every element of 
K* is algebraic over Kq, the extension K*/Kq is algebraic. 

(iii) Let z € k(A); by Exercise 3.95 on page 197, there is an expression for z involv¬ 
ing k and finitely many elements of A; say, ai,..., a m . Hence, z e k(a\ ,..., a m ). By 
Proposition 6.54(ii), k(z)/k is finite and hence z is algebraic over k. • 
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Definition. A field K is algebraically closed if every nonconstant fix) e K [x | has a root 
in K. An algebraic closure of a field k is an algebraic extension k of k that is algebraically 
closed. 

The algebraic closure of Q turns out to be the algebraic numbers: Q = A. The funda¬ 
mental theorem of algebra says that C is algebraically closed; moreover, C is an algebraic 
closure of R. We have already given an algebraic proof. Theorem 4.49, but perhaps the sim¬ 
plest proof of this theorem is by Liouville’s theorem in complex variables: Every bounded 
entire function is constant. If fix) e C[x] had no roots, then 1 /fix) would be a bounded 
entire function that is not constant. 

There are two main results here. First, every field has an algebraic closure; second, any 
two algebraic closures of a field are isomorphic. Our proof of existence will make use of a 
“big” polynomial ring: We assume that if k is a field and T is an infinite set, then there is 
a polynomial ring k[T) having one variable for each t e T. (We have already constructed 
k[T] when T is finite, and the infinite case is essentially a union of k[U], where U ranges 
over all the finite subsets of T. A construction of k[T) for infinite T will be given in 
Exercise 9.93 on page 756.) 

Lemma 6.57. Let k be afield, and let k[T] be the polynomial ring in a set T of variables. 
If t\,... ,t n € T are distinct and if fit,) G k[t,] C k[T] are nonconstant polynomials, 
then the ideal I = (/i(fi), ..., /„(f,,)) in k[T] is a proper ideal. 

Remark. If n = 2, then f) it\) and fiUi) are relatively prime, and this lemma says that 
1 is not a linear combination of them. -4 

Proof. If I is not a proper ideal in k[T], then there exist h, )T) e k[T] with 
l = hiIT)Mti) + ■ ■ ■ + h„IT)f n It n ). 

Consider the field extension k(a \,..., a n ), where a, is a root of fi(tj) for i = 1,... ,n 
(the ft are not constant). Denote the variables involved in the /;, (T) other than t\,..., t n , if 
any, by t n +\, ..., t m . Evaluating when t, = a ; if i < n and tj = 0 if i > n + 1 (evaluation 
is a ring homomorphism k[T] -> k(a i, ..., a„)), the right side is 0, and we have the 
contradiction 1=0. • 

Theorem 6.58. Given afield k, there exists an algebraic closure k ofk. 

Proof. Let T he a set in bijective correspondence with the family of nonconstant poly¬ 
nomials in k[x). Let R — k[T] be the big polynomial ring, and let I be the ideal in R 
generated by all elements of the form fltf), where tfeT; that is, if 

fix) — x n + a n -\x n ~ l H-1% 

where «; e k, then 

/ Itf) — Itf )" + a n -\(tf) n ~ l H-h oo- 
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We claim that the ideal I is proper; if not, 1 6 /, and there are distinct t \.... ,t„ e T 
and polynomials h\(T ),..., h n (T ) e Ar[T] with 1 = /zi(T)/i(fi) + • • • + h n (T)f n (t n ), 
contradicting the lemma. Therefore, there is a maximal ideal M in R containing /, by 
Theorem 6.46. Define K — R/M. The proof is now completed in a series of steps. 

(i) K / k is afield extension. 

We know that K — R/M is a field because M is a maximal ideal. Moreover, the ring 
map 6, which is the composite 

k -4 k[T] = R 44 R/M = K, 

(where i is the inclusion) is not identically 0 because 1 i—> 1, and hence 9 is injective, by 
Corollary 3.53. We identify k with im0 C K. 

(ii) Every nonconstant f(x) e A;[x] splits in K[x ]. 

By definition, there is tf e T with fit/) e I c M. and the coset tf + M e R/M — K 
is a root of fix). It now follows by induction on degree that fix) splits over K. 

(iii) The extension K/k is algebraic. 

By Lemma 6.56(iii), it suffices to show that each tf + M is algebraic over k [for K = 
&(all tf + M)]; but this is obvious, for tf is a root of f(x) e k[x\. 

(iv) K is algebraically closed 

Let g(x ) e K [x ] and let E — K (ot \,..., a m ) be a splitting field of g(x) over K. We 
have a tower of fields k C K C E in which K/k and E/K are algebraic extensions. By 
Lemma 6.56(i), E/k is an algebraic extension. Hence, p(x) — irr(aq,£) e k[x ]. By 
item (ii), p(x) splits over K, so that {«|,..., u m | C K ; that is, E C K. Therefore, g(x) 
splits in K [a], and so K is algebraically closed. • 


Corollary 6.59. If k is a countable field, then it has a countable algebraic closure. In 
particular, an algebraic closure ofQ or of¥ p is countable. 

Proof. If k is countable, then the set T of all nonconstant polynomials is countable, say, 
T — {fj, tg ,...}, because £[x] is countable. Hence, k[T ] = Ue>i • • •, tf\ is count¬ 
able, as is its quotient k\ (in the proof of Theorem 6.58). It follows, by induction on n > 0, 
that every k n is countable. Finally, a countable union of countable sets is itself countable, 
so that an algebraic closure of k is countable. • 

We are now going to prove the uniqueness of an algebraic closure. 

Definition. If F/k and K/k are extensions, then a k-map is a ring homomorphism 
ip : F —¥■ K that fixes k pointwise. 

We note that if K/k is an extension, if (p : K —> K is a k- map, and if a <= K is 
a root of some irreducible polynomial p(x) e k[x ], then <p permutes all the roots {a = 
a\, ag,.. ., a r } of p(x) that lie in K. If p(x) = x" + c„-\x n ~ x + • • • + co, then 

0 = piflj) = a'/ + c n -\a ” + • • • + cq. 
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and so 

0 = [<K«;)]" + <p(c„_i)[<p(a;)]" -1 H-h <p(co) 

— W(ai)] n + c n -\[(p(ai)] n ~ l H-h c 0 , 

because <p fixes all c/ e A. Therefore, (piuj ) is a root of p(x) lying in K. Finally, since <p is 
injective and {ai,..., a r ) is finite, (p is a permutation of these roots. 

Lemma 6.60. If K/k is an algebraic extension, then every k-map <p : K —> K is an 
automorphism of K. 

Proof By Corollary 3.53, the A-map <p is injective. To see that <p is suijective, let a e K. 
Since K/k is algebraic, there is an irreducible polynomial pix) e A[;r] having a as a root. 
As we remarked earlier, <p being a A'-map implies that it permutes the set of those roots of 
p(x) that lie in K. Therefore, a e im (p because a = (piuj) for some i. • 

The next lemma will use Zorn’s lemma by partially ordering a family of functions. 
Since a function is essentially a set, its graph, it is reasonable to take a union of functions 
in order to obtain an upper bound; we give details below. 


Lemma 6.61. Ifk/k is an algebraic closure, and if F / k is an algebraic extension, then 
there is an injective k-map js : F —>■ k. 

Proof If E is an intermediate field, k C E c h\ let us call an ordered pair (E, f) an 
“approximation” if / : E -> k is a A-map. In the following diagram, all arrows other than 
/ are inclusions. 

A 


A - 

Define 

X — {approximations (E. /)}. 

Note that X ^ 0 because (A, 1*) £ X. Partially order X by 

(E,f)<(E',f) if E C E' and f'\E = f. 

That the restriction f'\E is / means that f extends /; that is, both functions agree when¬ 
ever possible: f'(u) = f(u) for all u e E. 

It is easy to see that an upper bound of a chain 



5 = {(/•./. fj) :./ c ./} 


is given by ([J Ej, [J fj). That [J Ej is an intermediate field is, by now, a routine argu¬ 
ment. We can take the union of the graphs of the fj’s, but here is a more down-to-earth 
description of <I> = [J fj'- If u € (J Ej, then u e Ejo for some jo, and O: u m* fj Q ( u). 
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Note that <l> is well-defined: If u e Ej l , we may assume, for notation, that E /0 C E h , and 
then fj t ( u ) = fj 0 (u ) because extends fj 0 . The reader may check that <J> is a k-map. 

By Zorn’s lemma, there exists a maximal element (Eq, /o) in X. We claim that Eq = F, 
and this will complete the proof (take i fr = fo). If Eq C F, then there is a e F with 
a Eq. Since F/k is algebraic, we have F/Eq algebraic, and there is an irreducible 
p{x ) e FoU'] having a as a root; since k/k is algebraic and k is algebraically closed, we 
have a factorization in k[x]: 


fo'(p(x)) - J> ~bi), 

i= 1 

where / 0 *: Eq[x\ -> k[x] is the map induced by /o. If all the bj lie in /q(Eq) C k, 
then fQ l (bj) e Eq C E for all i, and there is a factorization of p(x) in E[x ], namely, 
p(x) = n ? =1 [x — f Q 1 {bj)]. But a ^ Eq implies a ^ / 0 1 (bj) for any i. Thus, x — a is 
another factor of p(x) in F[x], contrary to unique factorization. We conclude that there is 
some bj ^ im /q. By Theorem 3.120(h), we may define f \: Eq(q) —>• k by 


c o + c\a + cia" + • • • !->■ /o(cq) + fo{c\)bj + fo(c 2 )bf + • • • . 


A straightforward check shows that f\ is a (well-defined) k-map extending /o. Hence, 
(Eq, fo) -< (Eq(q), /i), contradicting the maximality of (Eq, fo). This completes the 
proof. • 


Theorem 6.62. Any two algebraic closures of a field k are isomorphic via a k-map. 

Proof. Let K and L be two algebraic closures of a field k. By Lemma 6.61, there are 
k-maps i [r-.K-^-L and 6 : L —> K. By Lemma 6.60, both composites 6 : K —*■ K and 
fid : L —>■ L are automorphisms. It follows that fi (and 6) is a k-isomorphism. • 

It is now permissible to speak of the algebraic closure of a field. 

In the remainder of this section, we investigate the structure of arbitrary fields; we begin 
with simple transcendental extensions k(x), where k is a field and x is transcendental over 
k; that is, we examine the function field k(x). 


Definition. If <p e k(x), then there are polynomials g(x), h(x) e k[x] with (g, h ) = 1 
and (p — g(x)/h (x). Define the degree of <p by 


degr eefip) — max{deg(g), deg(h)}. 
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A rational function cp e k (x ) is called a linear fractional transformation if 

ax + b 

V = —73- 
cx + a 

where a, b,c, d e k and ad — be 0. 

Now <p e k(x) has degree 0 if and only if cp is a constant (that is, <p e k), while 
Exercise 6.56 on page 375 says that (p e k(x) has degree 1 if and only if (p is a linear 
fractional transformation. If A = [“ *] e GL(2, k), write (A) = (ax + b)/(cx + d). 
If we define (A')(A) = (A'A), then it is easily checked that the set LF(k) of all linear 
fractional transformations with entries in k is a group under this operation. In Exercise 6.57 
on page 375, the reader will prove that LF(k) = PGL(2, k) — GL(2, k)/Z( 2, k), where 
Z(2, k) is the (normal) subgroup of all 2 x 2 (nonzero) scalar matrices. 

Proposition 6.63. If (p € k(x) is nonconstant, then cp is transcendental over k and 
k(x)/k((p) is a finite extension with 

[k(x) : k(cp)] = degree(^). 

Moreover, if cp = g(x)/h(x) and (g, h) — 1, then 

irr(x, k(cp )) = g(y) - (ph(y). 

Proof. Let g(x) = J^aix' and h(x) — J^bjX 1 e k[x]. Now 9(y) — g(y) — (ph(y) is a 
polynomial in k((p)[y\. 

0(y) = ^aiy 1 -(p^biy 1 = ^(a,- - (pbfiy 1 . 

If 0(y) were the zero polynomial, then all its coefficients would be 0. But if bi is a nonzero 
coefficient of h(y), then at — <pbj — 0 gives <p = ai/bi, contradicting the assumption that 
( p is not a constant; that is, <p £ k. It follows that 

deg(0) = deg(g(y) - <ph(y)) = max{deg(g), deg (h)} = degree(^). 

Since x is a root of 9(y), we have x algebraic over k(q>). If tp were algebraic over k, then 
k(cp)/k would be finite, giving [k(x) : k] = [k(x) : k(cp)][k((p) : k] finite, a contradiction. 
Therefore, <p is transcendental over k. 

We claim that 9(y) is an irreducible polynomial in k((p)[y\. If not, then 9(y) factors 
in k[<p][y], by Gauss’s Corollary 6.27. But 9(y) — g(y ) — <ph(y ) is linear in <p , and so 
Corollary 6.37 shows that 9(y) is irreducible. Finally, since deg((9) = degree(ip), we have 
[k(x) : k(<p )] = degree(<p). • 

Corollary 6.64. Let cp e k(x), where k(x) is the field of rational functions over afield k. 
Then k(np ) = k(x) if and only ifq> is a linear fractional transformation. 

Proof. By Proposition 6.63, k((p) — k(x) if and only if degree(^) = 1; that is, q> is a 
linear fractional transformation. • 
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Corollary 6.65. Ifk(x) is the field of rational functions over afield k, then 

Gal (k(x)/k) = LF (k), 
the group of all linear fractional transformations over k. 

Proof. Let a : k(x ) -> k(x) be an automorphism of k(x ) fixing k. Now o: x h-> x a , 
where x a e k(x)\ since a is surjective, we must have k(x a ) = k(x), and so x c is a 
linear fractional transformation, by Corollary 6.64. Define y: Gal (k(x)/k) —> LF( k) by 
y : o \-^f x a . The reader may check that y is a homomorphism ( x ar — x r x a ); y is an 
isomorphism because y ~ l is the function assigning, to any linear fractional transformation 
<p — (ax + b)/(cx + d), the automorphism of k(x) that sends x to <p. • 


Theorem 6.66 (Liiroth’s Theorem). Ifk(x) is a simple transcendental extension, then 
every intermediate field B is also a simple transcendental extension ofk: There is <p e B 
with B = k((p). 

Proof. If f> e B is not constant, then [, k(x) : k(f)] — [£(v) : B][B : k(f)] is finite, 
by Proposition 6.63; hence, [k(x) : B] is finite and x is algebraic over B. The proof of 
Proposition 6.63 shows that if <p e k(x ), then <p is a coefficient of irr(x, k(q>))\ the proof of 
Liiroth’s theorem is a converse, showing that B = k(cp ) for some coefficient <p of irr(x, B). 
Now 


irr(x, B) — y" + f n —\y n 1 + • • • + fio € B[y]. 

Each coefficient f e € B C. k (x) is a rational function, which we write in lowest terms: 
ft = ge(x)/hifx), where gtfx), hi(x) e k[x ] and (g£, hf) — 1. As in Lemma 6.24(i), the 
content c(irr) = d(x)/b(x), where b(x) is the product of the h( : and d(x) is their gcd. It is 
easy to see that f(x), defined by f(x) = b(x)/d(x), lies in k[x]\ in fact, the reader may 
generalize Exercise 1.26 on page 13 to show that fix) is the 1cm of the h e. Define 


i (x, y) — f(x) iir(x, B), 


the associated primitive polynomial in £[v][y] (of course, fc[v][y] = k[x, y], but we wish 
to view it as polynomials in y with coefficients in A'[v]). If we denote the highest exponent 
of y occurring in a polynomial a(x, y) by deg (a), then n — deg (/); let m — deg x (i). 

Since i(x, y) = f(x)y " + J2"Zo fWPiy e , we have m = max £ {deg(/), deg(/^)}. Now 
h((x) | f(x) for all (, so that deg(7z^) < deg(/) < m [because f (x) is one of the 
coefficients of i (x, y)]. Also, 


m 


ho■■■h „-i 

d he 


ho ■ ■■he ■■■ h n -1 


Since (ho ■ ■ ■ hi ■ ■ ■ h n -\)/d e k[x~\, we have deg(g^) < deg (fft) < m. We conclude that 
degfg^) < m and deg (hi) < m. 
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Some coefficient of irr(x, B) is not constant, lest x be algebraic over k. Omit the 
subscripts j, write /3j — g(x)//z(x), and define 

V = Pj = g(x)/h(x) e B. 

Now g(y ) — (ph(y ) = g(y) — g(x)h(x)~ l h(y ) e B[y] has x as a root, and so irr(x, B ) 
divides g(y ) — cph(y ) in fi[y] C k(x)[y}. Therefore, there is q(x, y) e ^(x)[y] with 

irr(x, B)q(x, y) = g(y) - <ph(y). (1) 

Since g(y) — tph(y) — h(x)~ l (h(x)g(y) — g(x)h(y)), the content c(g(y) — <ph(y )) is 
h(x)~ l and the associated primitive polynomial is 

T>(v, >•) = h(x)g{y) - g(x)h(y). 

Notice that 0 (jc, y) e fc[x][y] and that <f>(y, x) — —T>(x, y). 

Rewrite Eq. (1), where c(q) e k(x) is the content of q(x, y): 

f(x)~ l i(x, y)c(q)q(x, y)*h(x) = 4>(x, y) 

(remember that /(x) -1 is the content of irr(x, B) and i(x, y) is its associated primitive 
polynomial). The product i (x, y)q (x, y)* is primitive, by Gauss’s Lemma 6.23. But 
<t>(x, y) e ^[x][y], so that Lemma 6.24(iii) gives f(x)~ l c(q)h(x ) e fc[x]. We now define 


q**(x, y) — f(x)~ l c(q)h(x)q(x, y), so that< 7 **(x, y) e k[x, y] and 

i(x, y)q**(x, y) = <J>(x, y) in k[x, y], (2) 

Let us compute degrees in Eq. (2): the degree in x of the left hand side is 

degxdq**) = deg t (() + deg v 0?**) = m + deg,^**), (3) 

while the degree in x of the right hand side is 

deg v (<f>) = max{deg(g), deg(/z)} < m, (4) 

as we saw above. We conclude that m + deg t .(^**) < m, so that deg v (g**) = 0; that 


is, q**(x, y) is a function of y alone. But 0(x, y) is a primitive polynomial in x, and 
hence the symmetry <J>(y,x) = —0(x,y) shows that it is also a primitive polynomial 
in y. Thus, q** is a constant, and so z(x, y) and <J>(x, y) are associates in k[x, y]\ hence, 
deg^lT*) = degj. (i) = m. With Eq. (4), this equality gives 

m — deg A .(<J>) = max{deg(g), deg(/z)}. 

Symmetry of T> also gives deg y (0) = deg v (T>), and so 

n — deg y (4>) = deg A .(<t>) = m — max{deg(g), deg(7z)}. 

By definition, degrce(^) = max{deg(g), deg(/z)} = m; hence. Proposition 6.63 gives 
[&(x) : k((p)] — m. Linally, since cp e B, we have [£(x) : k(cp)] — [A:(x) : B][B : k(cp)]. 
As [£(x) : B] = n = in, this forces [B : k((p)] — 1; that is, B — k((p). • 

There are examples of intermediate fields B with kC.BC. k(x \,..., x„), for n > 1, 
that are not so easily described. 

We now consider more general field extensions. 
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Definition. Let E/k be a field extension. A subset U of E is algebraically depen¬ 
dent over k if there exists a finite subset {mi, ..., u,,} C U and a nonzero polynomial 
f(x i,..., x n ) e k[x i, ..., x n ] with /(mi, ..., u n ) = 0. A subset B of E is algebraically 
independent if it is not algebraically dependent. 

Let E/k be a field extension, let u \,..., u n e E, and let (p : k[x \, ..., x n ] -> E be the 
evaluation map; that is, <p is the homomorphism sending fix i,..., x n ) to /(mi, ..., m„) 
for all fix i,..., x„) e k[xi,..., x„], Now {mi, ..., m„) is algebraically dependent if and 
only if ker <p / {0}. If {u\,... ,u n } is algebraically independent, then ip extends to an 
isomorphism k(x i,..., x n ) = kin i,..., u n ) C E, where k{x i, ..., x„) is the field of 
rational functions Frac(k[xi,..., x,,]). In particular, {xj,..., x„} C E = k(xi,..., x„) is 
algebraically independent, for q> is the identity map in this case. 

Since algebraically dependent subsets are necessarily nonempty, it follows that the 
empty subset 0 is algebraically independent. A singleton {e} C E is algebraically de¬ 
pendent if e is algebraic over k\ that is, e is a root of a nonconstant polynomial over k, and 
it is algebraically independent if e is transcendental over k, in which case k(e) = kix). 

Proposition 6.67. Let E/k be afield extension, and let U C E. Then U is algebraically 
dependent over k if and only if there is u € U with u algebraic over kiU — {m}). 

Proof. If U is algebraically dependent over k, then there is a finite algebraically depen¬ 
dent subset U' = {mi, ..., m„} C U. We prove, by induction on n > 1, that some m; 
is algebraic over klU' — { m /}). If n — 1, then there is some nonzero fix) e k[x] with 
/(mi) = 0; that is, mi is algebraic over k. But U' — {m i} = 0, and so u\ is algebraic 
over kiU' — {mi}) = k(0) = k. For the inductive step, let U' — {mi, ..., m„+ j} be 
algebraically dependent. We may assume that {mi, ..., u n ] is algebraically independent; 
otherwise, the inductive hypothesis gives some uj , for 1 < j < n, which is algebraic 
over k{u\, ... ,uj ,..., u„), and hence, algebraic over kiU' — {m,}). Since U' is alge¬ 
braically dependent, there is a nonzero fiX, y) e k[x \,..., x„, y] with /(m, m„ + i) = 0, 
where X — (xi,..., x„), y is a new variable, and m = (mi, ..., u n ). We may write 
fiX,y) = J2jgiiX)y l , where gi(X) e k[X] (because k[X, y] — k[A][y]). Since 
fiX,y) / 0, some gi(X) / 0, and it follows from the algebraic independence of 
{mi, ..., m„} that giiu) ^ 0. Therefore, hfy) — JL gi(u)y l e k(U)[y] is not the zero 
polynomial. But 0 = /(w, u n +i) = h(u n + 1 ), so that m,, + i is algebraic over k{u \,..., m„). 

For the converse, assume that u is algebraic over k(U — {m}). We may assume that 
U — {m} is finite, say, U — {m} = {mi, ..., m„}, where n > 0 (if n = 0, we mean that 
U — {u} — 0). We prove, by induction on n > 0, that U is algebraically dependent. If 
n = 0, then u is algebraic over k, and so {m} is algebraically dependent. For the inductive 
step, let U — {m ;!+ i} = {mi, ..., u„}. We may assume that U — {m„+i} = {mi, ..., m„} 
is algebraically independent, for otherwise U — {m„ + i}, and hence its superset U, is al¬ 
gebraically dependent. By hypothesis, there is a nonzero polynomial fiy) = Cjy' e 
kiu \,..., M„)[y] with /(m„+i) = 0. As fiy) ^ 0, we may assume that one of its terms, 
say, cj ^ 0. Now c, e k{u \,..., u n ) for each i, and so there are rational functions 
C;(xi,... ,x n ) with Ci(u) = Cj, where u — (mi, ..., u n ). Since f(u n +\) = 0, we may 
clear denominators and assume that each c,-(xi,..., x„) is a polynomial in k[x i,..., x„]. 
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Moreover, Cj(u) ^ 0 implies cj(x],, x „) ^ 0, and so 

g(x i, ■ ■ ■, y) — Ci (xi,..., x„)y' 

i 

is nonzero. Therefore, {mi, , u n+ 1 } is algebraically dependent. • 

Definition. A field extension E / k is purely transcendental if either E = k or E contains 
an algebraically independent subset B and E — k(B). 

If X — {xi, ..., x„} is a finite set, then 

k(X) — k(x i, ..., x„) = Frac(fe[xi, ..., x„]) 

is called the function field in n variables. 

We are going to prove that if E / k is a field extension, then there exists an intermediate 
field F with F/k purely transcendental and E/F algebraic. In fact, F — k(B), where B 
is a maximal algebraically independent subset of E/k, and any two such subsets have the 
same cardinal. The proof is essentially the same as a proof of the invariance of dimension 
of a vector space, and so we axiomatize that proof. 

Recall that a relation R from a set Y to a set Z is a subset R C Y x Z: we write y R z 
instead of (y, z) e R. In particular, if £2 is a set, P(Q) is the family of all its subsets, and 
< is a relation from £2 to P(Q), then we write 

x < S 


instead of (x, S) e ■<. 

Definition. A dependency relation on a set £2 is a relation ;< from £2 to P(Q) that satisfies 
the following axioms: 

(i) ifx e S , thenx < S; 

(ii) ifx < S , then there exists a finite subset S' c. S with x < 

(iii) ( Transitivity ) ifx < S and if, for some T C £2, we have s < T for every s e S. then 

x < T; 

(iv) (Exchange Axiom) ifx < S and x f S — {)’}, then y < (S — {y}) U {x}. 

The transitivity axiom says that if x is dependent on a set 5, and if each element of S is 
dependent on another set T. then x is dependent on T. 

Example 6.68. 

If £2 is a vector space, then define x < S to mean x e (.S'), the subspace spanned by S. We 
claim that ■< is a dependency relation. The first three axioms are easily checked. We verify 
the exchange axiom. Ifx ^ S and x £ S — {_y}, then S — S' U {y} with y ^ S' . There are 
scalars a,-, a with x = ay + a,s,-, where s, e S'\ since x ^ (S'), we must have a 0. 

Therefore, y = a -1 (x — a,s,) e (S' , x}, and so y < S' U {x}. ◄ 
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Lemma 6.69. If E / k is afield extension, then a < S, defined by a is algebraic over k(S), 
is a dependency relation. 

Proof. It is easy to check the first two axioms in the definition of dependency relation, 
and we now verify axiom (iii): If x < S and if, for some T C £2, we have s < T for every 
s e S, then x < T. If F is an intermediate field, denote the field of all e e E that are 
algebraic over F by F. Using this notation, x < S if and only if x e k(S). Moreover, 
s < T for every s e S says that S c k{T). It follows that x e k{T), by Lemma 6.56(i), 
and so x < T. 

The exchange axiom says. If u < S and u f S — {u}, then v < (S — {u}) U {u}. 
Write S' — S — {u}, so that u is algebraic over k(S ) and u is transcendental over k(S'). 
Now {m, i>} is algebraically dependent over k(S'), by Proposition 6.67, and so there is a 
nonzero polynomial f(x, y) e k(S')[x, >’] with f{u, v) — 0. In more detail, f(x, y) = 
go(x) + gt(x)y + • • • + gn(x)y", where g n (x ) is nonzero. Since u is transcendental over 
k(S '), we must have g„(u) f 0. Therefore, h{y) = f {u, y) e k(S', u)[y] is a nonzero 
polynomial. But h(v) — f(u, v ) = 0, and so v is algebraic over k(S\ u ); that is, v -< 
S' U{«} = (5-{n})U{«}. • 

Example 6.68 suggests the following terminology. 

Definition. Let < be a dependency relation on a set Call a subset S C Q dependent 
if there exists s e S with ,v < S — {,v|; call S independent if it is not dependent. We say 
that a subset S generates £2 if x < S for all x e £2. A basis of £2 is an independent subset 
that generates £2. 

Note that 0 is independent, for dependent subsets have elements. If S 0, then S 
is independent if and only if s f S — {,v} for all ,v e S. It follows that every subset 
of an independent set is itself independent. By Proposition 6.67, algebraic independence 
defined on page 361 coincides with independence just defined for the dependency relation 
in Lemma 6.69. 

Lemma 6.70. Let <be a dependency relation on a set £2. If T c £2 is independent and 
Z T for some :e£2, then T U {z} D T is a strictly larger independent subset. 

Proof. Sincez f T, axiom (i) gives z f T, and so T C 7U{z); itfollows that (TU{z))- 
{z} = T. If T U {z} is dependent, then there exists t e T U {z} with M (I’Ll {z}) — {?}. 
If t = z, then z < T U {z} — {z} = T, contradicting z f_ I . Therefore, t e T. Since T is 
independent, t T — {r}. If we set S = T U {z} — {f}, t — x, and y = z in the exchange 
axiom, we conclude that z < (T U {z} — {f}) — {z} U {f} = T , contradicting the hypothesis 
z.fT. Therefore, 7’ U {z} is independent. • 

We now generalize the proof of the exchange lemma. Lemma 3.84, and its application 
to invariance of dimension. Theorem 3.85. 

Theorem 6.71. If < is a dependency relation on a set £2, then £2 has a basis. In fact, 
every independent subset B of £2 is part of a basis. 

Proof. Since the empty set 0 is independent, the second statement implies the first. 
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We use Zorn’s lemma to prove the existence of maximal independent subsets of Q 
containing B. Let X be the family of all independent subsets of containing B, partially 
ordered by inclusion. Note that X is nonempty, for B e X. Suppose that C is a chain 
in X. It is clear that C* = UceC * s an u PP er bound of C if it lies in X\ that is, if C* is 
independent. If, on the contrary, C* is dependent, then there is y € C* with y < C* — {y}. 
By axiom (ii), there is a finite subset {xi,..., x„) C C* — {_y} with y < [xi,, x n } — {y}. 
Now there is Co e C with y e Co, and, for each i. there is C; e C with x,- e C,-. 
Since C is a chain, one of these, call it C, contains all the others, and the dependent set 
{y, xi,..., x,,} is contained in C'. But since C' is independent, so are its subsets, and this 
is a contradiction. Zorn’s lemma now provides a maximal element M of X; that is, M is 
a maximal independent subset of Q containing B. If M is not a basis, then there exists 
x e with x M. By Lemma 6.70, M U {x} is an independent set strictly larger than M. 
contradicting the maximality of M. Therefore, bases exist. • 

Theorem 6.72. If £2 is a set with a dependency relation <, then any two bases B and C 
have the same cardinality. 

Proof. If B — 0, we claim that C — 0. Otherwise, there exists y e C and, since C is 
independent, y ft C — {y}. But y < B — 0 and 0 C C — {y}, so that axiom (iii) gives 
y < C — {y}, a contradiction. Therefore, we may assume that both B and C are nonempty. 

Now assume that B is finite; say, B — {xi,.... x„}. We prove, by induction on k > 0, 
that there exists {yi,..., y*_i} C C with 

Bk = {yi,..., y*— t, x*, ...,x„} 

a basis: The elements xi ..., Xk-\ in B can be replaced by elements yi,..., }’k -1 e C so 
that Bk is a basis. We define Bq = B. and we interpret the base step to mean that if none 
of the elements of B are replaced, then B — Bo is a basis; this is obviously true. For the 
inductive step, assume that Bk = {yi...., yk- 1 , Xk, ■ ■ ■, x n } is a basis. We claim that there 
is y e C withy f If- — {xO. Otherwise, y < If — {x^-} for all y e C. Butxj < C. because 
C is a basis, and so axiom (iii) gives Xk L If — {x*}, contradicting the independence of 
Bk- Hence, we may choose yk e C with yk f If - — {x^}. By Lemma 6.70, the set lf : + 1 , 
defined by 

Bk+i = (B k - {x*}) U {y*} = {yi, ...,yk, Xk+ i, •.., x„), 

is independent. To see that Bk +1 is a basis, it suffices to show that it generates Q. Now 
yk Bk (because Bk is a basis), and yk f Bk — {x*}; the exchange axiom gives Xk < 
(Bk — {x k }) U \yk) — Bk+ 1 - By axiom (i), all the other elements of If are dependent on 
If+ 1 . Now each element of O is dependent on If : . and each element of Bk is dependent 
on Bk- |-i. By axiom (iii), If+\ generates O. 

If |C| > n — \B\. that is, if there are more y’s than x’s, then B„ C C. Thus a proper 
subset of C generates Q, and this contradicts the independence of C. Therefore, \C\ < \B\. 
It follows that C is finite, and so the preceding argument can be repeated, interchanging 
the roles of B and C. Hence, B < |C|, and we conclude that \B\ — |C'| if Q has a finite 
basis. 
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When B is infinite, the reader may complete the proof by adapting the proof of The¬ 
orem 6.51. In particular, replace supp(u) in that proof by axiom (ii) in the definition of 
dependency relation. • 

We now apply this general result to algebraic dependence. 

Definition. If E/ k is a field extension, then a transcendence basis is a maximal alge¬ 
braically independent subset of E over k, and the transcendence degree of E/k is defined 
by 

tr. deg (E/k) = \B\. 

The next theorem shows that transcendence degree is well-defined. 

Theorem 6.73. If E / k is a field extension, then there exists a transcendence basis B. If 
F = k(B), then F/k is purely transcendental and E/F is algebraic. Moreover, if B and 
C are maximal algebraically independent subsets, then |B| = \C\. 

Proof. In Lemma 6.69, we saw that a < S, defined by a being algebraic over k(S), is a 
dependency relation. By Theorems 6.71 and 6.72, transcendence bases exist, and any two 
of them have the same cardinality; that is, transcendence degree is well-defined. It remains 
to show that if B is a transcendence basis, then E/k(B) is algebraic. If not, then there 
exists a e E with a transcendental over k(B). By Lemma 6.70, B U {a} is algebraically 
independent, and this contradicts the maximality of B. • 

Example 6.74. 

(i) Intermediate fields F, as in the statement of Theorem 6.73, need not be unique. For 
example, if E — Q(7r), then Q(7T 4 ) and Q(7r 2 ) are such intermediate fields. 

(ii) If E = k(x i,..., x„) is the field of rational functions in n variables over a field k, then 
tr. deg( E/k) — n, for {xi,, x n ) is a transcendence basis of E. 

(iii) If E/k is a field extension, then E/k is algebraic if and only if tr. deg (E jk ) = 0. ◄ 

Here is a small application of transcendence degree. 

Proposition 6.75. There are nonisomorphic fields each of which is isomorphic to a sub¬ 
field of the other. 

Proof. Clearly, C is isomorphic to a subfield of C(x). However, we claim that C(x) is 
isomorphic to a subfield of C. Let B be a transcendence basis of C over Q, and discard 
one of its elements, say, b. The algebraic closure F of Q (B — {Z?}) is a proper subfield of 
C, for b F; in fact, b is transcendental over F, by Proposition 6.67. Therefore, F = C, 
by Exercise 6.54 on page 375, and so F(b) = C(x). Therefore, each of C and C(x) is 
isomorphic to a subfield of the other. On the other hand C(x) ^ C, because C(x) is not 
algebraically closed. • 

We continue our investigation into the structure of fields by considering separability in 
more detail. Recall that if E/k is a field extension, then an element a e E is separable 
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over k if either a is transcendental over k or irr(a, k) is a separable polynomial 9 ; that is, 
irr(«, k) has no repeated roots. An extension E/k is separable if every a e E is separable 
over k; otherwise, it is inseparable. 

Proposition 6.76. Let fix) e k[x], where k is afield, and let fix) be its derivative. 

(i) fix) has repeated roots if and only if (/, f) f 1. 

(ii) Ifk is afield of characteristic p > 0, then fix) — 0 if and only if fix) e k[x p ]. 

(iii) Ifk is afield of characteristic p > 0 and if f'{x) — 0, then f{x) has no repeated 
roots. Conversely, if fix) is an irreducible polynomial in k\x\ then the conditions 
in parts (i) and (ii) are all equivalent. 

Proof, (i) If fix) has repeated roots, then fix) = (x — a) 2 gix) in k\x ], so that fix) = 
2(x — oOgCr) + (x — a) 2 g'ix). Therefore, x — a is a common divisor of fix) and fix), 
and so (/, f)f 1. 

Conversely, it suffices to work in a splitting field of fix), by Corollary 3.41. If x — a is 
a divisor of (/, f), then fix) = (x — o;)n(x) and fix) = (x — a)v(x). The product rule 
gives fix) — m(x) + (x — a)u\x), so that m(x) = (x — a)(u(x) — u\x)). Therefore, 

fix) — (x — a)w(x) = (x — a) 2 (ii(x) — u'ix)), 

and so fix) has a repeated root. 

(ii) Assume that fix) = fi a;x' and fix) — 0 = fa,x !_l . If the coefficient at 0, 
then iaix'~ l — 0 if and only if fa; = 0; this happens only if p \ i. Therefore, the only 
nonzero coefficents of fix) must be of the form a; for p \ i ; that is, fix) e k[x p \. 

If fix) e k[x p ], then fix) = J2j a P j xPj and f'(x) = L/ Pja p jx pj ~ l = 0. 

(iii) If fix) — 0, then if, f) = if, 0) = /; hence, if fix) is not constant [in particular, 
if fix) is irreducible], then (/, f) 1. 

Conversely, if fix) is irreducible, then (/, f) — 1 or (/, f) = f. Now if, f) 1, 
so that if, f) — f and, hence, f \ f. We claim that fix) — 0. If, on the contrary, 
fix) ^ 0, then fix) has a degree and deg(/') < deg(/). But / | f implies deg(/) < 
deg (/'), and this is a contradiction. Hence, fix) = 0. • 


Corollary 6.77. If k is afield of characteristic p > 0 and fix) € k\x], then there exists 
e > 0 and a polynomial gix) e k[x] with gix) ^ k[x p ] and fix) — gix p ). Moreover, if 
fix) is irreducible, then gix) is separable. 

Proof. If fix) £ k[x p ], define gix) = fix): if fix) e k[x p ], there is fix) e [x] with 
fix) = f(x p ). Note that deg (/) = pdegif). If fix) <£ k[x p ], define gix) = fix): 
otherwise, there is / 2 (x) e k[x) with /i(x) = fiix p ): that is, 

fix) = fix p ) = f 2 ix p ~). 

^Recall that an irreducible polynomial is separable if it has no repeated roots, and an arbitrary polynomial is 
separable if each of its irreducible factors has no repeated roots. 
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Since deg(/) > deg(/i) > • • •, iteration of this procedure must end after a finite number e 
of steps. Thus, f(x ) = g(x p ), where g(x), defined by gix) — f e (x), does not lie in k[x p \. 
If, now, fix) is irreducible, then f\ (x) is irreducible, for a factorization of /j(x) would 
give a factorization of fix). It follows that f) (x) is irreducible for all i. In particular, f e (x) 
is irreducible, and so it is separable, by Proposition 6.76(iii). • 

Definition. Let A: be a field of characteristic p > 0, and let fix) e k[x\. If fix) = 
g(x p ), where g(x) e k[x] but gix) ^ k[x p ], then 

deg(/) = p e degig). 

We call p e the degree of inseparability of fix), and we call deg(g) the reduced degree of 

fix). 

Example 6.78. 

3 2 

Let fix) — x p + x p + t e F p (f)[x]. If gix) = x p + x + t, then gix) is separable (for 
g\x) — 1^=0). Therefore, fix) has degree of inseparability p and reduced degree p 2 . < 

If A: is a field of prime characteric p > 0, then the Frobenius map F : k -> k, defined by 
F: a i-> a p , is a homomorphism [because (or + f ) p = a p + f p ]. As any homomorphism 
of fields, F is an injection. Denote im F by k p , so that k p is the subfield of k consisting of 
all the pth powers of elements in k: 

k p = im F — {a p : a e k}. 

To say that F is surjective, that is, k = k p , is to say that every element in k has a pth root 
in k. 

Definition. A field k is called perfect if either k has characteristic 0 or if k has character¬ 
istic p > 0 and k — k p . 

Existence of pth roots in k is closely related to separability. 

Proposition 6.79. 

(i) A field k is perfect if and only if every polynomial in A:[x] is separable. 

(ii) Ifk is a perfect field, then every algebraic extension E / k is a separable extension. 

(iii) Every finite field k is perfect, and every algebraic extension E/k is separable. In 
particular, if F p is the algebraic closure of ¥ p , then F p /¥ p is a separable extension. 

Proof, (i) If k has characteristic 0, then Lemma 4.4 shows that every polynomial in A'[x] is 
separable. Assume now that k has characteristic p > 0 and that fix) e k[x] is inseparable. 
By Proposition 6.76, fix) e k[x p ], so that fix) — atx p '. If every element in k has a 
pth root, then «,■ = b 1 ’ for bt e k. Hence, 

fix) = J2 b ? xPi = 

i i 
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and so f (x) is not irreducible. In other words, it'A' = k p , then every irreducible polynomial 
in k[x] is separable and, hence, every polynomial is separable. 

Conversely, assume that every polynomial in A[x] is separable. If k has characteristic 
0, there is nothing to prove. If k has characteristic p > 0 and if a e k, then x p — a has 
repeated roots; since our hypothesis says that irreducible polynomials are separable, x p — a 
factors. Proposition 3.126 now says that a has a pth root in A; that is, a e k p . Therefore, 
k — k p , and so k is perfect. 

(ii) If E/k is an algebraic extension, then every a e E has a minimum polynomial irr(oi, A); 
since irr(a, A) is a separable polynomial, by part (i), a is separable over A, and so E/ A is a 
separable extension. 

(iii) As any homomorphism of fields, the Frobenius F : A -» A is injective. If A is a 
finite field of characteristic p > 0, then Exercise 1.58 on page 36 shows that F must also 
be surjective; that is, k — k p . Therefore, A is perfect, and part (ii) gives the rest of the 
statement. • 

We will soon need a variant of Proposition 3.126. 

Lemma 6.80. Let p be a prime, let e > 0, and let A be afield of characteristic p > 0. If 
c e A and c f k p , then f (x) = x p — c is irreducible in k[x]. 

Proof The proof is by induction on e > 0, the base step being true because every linear 
polynomial is irreducible. For the inductive step, suppose the statement is false. Let g{x) e 
k[x] be irreducible, and let g(x) m , for m > 1, be the highest power of g(x) dividing f(x): 

x pC — c = g(x)' n h(x), 

where (g(x'), /z(x)) = 1. Take the derivative, 0 = mg(x) m ~ 1 g'(x)h(x) + g(x) m h'(x), and 
divide by g{x) m ~ l , 

0 = mg' (x)h(x) + g{x)h'(x). 

Therefore, h(x) \ h'(x), because (g,h) = 1. If h'(x) f 0, then deg(/!') is defined and 
deg(7z') < deg(/z), a contradiction; thus, h'(x) = 0. Proposition 6.76 gives 

h(x) = h\(x p ), where h[(x) e A[x]. 

Now mg'(x)h(x) — 0 gives 


mg'(x) = 0, (5) 

for h(x) f 0, and this implies that (g m (x))' — 0. Hence, Proposition 6.76 gives 
g m (x) = gi(x p ), where gi(x) e A[x], 


Therefore, 


x p -c = g{x) m h{x) = gi(x p )h l (x p ), 
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and so, replacing x p by x, we have 

1 

x' - c = g\{x)h\{x). 

£ - 1 , , 

Since x p — c is irreducible, by the inductive hypothesis, one of "i, /; i must be constant. 
But if g i (x ) is constant, then g\{x p ) is constant and g m (x) is constant, a contradiction. 
Therefore, h\(x) is constant; absorbing it into g i(x), we have x p< — c = gi(x) and 

x‘ f - c = gi(x p ) = g(x) m . 

If p | m. then x p ‘ — c = (g(x) p ) m / p , and so all the coefficients lie in k p , contradicting 
c f. k p \ therefore, p \ m. Eq. (5) now gives g'(x) — 0, so that g(x) e k[x p \, say, 
g(x) — g 2 (x p ). This forces m = 1, because x pe — c = g(x) m gives x p ‘ — c — g 2 (x) m , 
which is a forbidden factorization of the irreducible x p< — c. • 

If E/k is a field extension, where k has characteristic p, then k p C E p , but we do not 
know whether k c E p \ that is, E p may not be an intermediate field of E/k (for example, 
take E = k). Denote the subfield of E obtained by adjoining E p to k by k(E p ). 

Proposition 6.81. 

(i) Let k C B C E be a tower of fields with E/k algebraic. If E /k is separable, then 
E /B is separable. 

(ii) Let E/k be an algebraic field extension, where k has characteristic p > 0. If E / k is 
a separable extension, then E — k(E p ). Conversely, if E / k is finite and E — k(E p ), 
then E/k is separable. 

Proof, (i) If a e E, then a is algebraic over B, and irr(a, B) \ irr (a, k) in B[x], for their 
gcd is not 1 and irr( a, B) is irreducible. Since irr(«, k) has no repeated roots, irrfor, B) 
has no repeated roots, and hence irr(«, B) is a separable polynomial. Therefore, E/B is a 
separable extension. 

(ii) Let E/k be a separable extension. Now k(E p ) C E, and so E/k(E p ) is a separable 
extension, by part (i). But if f e E, then f p e E p C k(E p )\ say, f p — a. Hence, 
irr(/S, k(E p )) \ ( x p — a) in [k{E p f)[x], and so this polynomial is not separable because it 
divides x p — a — (x — f) p . We conclude that f e k(E p )\ that is, E — k(E p ). 

Conversely, suppose that E = k(E p ). We begin by showing that if /Jj,..., f) s is a 
linearly independent list in E (where E is now viewed only as a vector space over k), then 
/if, ..., fs is also linearly independent over k. Extend ft\,..., ft s to a basis fti ,..., ft n of 
E, where n — [E : k]. Now /if,..., ft ft spans E p over k p , for if ij e E, then i] = at fti, 
where a; e k, and hence r] p — aft ftft. Now take any element y e E. Since E — k(E p ), 
we have y — Cjrjj, where cj e k and rjj e E p . But rjj = aftjftft for a ;! - e k, as 

we have just seen, so that y — (ftHj c j a fti ) Pi't that is, ftft ,..., ftft spans E over k. 

Since dim^E) = n, this list is a basis, and hence its sublist ftft, ..., ftft must be linearly 
independent over k. 
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Since E/k is finite, each a is algebraic over k. If irr(a, k ) has degree m, then 1, a, 
a 2 ,.. .,a m is linearly dependent over k, while 1, a , a 2 ,..., a nl ~ l is linearly independent. 
If a is inseparable, then irr(a, k) — f e (x p ) and m — p e r where r is the reduced degree of 
irr(a, k). Since r — m / p e < m, we have 1, a, a 2 , ... ,a r linearly independent over k. But 

a p is aroot of f e {x), so there is a nontrivial dependency relation on 1, a p ,a 2p . a rp 

(for rp e = m). We have seen, in the preceding paragraph, that linear independence of 
1, a, a 2 ,..., a r implies linear independence of 1, a p , a 2p ,..., a rp . This contradiction 
shows that a must be separable over k. • 


Corollary 6.82. Let E/k be a finite separable extension, where k is afield of character¬ 
istic p. If a list fi . f r in E is linearly independent over k, then for all e > 1, the list 

p e p‘ 

, ..., fir i s also linearly independent over k. 

Proof. The proof is by induction one > 1, with the hypothesis of separability used in the 
form E = k(E p ), as in the proof of Proposition 6.81(h). • 


Corollary 6.83. Ifk C B C E is a tower of algebraic extensions, then B /k and E/B 
are separable extensions if and only if E / k is a separable extension. 

Proof. Since B/k and E/B are separable. Proposition 6.81(h) gives B — k(B p ) and 
E — B(E P ). Therefore, 

E = B(E P ) = k(B p )(E p ) = k(B p U E P ) = k(E p ) C E, 

because B p C E p . Therefore, E/k is separable, by Proposition 6.81(h). 

Conversely, if every element of E is separable over k, we have, in particular, that each 
element of B is separable over k\ hence, B/k is a separable extension. Finally, Proposi¬ 
tion 6.8l(i) shows that E/B is a separable extension. • 


Proposition 6.84. If E/ K is an algebraic extension, define 

E s = {a e E : a is separable over £}; 

then E s is an intermediate field that is the unique maximal separable extension of k con¬ 
tained in E. 

Proof. This follows from Proposition 4.38(h), for if a, ft are separable over k, then 
k(a. f)/k is separable, and hence a + f, aft, and a -1 are all separable over k. • 

Not surprisingly, if E/ k is an algebraic extension, then the extension E/E s has a special 
property. Of course, E s is of interest only when k has characteristic p > 0 (otherwise, 
E s = E). 

The next type of extension is “complementary” to separable extensions. 
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Definition. Let E/k be a field extension, where k has characteristic p > 0. Then E/k 
is a purely inseparable extension if E/k is algebraic and, for every a<= E, there is e > 0 
with a p e k. 

If E/ k is a purely inseparable extension and B is an intermediate field, then it is clear 
that E/B is purely inseparable. 

Proposition 6.85. If E/k is an algebraic field extension, where k has characteristic 
p > 0, then E / E s is a purely inseparable extension; moreover, if a e E, then irr(a, E s ) = 
x p — c for some m > 0. 

Proof. If a e E, write irr(a, k) — f e (x p ' ), where e > 0 and f e (x) e £[x] is a separable 
polynomial. It follows that a p is separable over k and a p e E s . If a (/ E s , choose 

m m m 

m minimal with a p e E s . Now a is a root of x p — a p , which is irreducible, by 
Lemma 6.80, and so irr(a, E s ) = x p — c, where c — a 1 ’ . • 


Definition. If E/k is a finite extension, define the separability degree by [E : k] s = 
[E s : k], and define the inseparability degree by [E : k]j = [E : E s ]. 

Note that E/k is separable if and only if [E : k]j — 1. It is clear that 


[E : k] = [E : k] s [E : k]j. 

Proposition 6.86. Let E / k be a finite extension, where k is afield of characteristic p > 0. 
If E / k is purely inseparable, then [£:£] = p e for some e > 0. Hence, for some e > 0, 

[E : k]i = [E : E s ] = p e . 

Proof. If a e E, then a is purely inseparable over k; if a is not constant, then irr(a, E s ) = 
x p — c for some c e k, where m > 1. Therefore, 

[E : k] = [E : k(a)][k(a) :&] = [£: k(a)]p m . 

Now [E : k(ct)\ < [E : k]; since E/k(a ) is purely inseparable, the proof can be com¬ 
pleted by induction. The second statement follows from Proposition 6.85, for E is purely 
inseparable over E s . • 


Proposition 6.87. Ifkc.BC.Eisa tower of finite extensions, where k is a field of 
characteristic p > 0, then 

[E : k] s = [E : B] S [B : k] s and [E : k] t = [E : B]i[B : k] t . 


Proof. In light of the equation [£ : k] = [L : : k]j, it suffices to prove [£ : = 

[E : B] S [B :k] s . 
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The notation B s is unambiguous, but the notation E s here is ambiguous. We write E s to 
denote the intermediate field consisting of all those elements of E that are separable over 
k, and we write 

Eg — {« e E : a is separable over B}. 

We have k C B s C E s C Eb c E ; let us see that E s C Eb- If o' € E is separable over 
k, then irr(a, k) has no repeated roots; hence, a is separable over B, because irr(a, B) \ 
irr(a, k ) in S[x], and so a e Eb- With this notation, 

[E : k\ s — [E s : k ], [E : B] s = [E B : B], and [B : k] s = [B s : k]. 

Now 

[E ; k] s = [E s : k] = [E s : : k] = [E s : B S ][B : *],. 

Thus, it suffices to prove 

[E s : B s ] = [E b : B], 


for [E B '■ B] — [E : B] s . 

We show that [£, : B s ] < [Eb : B] by proving that a list /Ji,, f) r in E s C Eb 
linearly independent over B s is also linearly independent over B. Suppose that bj ft/ — 
0, where bj e B are not all 0. For all e > 0, we have 0 = (^ Pi) p< = fij’ . But 
there is e > 0 with bf € B s for all i, because B/B s is purely inseparable, and so the 

D e D e 

list P[ , ■■■, fir is linearly dependent over B s , contradicting Corollary 6.82 (for E s /B s 
is a separable extension). For the reverse inequality [E v : B s ] > [Eb : B], take a list 
yi,... ,Yt in Eb that is linearly independent over B. Since Eb/E s is purely inseparable 
(it is an intermediate field of E/E s ), there is e > 0 with v e E s for all i. But E s /B 

pH p H 

is a separable extension, so that Corollary 6.82 gives y( ,..., y, linearly independent 
over B; a fortiori, yj’ ,, y, p is linearly independent over B s . Therefore, [E s : B v ] = 
[Eb : B], . 

We merely state some further results about separability. 

Definition. If A and B are intermediate fields of a field extension E/k, then A and B are 
linearly disjoint if every finite list a\,.... a n in A that is linearly independent over k is 
linearly independent over B. That is, if JT c, a,- = 0 implies all c,- = 0 whenever c, e k, 
then Y2i Pi a i — 0 implies all /3; = 0 whenever fl, e B. 

This condition on A and B can be shown to be symmetric; that is, every finite list in 
B that is linearly independent over k is also linearly independent over A. In Chapter 4, 
we defined two intermediate fields A and B to be linearly disjoint if A fi B — k. This 
new definition is stronger than the old one; If a € A and a j k. then 1, a is linearly 
independent over k. If a e A D B. then —a ■ 1 + 1 ■ cy = 0 is a dependency relation 
over B (for —a, 1 e B). However, there are examples of intermediate fields A and B with 
AH B — k that are not linearly disjoint in this new sense. 
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Definition. Let k be a field of characteristic p: for n > 1, define 

k l/p = {a ek : a p e k}, 
where k is the algebraic closure of k. 


Theorem. An algebraic field extension E/k is separable if and only ifk l / p and E are 
linearly disjoint (as intermediate fields of E/k, where E is the algebraic closure of E). 

Proof. See Zariski-Samuel, Commutative Algebra I, page 109. • 

If we do not assume that a field extension E/k is algebraic, are the generalizations of 
Propositions 6.8 l(ii) through 6.85 still true? 

Definition. A separating transcendence basis of a field extension E/k is a transcendence 
basis B with E/k(B ) a separable extension. 

Not every extension E/k has a separating transcendence basis. For example, if E/k is 
an inseparable algebraic extension, then the only transcendence basis is 0; but k(0) = k, 
and E/k(0) is inseparable. 

Theorem (Mac Lane). If a field extension E/k has a separating transcendence basis, 
then E and k l / p are linearly disjoint intermediate fields of E, the algebraic closure of 
E. Conversely, if E and k l ^ p are linearly disjoint and E/k is finitely generated, that is, 
E — k(u\, ..., u n ), then E/k has a separating transcendence basis. 

Proof. See Jacobson, Basic Algebra II, page 519. • 

The following example shows why one assumes, in Mac Lane’s theorem, that E/k is 
finitely generated. 

Example 6.88. 

Let k be a perfect field of characteristic p, let k(x) be the function field, and define 

E = k({u n , for n > 1 : u p = x}). 

Since k is perfect, every extension of k is separable, and so E D k l/p — k. However, 
we claim that E/k does not have a separating transcendence basis. By Exercise 6.52 on 
page 375, tr. deg (E/k) — 1, because any pair x l ^ p and x x ! p are algebraically depen¬ 
dent; let {f} be a transcendence basis. Now k(f) ^ E , and so there exists some u„ with 
Ufi f k(fiy, choose n minimal. Consider the tower £ (f) C k(f,u n ) C E. If [ft] were a sep¬ 
arating transcendence basis, then E/k(f>, u„ ) would be separable, by Proposition 6.81 (i). 
But irr (u„, k(f )) is a nonlinear divisor of y p — x p , because u n f k(f), and hence it has 
repeated roots; therefore, E/k(f, u„) is inseparable, a contradiction. ◄ 
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Exercises 

6.41 Let k be a field of characteristic p > 0, and let f{x) = x~ p — x p + t e k(t)[x], 

(i) Prove that f(x) is an irreducible polynomial in k(t)[x]. 

(ii) Prove that f{x) is inseparable. 

(iii) Prove that there exists an algebraic extension E/k(t) for which there is no intermedi¬ 
ate field E[ with E,/k purely inseparable and E/Ej separable. (Compare with Corol¬ 
lary 6.85 and Proposition 4.38.) 

6.42 Let m be a positive integer, and let X be the set of all its (positive) divisors. Prove that X is a 
partially ordered set if one defines a < b to mean a \ b. 

6.43 Recall that if S is a subset of a partially ordered set X, then the least upper bound of S (should 
it exist) is an upper bound m of S such that m < u for every upper bound u of S. If X is the 
following partially ordered set (in which d < a is indicated by a joining with a line and having 
a higher than d), 

a b 

X 

c d 

prove that the subset 5 = (c, d} has an upper bound but no least upper bound. 

6.44 Let G be an abelian group, and let S C G be a subgroup. 

(i) Prove that there exists a subgroup H of G maximal with the property that H fl S — {0}. 
Is this true if G is not abelian? 

(ii) If H is maximal with H n 5 = (0), prove that G/(H + S) is torsion. 

6.45 Call a subset C of a partially ordered set X cofinal if, for each x e X, there exists c e C with 
x < c. 

(i) Prove that Q and X are cofinal subsets of R. 

(ii) Prove that every chain X contains a well-ordered cofinal subset. 

Hint. Use Zorn's lemma on the family of all the well-ordered subsets of X. 

(iii) Prove that every well-ordered subset in X has an upper bound if and only if every chain 
in X has an upper bound. 

6.46 (i) Give an example of a commutative ring containing two prime ideals P and Q for which 

P fl Q is not a prime ideal. 

(ii) If P| D Pj 3 • ■ ■ 3 P n + 1 ''' is a decreasing sequence of prime ideals in a commu¬ 

tative ring R, prove that fj (1> j Pn is a prime ideal. 

(iii) Prove that every commutative ring R has a minimal prime ideal ; that is, a prime ideal I 
for which there is no prime ideal P with PCI. 

Hint. Partially order the set of all prime ideals by reverse inclusion: P < Q means 
P^Q. 

6.47 Let V be a vector space, and let 5 be a subspace of V. Prove that there exists a subspace W of 
V maximal with the property that W n S = 0 and that V = S ® W. 

6.48 Recall that a subset 5 of a commutative ring R is called multiplicatively closed if 0 ^ 5 
and s,s' e S implies ss' e S. Complete Exercise 6.9 on page 325 by proving that if S is 
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a multiplicatively closed set with S Cl I = 0, then there exists an ideal J maximal with the 
property that J contains I and J fl S — 0. 

6.49 Prove that every nonunit in a commutative ring lies in some maximal ideal. [This result was 
used to solve Exercise 6.16(ii) on page 326.] 

6.50 If pi , ..., p„ are distinct primes in Z, prove that J~p\, .... J~Pn is a linearly independent list 
over Q. 

6.51 Prove that a field extension E/k may not have an intermediate field K with K/k algebraic 
and E/ K purely transcendental. 

Hint. Prove that there is no intermediate field K with Q C K C C with C/K purely 
transcendental. 

6.52 If E = k(X) is an extension of a field k. and if every pair u, v e X is algebraically dependent, 
prove that tr. deg(£ /k) < 1. Conclude that if 

k C k\ C k 2 C • • • 

is a tower of fields with tr. deg (k n /k) = 1 for all n > 1, then tr. deg (k*/k) = 1, where 

** = U„>1**- 

6.53 Prove that if k is the prime field of a field E and if tr. deg (E/k) < Kq, then E is countable. 

6.54 Prove that two algebraically closed fields of the same characteristic are isomorphic if and only 
if they have the same transcendence degree over their prime fields. 

Hint. Use Lemma 6.61. 

6.55 (i) If k C B C E is a tower of fields, prove that 

tr. deg(E/k) = tr. deg (E/B) + tr. deg (B/k). 

Hint. Prove that if X is a transcendence basis of B/k and Y is a transcendence basis of 
E/B, then X U Y is a transcendence basis for E/k. 

(ii) Let E/k be a field extension, and let B and C be intermediate fields. Prove that 

tr. deg(B V C) + tr. deg(B n C) = tr. deg(B) + tr. deg((C), 

where B V C is the compositum. 

Hint. Extend a transcendence basis of B fl C to a transcendence basis of B and to a 
transcendence basis of C. 

6.56 Prove that <p e k(x) has degree 1 if and only if <p is a linear fractional transformation. 

6.57 Prove, for any field k, that PGL(2, k) = LF(Jt), where PGL(2, k ) = GL(2, k)/Z( 2, k) and 
Z(2, k) is the (normal) subgroup of GL(2, k) consisting of all the (nonzero) scalar matrices 
[Z(2, k) is the center of GL(2, k)]. 

6.58 Prove that if E / k is an algebraic extension and fl e E is both separable and purely inseparable, 
then fl e k. 

6.59 Give an example of two intermediate fields A and B of an extension E/k with An B = k but 
that are not linearly disjoint in the sense of the definition on page 372. 
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6.5 Varieties 

Analytic geometry gives pictures of equations. For example, we picture a function 
/: R —* R as its graph, which consists of all the ordered pairs (a. f{o)) in the plane; 
that is, / is the set of all the solutions (a. b) e R 2 of 

g(x,y) = y- f(x) = 0. 

We can also picture equations that are not graphs of functions. For example, the set of all 
the zeros of the polynomial 

h(x, y) = x 2 + y 2 - 1 

is the unit circle. We can also picture simultaneous solutions in R 2 of several polyno¬ 
mials of two variables, and, indeed, we can picture simultaneous solutions in R" of sev¬ 
eral polynomials of n variables. But there is a very strong connection between the rings 
k[x i,, x n ] = k[X] and the geometry of subsets of k n going far beyond this. Given a set 
of polynomials ..., ft(X) in n variables, call the subset V C k n consisting of their 

common zeros a variety. Of course, we can study varieties because solutions of systems 
of polynomial equations (an obvious generalization of systems of linear equations) are in¬ 
trinsically interesting. On the other hand, some systems are more interesting than others; 
investigating a problem may lead to a variety, and understanding the variety and its prop¬ 
erties (e.g., irreducibility, dimension, genus, singularities, and so forth) may contribute 
to an understanding of the original problem. For example, Leibniz raised the question 
of determining those functions that could be integrated explicitly in terms of "elementary 
functions:” algebraic combinations of polynomials, trigonometric and inverse trigonomet¬ 
ric functions, exponentials, and logarithms. In 1694, John Bernoulli conjectured that the 
integral arising from the arclength of an ellipse could not be so integrated. Similar integrals 
arise in finding periods of pendulums, as well as in other problems in mechanics, and all 
of them can be reduced to the form 


r x dt 
Jo VpU)' 

where p(t) is either a cubic or quartic polynomial; that is, the polynomial y 2 — p(x) in 
R[x, y] has arisen. In analogy to 


sin 1 x — 

Jo 


dt 


VT^T 2 


Jacobi introduced the inverse function 


u *(x) = 


/' 

Jo 


dt 


and he called u(x) an elliptic function. Just as sinx determines the unit circle via the 
parametrization (sinx, cosx), so, too, does an elliptic function determine a curve via the 
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parametrization (u(x), u'(x)), where u'(x) is the derivative of u(x). It was also noted that 
elliptic functions are periodic (as is sin x); that is, there is some number q with u(x+mq) = 
u(x) for all real x and all m e Z. With the development of integration of functions of a 
complex variable. Gauss viewed elliptic functions as 


u ] (z) 


L 


dw 


o V/Aw)' 


where p(w) is either a cubic or quartic polynomial in C[?./;]; that is, the polynomial 
m 2 — p(z) in C[z, u] has arisen. In viewing elliptic functions in this way, he saw that 
they are doubly periodic, that is, there are (complex) numbers q and r so that 


u(z + mq + nr) — u(z.) 


for all complex z and all m, n e Z. Moreover, u (z ) determines a complex curve (called an 
elliptic curve) consisting of all (u(z), u'{z)). A one-dimensional complex space is a two- 
dimensional real space, and double periodicity says that this complex curve is a torus; that 
is, the surface of a doughnut, possibly having several holes. One consequence is that the 
behavior of an elliptic function depends on whether the associated curve is nonsingular; 
that is, whether it has an appropriate tangent space at every point. The subject was further 
enriched when Riemann introduced Riemann surfaces into the study of elliptic functions 
and elliptic curves. This is the beginning of a very rich subject 10 ; indeed, further deep 
investigations of such matters were essential in A. Wiles’s proof of Fermat’s last theorem, 
in which he proves elliptic curves have certain sophisticated properties. More generally, 
the interplay between k[xi,, x n ] and varieties has evolved into what is nowadays called 
algebraic geometry, and this section may be regarded as an introduction to this subject. 

Notation. Let k be a field and let k" denote the set of all n -tuples 
k n — { a — {a\,..., a n ): at e k for all i }. 

The polynomial ring k[x\,..., x n ] in several variables may be denoted by k[X\, where X 
is the abbreviation 

X = (x\, ...,x n ). 

In particular, f(X) e k[X] may abbreviate f(x\,..., x n ) e k[x\,..., x n ]. 

In what follows, we regard polynomials fix i, ..., x n ) e k[x\, ..., x n ] as functions of 
n variables k n —> k. Here is the precise definition. 


Definition. If f(X) e k[X] define its polynomial function f ’: k" —> k by evaluation: 
If (flj,..., a n ) e k", then 

f b : (ai,... ,a n ) h* f{a\, .. .,a n ). 

The next proposition generalizes Corollary 3.28 from one variable to several variables. 

10 For a leisurely and more detailed account of the development of elliptic functions, see Chapters 14 and 15 of 
Stillwell, Mathematics and Its History. 
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Proposition 6.89. Let k be an infinite field and letk[X] = k[x i,, x n ], If f{X), g(X) e 
k[X] satisfy f~° = g b , then f{x i,..., x n ) — g(x i,- x n ). 

Proof The proof is by induction on n > 1; the base step is Corollary 3.28. For the 
inductive step, write 

f(X, y) = '^p i (X)y l and g(X, y) = ^ ~^q({X)y l , 
i i 


where X denotes (x\, ..., x n ). If f b — g\ then we have f(a, f) = g(a, ft) for every 
a e k" and every f e k. For fixed a e k n , define F a (y) = pi(a)y l and G a (y) = 
qi ( a)y‘. Since both F a (y) and G a (y) are in ^[y], the base step gives pj (a) = q, (a) for 
all a e k n . By the inductive hypothesis, piiX) — qi(X) for all i, and hence 

f(X, y) = Pi(X)y l = J2qi ( X)y‘ = y), 

i i 


as desired. • 

As a consequence of this last proposition, we drop the / ’ notation and identify polyno¬ 
mials with their polynomial functions when k is infinite. 

Definition. If f{X ) e k[X] = k[x\,..., x n \ and f{a) = 0, where a e k n , then a is 
called a zero of f(X). [If fix) is a polynomial in one variable, then a zero of fix) is also 
called a root of fix).] 


Proposition 6.90. Ifk is an algebraically closed field and f(X) e k\X\ is not a constant, 
then f(X) has a zero. 

Proof. We prove the result by induction on n > 1, where X — (xi, ..., x„). The base 
step follows at once from our assuming that k l — k is algebraically closed. As in the 
previous proof, write 

f(X,y) = Y J giW. 

i 

For each a e k n , define f a )y) — gi)a)y l ■ If /(2f, y) has no zeros, then each f a {y) e 
k[y ] has no zeros, and the base step says that f„(y) is a nonzero constant for all a e 
k" . Thus, gi(a) — 0 for all i > 0 and all a e k n . By Proposition 6.89, which applies 
because algebraically closed fields are infinite, gifiX) — 0 for all i > 0, and so f(X, y) = 
go(2f)y° = g()(X). By the inductive hypothesis, go{X) is a nonzero constant, and the proof 
is complete. • 

We now give some general definitions describing solution sets of polynomials. 
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Definition. If F is a subset of k[X ] = k[x \, ..., x„], then the variety 11,12 defined by F 
is 

Var (F) = {a e k n : f (a) = 0 for every f (X) e F}\ 
thus, Var (F) consists of all those a e k n which are zeros of every f (X) <= F. 


Example 6.91. 

(i) If k is algebraically closed, then Proposition 6.90 says that if /( X) e k[X] is not 
constant, then Var(/(A)) ^ 0. 

(ii) Here are some varieties defined by two equations: 

Var(x, y) = [(a, b) e k 2 : x = 0 and y = 0} = {(0, 0)} 


and 

Var(xy) = x-axis U y-axis. 

(iii) Here is an example in higher-dimensional space. Let A be an m x n matrix with entries 
in k. A system of m equations in n unknowns, 

AX = B, 

where B is an n x 1 column matrix, defines a variety, Var(.4 X — B), which is a subset of 
k". Of course, AX — B is really shorthand for a set of m linear equations in n variables, 
and Var (AX = B) is usually called the solution set of the system AX — B\ when this 
system is homogeneous, that is, when B — 0, then Var (AX — 0) is a subspace of k", 
called the solution space of the system. ■* 

The next result shows that, as far as varieties are concerned, we may just as well assume 
that the subsets F of k[X] are ideals of £[V], 

11 There is some disagreement about the usage of this term. Some call this an affine variety , in contrast to the 
analogous projective variety. Some insist that varieties should be irreducible , which we will define later in this 
section. 

12 The term variety arose as a translation by E. Beltrami (inspired by Gauss) of the German term Mannig- 
faltigkeit used by Riemann; nowadays, this term is usually translated as manifold. The following correspondence, 
from Aldo Brigaglia to Steven Kleiman, contains more details. 

“I believe the usage of the word varieta by Italian geometers arose from the (unpublished) Italian translation 
of Riemann’s Habilitationsvortrag, which was later translated into French by J. Hoiiel and published in the Italian 
journal Annali. Indeed, Beltrami wrote to Hoiiel on 8 January, 1869: 

J’ai traduit Mannigfaltigkeit par varieta, dans le sens de multitudo variarum rerum... 

And later, on 14 February, 1869, he wrote 

Je croirais toujours convenable de traduire Mannigfaltigkeit par variete: j’ai remarque que Gauss, 
dans ses Memoires sur les residus biquadratiques appelle en latin varietas la meme chose qui, dans 
les comptes-rendus rediges par lui meme en allemand dans les Gelehrte Anzeige, est designee par 
Mannigfaltigkeit. 

The correspondence of Beltrami and Hoiiel can be found in the beautiful book La decouverte de la geometrie non 
euclidienne sur la pseudosphere: les lettres d’Eugenio Beltrami a Jules Hoiiel (1868-1881), edited by L. Boi, L. 
Giacardi, and R. Tazzioli, and published by Blanchard, Paris, 1998.” 
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Proposition 6.92. Let k be afield, and let F and G be subsets ofk[X], 

(i) If F C G c k[X], then Var(G) C Var(F). 

(ii) If F C k[X] and I — ( F ) is the ideal generated by F, then 

Var (F) = Var (/). 

Proof, (i) If a e Var(G), then g(a) — 0 for all g(X) e G; since F C G, it follows, in 
particular, that f(a) — 0 for all f(X) e F. 

(ii) Since F C ( F) = 7, we have Var(7) C Var (F), by part (i). For the reverse inclusion, 
let a e Var(T’), so that f{a) = 0 for every f(X) e F. If g(X) e 7, then g(X) = 
Jfi r,(X)fi(X), where r/(X) e k[X] and f(X) e F\ hence, g(a) = n(a)fi(a) = 0 
andfleVar(Z). • 

It follows that not every subset of k" is a variety. For example, if n = 1, then k[x] is a 
PID. Hence, if F is a subset of k[x], then ( F ) = (gCO) for some g(x) e £[x], and so 

Var(F) = Var((F)) = Var((g(x))) = Var(g(x)). 

But if g(x) f 0, then it has only a finite number of roots, and so Var (F) is finite. If k 
is algebraically closed, then it is an infinite field, and so most subsets of k l — k are not 
varieties. 

In spite of our wanting to draw pictures in the plane, there is a major defect with k — R: 
Some polynomials have no zeros. For example, /(x) = x 2 + 1 has no real roots, and so 
Var(x 2 + 1) = 0. More generally, g(xi,..., x„) = x 2 + • • • + x 2 + 1 has no zeros in R", 
and so Var(g(X)) = 0. Since we are dealing with (not necessarily linear) polynomials, it 
is a natural assumption to want all their zeros available. For polynomials in one variable, 
this amounts to saying that k is algebraically closed and, in light of Proposition 6.90, we 
know that Var (f(X)) ^ 0 for every nonconstant f (X) e k[X] if k is algebraically closed. 
Of course, varieties are of interest for all fields k, but it makes more sense to consider the 
simplest case before trying to understand more complicated problems. On the other hand, 
many of the first results are valid for any field k. Thus, we will state the hypothesis needed 
for each proposition, but the reader should realize that the most important case is when k 
is algebraically closed. 

Here are some elementary properties of Var. 

Proposition 6.93. Let k be a field. 

(i) Var(l) = 0 and Var(0) = k n , where 0 is the zero polynomial. 

(ii) If I and J are ideals in k[X], then 

Var (IJ) = Var(7 0 7) = Var(7) U Var(7), 
where IJ = {]T ( fi(X)gi(X): ffiX) e I and gi(X) eij. 
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(iii) If {h '■ I € L] is a family of ideals in k[X], then 

Var(E ! <) = fl VarC h), 

t l 

where f is the set of all finite sums of the form r^ + • • • + re with r^ e h; r 

Proof, (i) That Var(l) = 0 is clear, for the constant polynomial 1 has no zeros. That 
Var(O) = k" is clear, for every point a is a zero of the zero polynomial. 

(ii) Since IJ C I n J. it follows that Var(77) 0 Var(7 fl 7); since IJ C 7, it follows that 
Var(77) 0 Var(7). Hence, 

Var(77) 0 Var(7 0 7)3 Var(7) U Var(7). 

To complete the proof, it suffices to show that Var(77) C Var(7) U Var(7). If a £ 
Var(7) U Var(7), then there exist f(X) e I and g(A) e 7 with f(a) 0 and g(a) 0. 
But f (X)g(X) e 77 and ( fg){a ) = f(a)g(a) ^ 0, because k is a domain. Therefore, 
a Var(77), as desired. 

(iii) For each 7, the inclusion f C f l f gives Var(^ ( f j C Var( If), and so 

vm (e '<) s n Var(^). 

t t 

For the reverse inclusion, if g(X) e ff, If , then there are finitely many l with g(X) = 
ft, where fi(X) e f. Therefore, if a e P| £ Var(7^), then fi(a) — 0 for all 7, and so 
g(a) = 0; that is, a e Var(J] £ if). • 

Definition. A topological space is a set X together with a family T of subsets of X, 
called closed sets, 13 which satisfy the following axioms: 

(i) 0 e T and Ie7; 

(ii) if Fi, 72 e T, then 7) U Fj e T\ that is, the union of two closed sets is closed; 

(iii) if {Ff: 7 e L] C T, then p| f Ff e T\ that is, any intersection of closed sets is also 
closed. 

Proposition 6.93 shows that the family of all varieties are the closed sets that make k n 
a topological space. Varieties are called Zariski closed sets, and they are very useful in 
the deeper study of k[X], The usual way of regarding R as a topological space has many 
closed sets; for example, every closed interval is a closed set. In contrast, the only Zariski 
closed sets in R, aside from R itself, are finite. 

Definition. A hypersurface in k n is a subset of the form Var(/) for some nonconstant 
f(X)ek[X], 


13 We can also define a topological space by specifying its open subsets which are defined as complements of 
closed sets. 
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Corollary 6.94. Every variety Var(7) in k" is the intersection of finitely many hypersur¬ 
faces. 

Proof. By the Hilbert basis theorem, there are f\{X), ..., ft(X) e k[X ] with I — 
(ft, ■ ■ ■, ft) = E, (//)■ fi y Proposition 6.93(iii), we have Var(/) = f] t Var(/,). • 

Given an ideal I in k[X\, we have just defined its variety Var(/) C k n . We now reverse 
direction: Given a subset A C k", we assign an ideal in k\X] to it; in particular, we assign 
an ideal to every variety. 

Definition. If A c k". define its coordinate ring k[A] to be the commutative ring 

k[A] = {f b \A: f(X)ek[X]} 

under pointwise operations [recall that f b : k n —»■ k is the polynomial function arising from 

/(*)]• 

The polynomial ffx i, ..., x n ) — x; e k[X], when regarded as a polynomial function, 
is defined by 

Xi : (ai,..., a n ) i->- a,-; 

that is, Xi picks out the i th coordinate of a point in k". The reason for the name coordinate 
ring is that if a e V, then (xi (a),... ,x n (a)) describes a. 

There is an obvious ring homomorphism res: k[X] -* k[A], given by f(X) f b |A, 
and the kernel of this restriction map is an ideal in k[X\. We will assume, from now on, 
that all fields k are infinite, and so we will drop the notation / ’. 

Definition. If A C k". define 

Id(A) = {f(X) e k[X] = k[x i,..., x„]: f(a) — 0 for every a e A}. 

The Hilbert basis theorem tells us that Id(A) is always a finitely generated ideal. 
Proposition 6.95. If A C k", then there is an isomorphism 

k[X]/Id(A) = Jt[A], 
where k[A\ is the coordinate ring of A. 

Proof. The restriction map res: k[X] —> k[A] is a surjection with kernel Id(A), and so 
the result follows from the first isomorphism theorem. Note that two polynomials agreeing 
on A lie in the same coset of Id(A). • 

Although the definition of Var(,F) makes sense for any subset F of k[X], it is most 
interesting when F is an ideal. Similarly, although the definition of Id(A) makes sense 
for any subset A of k", it is most interesting when A is a variety. After all, varieties are 
comprised of solutions of (polynomial) equations, which is what we care about. 
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Proposition 6.96. Let k be a field. 

(i) ld(0) = k[X] and, ifk is infinite, Id (k") = {0}. 

(ii) If A C B are subsets ofk", then Id ( B) C Id(A). 

(iii) If {Ai: l e L} is a family of subsets ofk", then 

Id(U A 0 = H Id(A/). 

t l 

Proof, (i) By definition, f(X) e Id(A) for some subset A C k" if and only if f(a) = 0 
for all a e A; hence, if f(X) <f IdCA), then there exists a e A with f(a) f 0. In 
particular, if A — 0, every f (X) e k[X] must lie in Id(0), for there are no elements 
a e 0. Therefore, ld(0) = k[X], 

If f(X) e Id(fc"), then / b = 0 b , and so f(X) = 0, by Proposition 6.89, because k is 
infinite. 

(ii) If ffX) e Id 1/1), then f(b) — 0 for all b e B\ in particular, f(a) = 0 for all a e A, 
because A C B, and so f (X) e Id(A). 

(iii) Since Ai C [Jwe have Id(Ag) 0 Id([J f A() for all l\ hence, p|^Id(A^) 0 
Id((J £ A^). For the reverse inclusion, suppose that f(X) e p| t ; Id(A^); that is, f(af) — 0 
for all l and all ai e At. If b e At, then b e Ag for some l, and hence f(b) — 0; 
therefore, f (X) e IdfU^ Ar). • 

We would like to have a formula for Id(A OB). Certainly, it is not true that Id(A OB) = 
Id(A) U Id( B ), for the union of two ideals is almost never an ideal. 

The next idea arises in characterizing those ideals of the form Id( V) when V is a variety. 

Definition. If I is an ideal in a commutative ring R, then its radical, denoted by fl, is 
V7 — {r e R : r m e I for some integer m > 1}. 

An ideal I is called a radical ideal 14 if 

V7 = I. 

Exercise 6.62 on page 397 asks you to prove that \fl is an ideal. It is easy to see that 
I C V7, and so an ideal I is a radical ideal if and only if \J1 C I. For example, every 
prime ideal P is a radical ideal, for if /" e P, then f e P. Here is an example of an ideal 
that is not radical. Let b e k and let I — ({x — b) 2 ). Now I is not a radical ideal, for 
(x — b) 2 e I while x — b f I. 

Definition. An element a in a commutative ring R is called nilpotent if « f 0 and there 
is some n > 1 with a" — 0. 

Note that I is a radical ideal in a commutative ring R if and only if R/I has no nonzero 
nilpotent elements. A commutative ring having no nilpotent elements is called reduced. 

14 This term is appropriate, for if r m € I. then its mth root r also lies in I. 
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Proposition 6.97. If an ideal I = Id (A) for some A C k n , then it is a radical ideal. 
Hence, the coordinate ring k[A\ has no nonzero nilpotent elements. 

Proof. Since I C is always true, it suffices to check the reverse inclusion. By hy¬ 
pothesis, I — Id(A) for some A C k n ; hence, if / e \fl, then f m e Id(A); that is, 

f(a) m — 0 for all a e A. But the values of f(a) m lie in the field k, and so f (a) m — 0 

implies f(a) = 0; that is, / e Id(A) = I. • 

Proposition 6.98. 

(i) If I and J are ideals, then I D J = \fl fl \fl. 

(ii) If I and J are radical ideals, then I fl J is a radical ideal. 

Proof, (i) If / e ~Jl n J, then f m g in J for some m > 1. Hence, f m e I and f m e J, 
and so f g \fl and / e VT; that is, / e \fl fl \fj. 

For the reverse inclusion, assume that / e \fl fl \fl, so that f m e I and f q e J. We 
may assume that m > q, and so f m e / fl 7; that is, / e I fl J . 

(ii) If I and J are radical ideals, then I — \H and J = \fl and 

i n / c V/ n / = V7 nV7 = / n j. • 


We are now going to prove Hilbert’s Nullstellensatz for C[X], The reader will see 
that the proof we will give generalizes to any uncountable algebraically closed field. The 
theorem is actually true for all algebraically closed fields (we shall prove it in Chapter 11), 
and so the proof here does not, alas, cover the algebraic closures of the prime fields, for 
example, which are countable. 



The next proof will use a bit of cardinality. 
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Theorem 6.100 (Weak Nullstellensatz 15 over C). If f\(X),..., f,(X) e C[X], then 
I — (/i, ..., ft) is a proper ideal in C[X] if and only if Var(/i, ...,/}) ^4 0. 

Remark. The reader should note that the only properties of C used in the proof are that 
it is an uncountable algebraically closed field. a 

Proof. It is clear that if Var(7) ^ 0, then I is a proper ideal, because Var(C[X]) = 0. 

For the converse, suppose that I is a proper ideal. By Corollary 6.40, there is a maximal 
ideal M containing /, and so K — C[X]/M is a field. It is plain that the natural map 
C[X] — > C [X]/M — K carries C to itself, so that K /C is an extension field; it follows that 
K is a vector space over C. Now C[X] has countable dimension, as a C-space, for a basis 
consists of all the monic monomials 1, x, x , x ,.... Therefore, dime (AT) is countable 
(possibly finite), for it is a quotient of C[X], 

Suppose that K is a proper extension of C; that is, there is some t e K with t C. 
Since C is algebraically closed, t cannot be algebraic over C, and so it is transcendental. 
Consider the subset B of K , 


B = {1 /{t-c): ceQ 

(note that f-c/0 because t f C). The set B is uncountable, for it is indexed by the 
uncountable set C. We claim that B is linearly independent over C; if so, then the fact 
that dime (TO is countable is contradicted, and we will conclude that K = C. If B is 
linearly dependent, there are nonzero ai,... ,a r e C and distinct ci,... ,c r e C with 
Y7i =t a i/(f — c i ) = 0. Clearing denominators, we have a polynomial h(t) e C[f]: 

h(t) = — ci) • • • (t — C{) • • • (t — c r ) — 0. 

i 

Now h{c\) — a i (ci — cf) • • ■ (ci — c r ) 0, so that h(t) is not the zero polynomial. But this 
contradicts t being transcendental; therefore, K = C. Lemma 6.99 now applies to show 
that Var (M) f 0. But Var (M) c Var(7), and this completes the proof. • 

Consider the special case of this theorem for I — (/(x)) c C[x], where /(x) is not 
a constant. To say that Var(/) C C is nonempty is to say that /(x) has a complex root. 
Thus, the weak Nullstellensatz is a generalization to several variables of the fundamental 
theorem of algebra. 

Theorem 6.101. Ifk is an ( uncountable ) algebraically closed field, then every maximal 
ideal M in k[x i, ..., x n ] has the form 

M = (x\ — a\,..., x n — a n ), 

where a = {a\, ... ,a n ) e k'\ and so there is a bijection between k n and the maximal 
ideals in k[x\, ..., x„]. 


15 The German word Nullstelle means root. In the context of polynomials of several variables, we may translate 
it as zero , and so Nullstellensatz means the theorem of zeros. 
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Remark. The uncountability hypothesis will be removed in Chapter 11. ◄ 

Proof. Since k[X]/M = k. Lemma 6.99 gives Var (M) ^ 0. As in the proof of that 
lemma, there are constants a,- e k with x; + M — cii + M for all i, and so x,- — at e M. 
Therefore, there is an inclusion of ideals 

(x\ — at,, x„ — a n ) c M. 

But (xi —ai,..., x n — a n ) is a maximal ideal, by Exercise 6.6(i) on page 325, and so 
M = (xi — a \,..., x„ — a n ). • 

The following proof of Hilbert’s Nullstellensatz uses the “Rabinowitch trick” of imbed¬ 
ding a polynomial ring in n variables into a polynomial ring in n + 1 variables. Again, 
uncountability is not needed, and we assume it only because our proof of the weak Null¬ 
stellensatz uses this hypothesis. 

Theorem 6.102 (Nullstellensatz). Let k be an ( uncountable ) algebraically closed field. 
If I is an ideal in k[X], then Id(Var(/)) = V7. Thus, f vanishes on Var(7) if and only if 
f m e I for some m > 1. 

Proof. The inclusion Id(Var(7)) 0 fl is obviously true, for if f m (a) — 0 for some 
m > 1 and all a e Var(7), then f (a) — 0 for all a, because f (a) e k. 

For the converse, assume that h e Id(Var(7)), where 7 = (f \that is, if 
f (a) = 0 for all i, where a e k", then h (a) — 0. We must show that some power of h lies 
in 7. Of course, we may assume that h is not the zero polynomial. Let us regard 

k[x i,... ,x„] c k[x i-,x„, y]; 

thus, every fi(x \,... ,x„) is regarded as a polynomial in n + 1 variables that does not 
depend on the last variable y. We claim that the polynomials 

fi,...,f„i-yh 

in k[x i,..., x„, y] have no common zeros. If (a \,..., a „, b) e k" +l is a common zero, 
then a = (a\,..., a n ) e k n is a common zero of ft,..., f t , and so h(a) — 0. But 
now 1 — bh(a ) = 1 f 0. The weak Nullstellensatz now applies to show that the ideal 
(/i,..., fi, 1 — yh) in k[x i,..., x„, v] is not a proper ideal. Therefore, there are g j,..., 
gr +1 e k[x 1 ,... ,x„, y] with 

1 = flgl H-b ftgt + (1 - yh)g t + 1 - 

Make the substitution y — \ / h, so that the last term involving g t +\ vanishes. Rewriting, 
gi(X, y) - o u j( x )y j ’ and so gi( x > h ~ l ) = Y?j =o u j( x )h~ j ■ It follows that 

h d -gi(X,h~ l ) ek[X]. 

Therefore, if m — max{c/|,..., d ,), then 

h m = (h m gi)f[ + • • • + (h m g t )f t e I. . 
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We continue the study of the operators Var and Id. 


Proposition 6.103. Let k be any field. 

(i) For every subset F C k n , 

Var(Id(F)) d F. 

(ii) For every ideal I C k[X], 

Id(Var(7)) D I. 

(iii) If V is a variety of k n , then Var(Id(y)) = V. 

(iv) If F is subset of k n , then F, the intersection of all those varieties that contain F, is 
equal to Var(Id( F)). One calls F the Zariski closure 16 of F. 

(v) IfV C V* C k" are varieties, then 

y* = vu v* - v, 

the Zariski closure ofV* — V. 


Proof, (i) This result is almost a tautology. If a e F, then g(a) — 0 for all g(X) e Id( F). 
But every g(X) e Idf F) annihilates F, by definition of Idf F), and so a e Var (Id ( F)). 
Therefore, VarlldlT’)) 3 F. 

(ii) Again, we merely look at the definitions. If f (X) e 7, then f(a) = 0 for all a e 
Var(7); hence, f (X) is surely one of the polynomials annihilating Var(7). 

(iii) If V is a variety, then V = Var(7) for some ideal 7 in k[X], Now 

Var (Id (Var (7))) 3 Var(7), 

by part (i). Also, part (ii) gives Id(Var(7)) 3 /, and applying Proposition 6.92(i) gives the 
reverse inclusion 

Var(Id(Var(7))) C Var(7). 

Therefore, Var(Id(Var(7))) = Var(7); that is, Var(Id(V)) = V. 

(iv) By Proposition 6.93(iii), F = PltOF V is a variety containing F. Since Var(Id(7’)) 
is a variety containing F, it is one of varieties V being intersected to form F , and so F C 
Var(Id(7’)). For the reverse inclusion, it suffices to prove that if V is any variety containing 
F, then V ^ Var(Id(7’)). If V 3 F, then Id(V) C Id(T’), and V = Var(Id(V)) 3 
Var(Id(7’)). 

(v) Since V* — V c V*, we have V* — V c V* = V*. By hypothesis, V c V*, 
and so V U V* — V c V*. For the reverse inclusion, there is an equation of subsets, 
V* = VU (V* — V). Taking closures, 

y* = v* = v u y* - y = yu y* - y, 

because V = V. • 


*®If F is a subset of a topological space X. then its closure is defined as the intersection of all the closed sets 
in X that contain F. 



388 


Commutative Rings II Ch. 6 


Corollary 6.104. 

(i) IfV i and V3 are varieties and Id(Vi) = IcUVS), then V\ — V3. 

(ii) Let k be an ( uncountable ) algebraically closed field. If I\ and Ii are radical ideals 
and Var(/i) = WariH), then I\ = 12- 


Proof. (i) If Id(Vi) = Id(V 2 ), then Var(Id(Vi)) = Var(Id(V 2 )); it now follows from 
Proposition 6.103(iii) that V\ — V 2 . 

(ii) If Var(/i) = Var(/ 2 ), then Id(Var(/i)) = Id(Var(/ 2 )). By the Nullstellensatz, which 
holds because k is an (uncountable) algebraically closed field, ~JT\ — sfh.- Since I\ and 
I 2 are radical ideals, by hypothesis, we have I\ — h. • 

Can a variety be decomposed into simpler subvarieties? 

Definition. A variety V is irreducible if it is not a union of two proper subvarieties; that 
is, V f^W' U W", where both W' and W" are varieties that are proper subsets of V. 

Proposition 6.105. Every variety V in k n is a union of finitely many irreducible subvari¬ 
eties: 

v = y 1 uy 2 u..-uy m . 

Proof. Call a variety W e k n good if it is irreducible or a union of finitely many irre¬ 
ducible subvarieties; otherwise, call W bad. We must show that there are no bad varieties. 
If W is bad, it is not irreducible, and so VV = W U W", where both VV" and W" are proper 
subvarieties. But a union of good varieties is good, and so at least one of W' and W" is 
bad; say, W' is bad, and rename it VV" = VV]. Repeat this construction for VV'i to get a bad 
subvariety W 2 - It follows by induction that there exists a strictly descending sequence 


W 2 Wi D ■ ■ ■ D W n D • • • 


of bad subvarieties. Since the operator Id reverses inclusions, there is a strictly increasing 
chain of ideals 

Id(W) C Id(Wi) C • • • C Id(W„) C ■ ■ ■ 

[the inclusions are strict because of Corollary 6.104(i)], and this contradicts the Hilbert 
basis theorem. We conclude that every variety is good. • 

Irreducible varieties have a nice characterization. 

Proposition 6.106. A variety V hi k n is irreducible if and only if Id( y) is a prime ideal 
in k[X], Hence, the coordinate ring k[V] of an irreducible variety V is a domain. 

Proof. Assume that V is an irreducible variety. It suffices to show that if f\ (A), / 2 (A) f 
Id(y), then /i(A)/ 2 (A) (f Id(y). Define, for 1 = 1,2, 


Wi = V nVar(/, (A)). 
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Note that each Wj is a subvariety of V, for it is the intersection of two varieties; moreover, 
since fi(X) f. Id(V), there is some a, e V with /, (a,) 0, and so W, is a proper 

subvariety of V. Since V is irreducible, we cannot have V = W i U VVS. Thus, there is 
some b e V that is not in W[ U W 2 ; that is, f\ (b) ^ 0 ^ fi(b). Therefore, f\(b) fi(b) ^ 0, 
hence f\ (X)fi(X) f Id(V), and so Id(V) is a prime ideal. 

Conversely, assume that Id(V) is a prime ideal. Suppose that V = Vi U Vi, where V 1 
and Vi are subvarieties. If Vi C V. then we must show that V = V\. Now 

Id(V) = Id(Ci) nid(V 2 ) 2 Id(Vi)Id(V 2 ); 

the equality is given by Proposition 6.96, and the inequality is given by Exercise 6.10 on 
page 325. Since Id(V) is a prime ideal. Proposition 6.13 says that Id(Vi) C Id( V) or 
Id(V 2 ) c Id(V). But Vi C V implies Id(y 2 ) D Id( V ), and we conclude that Id(Vi) C 
Id(V). Now the reverse inequality Id(Vi) 3 Id( V ) holds as well, because Vi c V, and so 
Id(Vi) = \d( V ). Therefore, V 1 = V, by Corollary 6.104, and so V is irreducible. • 

We now consider whether the irreducible subvarieties in the decomposition of a variety 
into a union of irreducible varieties are uniquely determined. There is one obvious way to 
arrange nonuniqueness. If P C Q in k[X] are two prime ideals (for example, (x) C (x. y ) 
are such prime ideals in k[x , y]), then Var( Q) C Var(P); if Var(P) is a subvariety of a 
variety V, say, V — Var (P) U Vi U • • • U V m , then Var (Q) can be one of the Vj or it can be 
left out. 

Definition. A decomposition V = V\ U • • • U V m is an irredundant union if no V, can 
be omitted; that is, for all i, 

V ^ Vi U--- U Vi U--- U V m . 

Proposition 6.107. Every variety V is an irredundant union of irreducible subvarieties 

V = Vi U ... U V m ; 

moreover, the irreducible subvarieties Vi are uniquely determined by V. 

Proof. By Proposition 6.105, V is a union of finitely many irreducible subvarieties; say, 
V = V\ U • • • U V m . If m is chosen minimal, then this union must be irredundant. 

We now prove uniqueness. Suppose that V = VV'i U • • • U W s is an irredundant union of 

irreducible subvarieties. Let X — {Vi, ..., V m } and let Y = {Wi.W s }; we shall show 

that X — Y. If Vi e X, we have 

Vi = Vi n y - U (Vi n wp. 

j 

Now Vi fl Wj / 0 for some j ; since V; is irreducible, there is only one such Wj. Therefore, 
Vj — Vi fl Wj, and so Vj C Wj. The same argument applied to Wj shows that there is 
exactly one V f with Wj c V(. Hence, 


Vi c Wj c V £ . 
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Since the union Vi U • • • U V m is irredundant, we must have V; = Vi , and so V,- = Wj = Vi ; 
that is, V, e Y and X C. Y. The reverse inclusion is proved in the same way. • 

Definition. An intersection / = J\ IT • • • IT J m is irredundant if no /, can be omitted; 
that is, for all i, 

Corollary 6.108. Every radical ideal J in k\X] is an irredundant intersection of prime 
ideals, 

J = Pi n • • • n P,„; 

moreover, the prime ideals Pj are uniquely determined by J . 

Remark. This corollary is generalized in Exercise 6.72 on page 399: An ideal in an 
arbitrary commutative noetherian ring is a radical ideal if and only if it is an intersection 
of finitely many prime ideals. ■* 

Proof. Since / is a radical ideal, there is a variety V with J = Id(V). Now V is an 
irredundant union of irreducible subvarieties, 

V = Vi U---U v„„ 


so that 


J = Id(V) = Id(Vi) n • • • n Id(V m ). 


By Proposition 6.106, V, irreducible implies Id(Vj-) is prime, and so J is an intersection 
of prime ideals. This is an irredundant intersection, for if there is I with J = Id(V) = 

PI .m M(v>), then 


V = Var(Id(V)) = J Var(Id(V 7 )) = (J Vj, 

J¥=t I¥* 

contradicting the given irredundancy of the union. 

Uniqueness is proved similarly. If / = Id ( VV'i) IT • • • fl Id(W s ), where each Id (VP,) is a 
prime ideal (hence is a radical ideal), then each VP; is an irreducible variety. Applying Var 
expresses V — Var(Id(y)) = Var(7) as an irredundant union of irreducible subvarieties, 
and the uniqueness of this decomposition gives the uniqueness of the prime ideals in the 
intersection. • 

Given an ideal I in k[x i,..., x n ], how can we find the irreducible components C, of 
Var (/)? To ask the question another way, what are the prime ideals P/ with C, = Var (Pi)? 
The first guess is that I = P\ C\ • • • C\ P r , but this is easily seen to be incorrect: There 
are ideals I that are not an intersection of prime ideals. For example, in k[x], the ideal 
((x — l) 2 ) is not an intersection of prime ideals. In light of the Nullstellensatz, we can 
replace the prime ideals Pj by ideals Q t with ~J~Qf = Pi, for Var (Pj) = Var( Q ,). We are 
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led to the notion of primary ideal, defined soon, and the primary decomposition theorem, 
which states that every ideal in a commutative noetherian ring, not merely in k\X], is an 
intersection of primary ideals. 

We can now give a geometric interpretation of the colon ideal. 

Proposition 6.109. Let k be an ( uncountable ) algebraically closed field, and let I be a 
radical ideal in k[X). Then, for every ideal J, 

Var((7 : /)) = Var(7) - Var(7). 

Proof. We first show that Var((7 : /)) 3 Var(7) — Var(7). If / e (7 : J), then fgel 
for all g e J. Hence, if x e Var(7), then f(x)g(x) = 0 for all g e J. However, 
if x f. Var (J), then there is g e J with g(x) ^ 0. Since k[X] is a domain, we have 
f(x) = 0 for all x e Var(7) - Var(7); that is, / e Id(Var(7) - Var(7)). Thus, (7 : J ) c 
Id(Var(7) - Var(7)), and so 

Var((7 : /)) 3 Var(Id(Var(7) - Var(7))) = Var(7) - Var(7), 
by Proposition 6.103(iv). 

For the reverse inclusion, take x e Var((7 : J)). Thus, if / e (7 : J), then fix) — 0; 
that is, 

if fg e I for all g e J, then fix) — 0. 

Suppose now that h e ld(Var(7) — Var(7)). Ifg € 7, then hg vanishes on Var(7) (because 
g does); on the other hand, hg vanishes on Var(7) — Var(7) (because h does). It follows 
that hg vanishes on Var(7) U (Var(7) — Var(7)) = Var(7); hence, hg e \fl — I for 
all g e J, because 7 is a radical ideal, and so h e (7 : 7). Therefore, /z(x) = 0 for all 
h e (7 : J), which gives x e Var(Id(Var(7) — Var(7))) = Var(7) — Var(7), as desired. • 

Definition. An ideal Q in a commutative ring R is primary if it is a proper ideal and if 
ab e Q (where a,b e R) and b f. Q, then a" <= Q for some n > 1. 

It is clear that every prime ideal is primary. Moreover, in Z, the ideal )p e ), where p is 
prime and e > 2, is a primary ideal that is not a prime ideal. Example 6.114 shows that 
this example is misleading: There are primary ideals that are not powers of prime ideals; 
there are powers of prime ideals which are not primary ideals. 

Proposition 6.110. If Q is a primary ideal, then its radical P = sfQ is a prime ideal. 
Moreover, if Q is primary, then ab e Q and a f Q implies b e P. 

Proof. Assume that ab e */Q, so that iab) m — a m b m e Q for some m > 1. If a $. \[Q, 
then a m f. Q. Since Q is primary, it follows that some power of b m , say, //"" e Q\ that is, 
b e \[Q. We have proved that ^[Q is prime, as well as the second statement. • 

If Q is primary and P — «JQ, then we often call Q a P-primary ideal, and we say that 
Q and P belong to each other. 

We now prove that the properties in Proposition 6.110 characterize primary ideals. 



392 


Commutative Rings II Ch. 6 


Proposition 6.111. Let J and T be ideals in a commutative ring. If (i) J C T, 
(ii) t e T implies there is some m > 1 with t m e J, and (iii) if ab e J and a £ J, 
then b e T, then J is a primary ideal with radical T. 

Proof First, / is a primary ideal, for if ab e J and a f J , then axiom (iii) gives b e T, 
and axiom (ii) gives b' n e ./. It remains to prove that T — ^fj. Now axiom (ii) gives 
T C -Jf. For the reverse inclusion, if r e VT, then r m e /; choose m minimal. If m = 1, 
then axiom (i) gives r e J C T, as desired. If m > 1, then rr m 1 e /; since r m 1 ^ J, 
axiom (iii) gives r e T. Therefore, T = ~J~J. • 

Let R be a commutative ring, and let M be an ideal. Each a e R defines an P-map 
om : M —> M by um '■ m m > am. 

Lemma 6.112. Let Q be an ideal in a commutative ring R. Then Q is a primary ideal if 
and only if, for each a e R, the map or/q : R/Q —> R/Q, given by r + Q i—>- ar + Q, is 
either an injection or is nilpotent [(a_R/g)” = 0 for some n > 1], 

Proof. Assume that Q is primary. If a <= R and or/q is not an injection, then there is 
b e R with b f Q and aR/Q(b + Q) — ab + Q = Q\ that is, ab e Q. We must prove that 
ur/Q is nilpotent. Since Q is primary, there is n > 1 with a n e Q\ hence, a"r e Q for all 
r e R, because Q is an ideal. Thus, ( aR/Q) n (r + Q) = a n r + Q — Q for all r e R, and 
(or/q)" = 0; that is, or/q is nilpotent. 

Conversely, assume that every or/q is either injective or nilpotent. Suppose that ab e Q 
and a £ Q. Then Ijr/q is not injective, for a + Q e ker bR/Q. By hypothesis, (bR/Q) n — 0 
for some n > 1; that is, b n r e Q for all r e R. Setting r — 1 gives b" e Q , and so Q is 
primary. • 

The next result gives a way of constructing primary ideals. 

Proposition 6.113. If P is a maximal ideal in a commutative ring R, and if Q is an ideal 
with P e C Q C P for some e > 0, then Q is a P-primary ideal. In particular, every 
power of a maximal ideal is primary. 

Proof. We show, for each a e R, that or/q is either nilpotent or injective. Suppose first 
that a e P. In this case, a e e P e c Q\ hence, a e b e Q for all b e R , and so (ur/qY = 0; 
that is, or/q is nilpotent. Now assume that a f P; we are going to show that a + Q is 
a unit in R/Q , which implies that or/q is injective. Since P is a maximal ideal, the ring 
R/P is a field; since a f. P, the element a + P is a unit in R/P: there is a' e R and z e P 
with aa' = 1 — z. Now z + Q is a nilpotent element of R/Q, for z e e ! >e C Q. Thus, 
1 — z + Q is a unit in R/Q (its inverse is 1 +z + ■ —I- z e ~ l ). It follows that a + Q is a unit 
in R/Q, for aa' + Q = 1— z+ Q- The result now follows from Lemma 6.112. Finally, Q 
belongs to P, for P — -J~P^ C «JQ C ^fp — P. • 

Example 6.114. 

(i) We now show that a power of a prime ideal need not be primary. Suppose that R is a 
commutative ring containing elements a,b,c such that ab — c 2 , P — (a, c ) is a prime 
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ideal, a $. P 2 , and b f P. Now ab — c 2 e P 2 ; were P 2 primary, then a f P 2 would 
imply that h e sfp 2 - — P, and this is not so. We construct such a ring R as follows. 
Let k be a field, and define R = k[x, y, z]/ixy — z. 2 ) (note that R is noetherian). Define 
a. b, c e R to be the cosets of x, y, z, respectively. Now P — (a, c) is a prime ideal, for 
the third isomorphism theorem for rings. Exercise 3.82 on page 196, gives 


R/ia, c) 


k[x,y,z\/jxy - z 2 ) 
(x,z)/(xy - z 2 ) 


k[x, y, z] 
ix,z) 


= k[y ], 


which is a domain. The equation ab — c 2 obviously holds in R. Were a e P 2 , then lifting 
this relation to k[x, y, z] would yield an equation 

x = fix , y, z)x 2 + g(x, y, z)xz + hfx, y, z)z 2 + l(x, y, z)(xy — z 2 ). 


Setting y — 0 — z (i.e., using the evaluation homomorphism k[x, y, z] —>■ fc[x]) gives the 
equation x — fix, 0, 0)x 2 in k[x], a contradiction. A similar argument shows that b f P. 

(ii) We use Proposition 6.113 to show that there are primary ideals Q that are not powers 
of prime ideals. Let R = k[x, y], where k is a field. The ideal P — ix, y) is maximal, 
hence prime (for R/P = k); moreover, 

P 2 C ix 2 , y) C (x, y) = P 

[the strict inequalities follow from x f. ix 2 , y) and y f P~~\. Thus, Q — ix 2 , y) is not a 
power of P\ indeed, we show that Q f L e , where L is a prime ideal. If Q — L e , then 
P 2 C L ,J C P. hence sTp 2 C -JU’ C \J~P, and so P C L C P, a contradiction. ■* 


We now generalize Corollary 6.108 by proving that every ideal in a noetherian ring, in 
particular, in k[X ] for k a field, is an intersection of primary ideals. This result, along with 
uniqueness properties, was first proved by E. Lasker; his proof was later simplified by E. 
Noether. Note that we will be working in arbitrary noetherian rings, not merely in k[X\. 


Definition. A primary decomposition of an ideal I in a commutative ring R is a finite 
family of primary ideals Q\,..., Q r with 

I = eine 2 n---ne,. 


Theorem 6.115 (Lasker-Noether I). If R is a commutative noetherian ring, then every 
proper ideal I in R has a primary decomposition. 

Proof. Let T be the family of all those proper ideals in R that do not have a primary 
decomposition; we must show that T is empty. Since R is noetherian, if T ^ 0, then it 
has a maximal element, say, J. Of course, J is not primary, and so there exists a e R with 
or/j: R/J —> R/J neither injective nor n i I potent. The ascending chain of ideals of R/J, 


ker a R /j c ker ia R /j) 2 c ker ia R /j) 3 C • ■. , 
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must stop (because R/J, being a quotient of the noetherian ring R, is itself noetherian); 
there is m > 1 with ker(a^y) = ker (a’f,j) for all t > m. Denote (, aR/j) m by <p, so that 
ker(<p 2 ) = ker^>. Note that ker <p V {0}, because {0} C ker cir/j C ker(a«//) m = ker cp, 
and that ivtup = im(a^/ j) m V {0}, because or/j is not nilpotent. We claim that 

ker<p Pimtp = {0}. 

If x e ker <p fl ivtup, then tp{x) = 0 and x = tp(y) for some y e R/J. But tp(.x) = 
<p((p(y)) — <p 2 (y ), so that y e ker(^ 2 ) = ker <p and x = tp(y) — 0. 

If 7 r: R —¥■ R/J is the natural map, then A — 7r _1 (ker<p) and A! = (im cp) are 
ideals of R with A fl A' — J. It is obvious that A is a proper ideal; we claim that A' is 
also proper. Otherwise, A 1 = R, so that A H A' — A; but A fl A' = J, as we saw above, 
and A J , a contradiction. Since A and A 1 are strictly larger than J , neither of them lies 
in T\ There are primary decompositions A — Q \ fl • • • fl Q m and A' = Q’ x fl • • • fl Q’ n . 
Therefore, 

J = ADA' = g! n ■ ■ ■ n Q m n q[ n • • • n Q' n , 

contradicting J not having a primary decomposition (for J e IF). • 


Definition. A primary decomposition I = Q\ (T • • • fl Q r is irredundant if no Qj can 
be omitted; for all i, 

//2in..-nan..-ne r . 

The prime ideals P\ — Q\ ,..., P r — Q r are called the associated prime ideals of the 
irredundant primary decomposition. 

It is clear that any primary decomposition can be made irredundant by throwing away, 
one at a time, any primary ideals that contain the intersection of the others. 

Theorem 6.116 (Lasker-Noether II). If I is an ideal in a noetherian ring R, then any 
two irredundant primary decompositions of I have the same set of associated prime ideals. 
Hence, the associated prime ideals are uniquely determined by I. 

Proof. Let I — Q \ fl • • • fl Q r be an irredundant primary decomposition, and let P, — 
sfQf. We are going to prove that a prime ideal P in R is equal to some P/ if and only if 
there is c f I with (7 : c) a P-primary ideal; this will suffice, for the colon ideal (7 : c) is 
defined solely in terms of I and not in terms of any primary decomposition. 

Given P, , there exists c e H Qj with c / Qj, because of irredundancy; we show 
that (7 : c,) is TVprimary. Recall Proposition 6.111: If the following three conditions 
hold: (i) (7 : c) C P ,; (ii) h e Pj implies there is some m > 1 with b m e (7 : c); and (iii) 
if ab e (I : c ) and a /(I : c), then b e Pj and (7 : c) is Pj -primary. 

To see (i), if u e (7 : c), then uc e I c I/. As c f Qj, we have u e Pj, by 
Proposition 6.110. To prove (ii), we first show that Qj C (7 : c). If a e Qj, then 
ca e Qj, since Qj is an ideal. If j f i, then c e Qj, and so ca e Qj. Therefore, 
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ca e Q\ fl • • • fl Qr = I. and so a e (/ : c). If, now, b e Pj, then b m e Q, c (/ : c). 

Finally, we establish (iii) by proving its contrapositive: If xy e (/ : c) and x f Pj, then 

y e (I : c). Thus, assume that xyc e /; since I c Qj and x f Pj = \/~Qf, we have 

yc e Qj. But yc e Q y for all j ^ i, for c e Qj. Therefore, yc e Q\ fl • • • fl Q r = I, and 

so }’€(/: c). We conclude that (/ : c) is Pj -primary. 

Conversely, assume that there is an element c ^ I and a prime ideal P such that (/ : c) 
is P -primary. We must show that P = Pj for some i. Exercise 6.14(ii) on page 326 gives 
(/ : c) = (<2l : c) IT • • • fl (Q r : c). Therefore, by Proposition 6.98, 

P = V(7 : c) = V(0i : c) n • • • n 7(0 r : c). 

If c e (I,, then ((I, : c) = R: if c ^ (I,, then we saw, in first part of this proof, that 

(Qi : c) is Pj -primary. Thus, there is ,v < r with 

p = s/(Qn ■ c ) n • • • n 7(gi,: c) = p {i n ■ ■ ■ n 

Of course, P C Pj. for all j. On the other hand. Exercise 6.10(iii) on page 325 gives 

Pj CP for some /, and so P = P, , as desired. • 

Example 6.117. 

(i) Let R — Z, let (n) be a nonzero proper ideal, and let n — • • • p e ' be the prime 

factorization. Then 

(n) = (pj 1 ) n • • • n (p e t ‘) 
is an irredundant primary decomposition. 

(ii) Let R — k[x, y], where A: is a field. Define Q \ — (x) and Q 2 = (x, y) 2 . Note that Q \ 

is prime, and hence Q\ is Pj-primary for Pi = Q \. Also, P 2 — (x, y) is a maximal ideal, 
and so Q 2 = P-r is P 2 -primary, by Proposition 6.113. Define I = Q 1 D Q 2 - This primary 
decompostion of I is irredundant. The associated primes of / are thus {Pi, P 2 }. ◄ 

There is a second uniqueness result that describes a normalized primary decomposition, 
but we precede it by a lemma. 

Lemma 6.118. If P is a prime ideal and Qi,...,Q n eire P-primary ideals, then 
Q\ (T • • • (T Q n is also a P-primary ideal. 

Proof. We verify that the three items in the hypothesis of Proposition 6.111 hold for 
I — Q\ fl • • • fl Q n . Clearly, / CP. Second, if b e P, then b m ' e Qj for all i, because Qj 
is P-primary. Hence, b m e /, where m — max{mi, ..., m „). Finally, assume that ab e I. 
If a f /, then a f Qj for some i. As Qj is P-primary, ab e I c Q, and a ^ Qj imply 
b e P. Therefore, I is P-primary. • 

Definition. A primary decomposition I — Q 1 fl ... fl Q r is normal if it is irredundant 
and if all the prime ideals P, = \[Qa are distinct. 



396 


Commutative Rings II Ch. 6 


Corollary 6.119. If R is a noetherian ring, then every proper ideal in R has a normal 
primary decomposition. 

Proof. By Theorem 6.115, every proper ideal / has a primary decomposition, say, 

i — <2i n • • • n <2r, 

where Qi is P,-primary. If P, — Pi for some i < r, then Q, and Q r can be replaced 
by Q' — Qi fl Q r , which is primary, by Lemma 6.118. Iterating, we eventually arrive 
at a primary decomposition with all prime ideals distinct. If this decomposition is not 
irredundant, remove primary ideals from it, one at a time, to obtain a normal primary 
decomposition. • 

Definition. If I — Q \ fl • • • fl Q r is a normal primary decomposition, then the minimal 
prime ideals P/ = sfQfi are called isolated prime ideals; the other prime ideals, if any, are 
called embedded. 

In Example 6.117(ii), we gave an irredundant primary decomposition of I = (x) fl 
(x, y) 2 in k[x, y], where k is a field. The associated primes are (x) and (x, y), so that (x) 
is an isolated prime and (x, y) is an embedded prime. 

Definition. A prime ideal P is minimal over an ideal / if / C P and there is no prime 
ideal P' with / C P' C P. 

Corollary 6.120. Let I be an ideal in a noetherian ring R. 

(i) Any two normal primary decompositions of I have the same set of isolated prime 
ideals, and so the isolated prime ideals are uniquely determined by I. 

(ii) I has only finitely many minimal prime ideals. 

(iii) A noetherian ring has only finitely many minimal prime ideals. 

Proof, (i) Let I = Q i fl • • • fl Q„ be a normal primary decomposition. If P is any prime 
ideal containing I, then 


p = Qin---r\Q„ 2 Qi -Qn. 

Now P 3 Qj for some i, by Proposition 6.13, and so P f */Qf — Pi- In other words, 
any prime ideal containing I must contain an isolated associated prime ideal. Hence, the 
isolated primes are the minimal elements in the set of associated primes of /; by Theo¬ 
rem 6.116, they are uniquely determined by I. 

(ii) As in part (i), any prime ideal P containing I must contain an isolated prime of I. 
Hence, if P is minimal over I, then P must equal an isolated prime ideal of I. The result 
follows, for I has only finitely many isolated prime ideals. 

(iii) This follows from part (ii) taking / = {0}. • 
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Here are some natural problems arising as these ideas are investigated further. First, 
what is the dimension of a variety? There are several candidates, and it turns out that 
prime ideals are the key. If V is a variety, then its dimension is the length of a longest 
chain of prime ideals in its coordinate ring k[V] (which, by the correspondence theorem, 
is the length of a longest chain of prime ideals above Id(V) in k\X]). 

It turns out to be more convenient to work in a larger projective space arising from k" by 
adjoining a “hyperplane at infinity.” For example, a projective plane arises from the usual 
plane by adjoining a line at infinity (it is the “horizon” where all parallel lines meet). To 
distinguish it from projective space, k" is called affine space , for it consists of the “finite 
points”—that is, not the points at infinity. If we study varieties in projective space, now 
defined as zeros of a set of homogeneous polynomials, then it is often the case that many 
separate affine cases become part of one simpler projective formula. For example, define 
the deg(C) to be the largest number of points in C fl l, where £ is a line. If C — Var(/) 
is a curve arising from a polynomial of degree d, we want deg(C) = d, but there are 
several problems here. First, we must demand that the coefficient field be algebraically 
closed, lest Var(/) = 0 cause a problem. Second, there may be multiple roots, and so 
some intersections may have to be counted with a certain multiplicity. Bezout's theorem 
states that if C and C’ are two curves, then \C fl Cj = deg(C) deg(C')- This formula 
holds in projective space, but it can be false in affine varieties. Defining multiplicities for 
intersections of higher-dimensional varieties is very subtle. 

Finally, there is a deep analogy between differentiable manifolds and varieties. A man¬ 
ifold is a subspace of M” that is a union of open replicas of euclidean space. For example, 
a torus T (i.e., a doughnut) is a subspace of R 3 , and each point of T has a neighborhood 
looking like an open disk (which is homeomorphic to the plane). We say that T is “locally 
euclidean”; it is obtained by gluing copies of R 2 together in a coherent way. That a man¬ 
ifold is differentiable says there is a tangent space at each of its points. A variety V can 
be viewed as its coordinate ring k[V], and neighborhoods of its points can be described 
“locally”, using what is called a sheaf of local rings. If we “glue” sheaves together along 
open subsets having isomorphic sheaves of local rings, we obtain a scheme, and schemes 
seem to be the best way to study varieties. Two of the most prominent mathematicians 
involved in this circle of ideas are A. Grothendieck and J.-P. Serre. 

Exercises 

6.60 Prove that every algebraically closed field is infinite. 

6.61 Prove that if an element a in a commutative ring R is nilpotent, then 1 + a is a unit. 

Hint. The power series for 1/(1 + a) stops after a finite number of terms because a is 
nilpotent. 

6.62 If I is an ideal in a commutative ring R, prove that its radical, -fl, is an ideal. 

Hint. If f e I and g s e /, prove that (/ + g) l+s G I. 

6.63 If R is a commutative ring, then its nilradical nil(W) is defined to be the intersection of all the 
prime ideals in R. Prove that nil(f?) is the set of all the nilpotent elements in R: 

nil(/?) = [r G R: r m — 0 for some m > 1). 
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Hint. If r e R is not nilpotent, use Exercise 6.9 on page 325 to show that there is some prime 
ideal not containing r. 

6.64 (i) Show that x~ + y~ is irreducible in R[jt, y], and conclude that ( x~ + y 2 ) is a prime, 

hence radical, ideal in R[jc, y], 

(ii) Prove that Varf.r 2 + y 2 ) = {(0, 0)}. 

(iii) Prove that Id(Var(x 2 + y 2 )) I> (A 2 + y 2 ), and conclude that the radical ideal (x 2 + y 2 ) 
in R[x, y] is not of the form Id( V) for some variety V. Conclude that the Nullstellensatz 
may fail in £[2 l] if k is not algebraically closed. 

(iv) Prove that (x 2 + y 2 ) = (x + iy) fl (x — iy ) in C[x, y], 

(v) Prove that Id(Var(jc 2 + y 2 )) = (jc 2 + y 2 ) in C[*, y]. 

6.65 Prove that if k is an (uncountable) algebraically closed field and f\ . ft e k[3f], then 

Var(/i. ft) = 0 if and only if there are h j, ..., h t e k[X\ such that 

t 

1 =£*,-(*) fi(X). 
i —1 

6.66 Let k be an (uncountable) algebraically closed field, and let / = (/j, ..., f t ) C k[X ]. If 

g(X) e k[X}, prove that g e \fl C k[X) if and only if {f \, ...,/(, 1 — yg) is not a proper 

ideal in k[X, y]. 

Hint. Use the Rabinowitch trick. 

6.67 Let R be a commutative ring, and let Spec(R) denote the set of all the prime ideals in R. If I 
is an ideal in R, define 

I — {all the prime ideals in R containing /}. 

Prove the following: 

(i) (0) = Spec(R). 

(ii) ~R = 0. 

(iii) J2e h = fit h- 

(iv) 77T7 = 77 = 7 u 7. 

Conclude that Spec(R) is a topological space whose closed subsets are the Zariski closed sets : 
those sets of of the form /, where / varies over the ideals in R. 

6.68 Prove that an ideal P in Spec(R) is closed (that is, the one-point set {P} is a Zariski closed 
set) if and only if P is a maximal ideal. 

6.69 If X and Y are topological spaces, then a function g: X —>■ Y is continuous if, for each closed 
subset Q of Y , the inverse image g -1 (Q) is a closed subset of X. 

Let /: R —>■ A be a ring homomorphism, and define /*: Spec(A) —»■ Spec(R) by 
f*(Q) = / -1 (<2). where Q is any prime ideal in A. Prove that f* is a continuous func¬ 
tion. [Recall that f~ l (Q) is a prime ideal, by Exercise 6.5 on page 325.] 

6.70 Prove that the function <p\ k n —>■ Spec(k[xi..*„]) [where k is an (uncountable) alge¬ 

braically closed field], defined by i p: (aj ,..., a n ) (x\ — a\, ..., x n — a„), is a continous 
injection (where both k” and Spec(fc[xj,..., x n ]) are equipped with the Zariski topology; the 
Zariski topology on k n was defined just after Proposition 6.93). 
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6.71 Prove that any descending chain 

Pi 3 3 '" 3 F m — F m +1 2 ••• 

of closed sets in k n stops; there is some t with Ft = F t+ \ = • • ■. 

6.72 If R is a commutative noetherian ring, prove that an ideal / in R is a radical ideal if and only 
if / = Pi fl • • • (~1 P r , where the Pj are prime ideals. 

6.73 Prove that there is an ideal / in a commutative ring R with I not primary and with -J1 prime. 
Hint. Take R = k[x, y], where k is a field, and / = (x 2 , xy). 

6.74 Let R = k[x, v], where k is a field, and let / = (x~, y). For each a e k, prove that / = 
(x) (T (y + ax, x 2 ) is an irredundant primary decomposition. Conclude that the primary ideals 
in an irredundant primary decomposition of an ideal need not be unique. 


6.6 Grobner Bases 


There is a canard that classical Greek philosophers were reluctant to perform experiments, 
preferring pure reason. Rather than looking in one’s mouth and counting, for example, they 
would speculate about how many teeth a person needs, deciding that every man should 
have, say, 28 teeth. Young mathematicians also prefer pure reasoning, but they, too, should 
count teeth. Computations and algorithms are useful, if for no other reason than to serve 
as data from which we might conjecture theorems. In this light, consider the problem of 
finding the irreducible components of a variety Var(7); algebraically, this problem asks for 
the associated primes of I. The primary decomposition theorem says that we should seek 
primary ideals Q , containing I, and the desired components are Var(>/Q7). In the proof 
of Theorem 6.116, however, we saw that if I — Qj fl • • • fl Q r is an irredundant primary 
decomposition, where Qj is Pj -primary, then Pj — : Cj), where c,- e fl / Qj with 

c £ Qj. Taking an honest look at the teeth involves the following question. Given a set 
of generators of I, can we find generators of Pj explicitly? The difficulty lies in finding 
the elements a, for we will show, in this section, how to find generators of Q (/ : c). 
Having made this point, we must also say that algorithms can do more than provide data 
in particular cases. For example, the euclidean algorithm is used in an essential way in 
proving that if K/k is a field extension, and if fix), g(x) e k[: c], then their gcd in A[x] is 
equal to their gcd in A;[x], 

Given two polynomials f(x), g(x) e k[x~\ with g(x) ^ 0, where k is a field, when is 
g(x) a divisor of /(x)? The division algorithm gives unique polynomials q(x), r(x) e k[x] 
with 

f(x) = qix)g(x) + r(x), 

where r = 0 or deg(r) < deg(g), and g \ f if and only if the remainder r = 0. Let us look 
at this formula from a different point of view. To say that g | / is to say that / e (g), the 
principal ideal generated by g(x). Thus, the remainder r is the obstruction to / lying in 
this ideal; that is, / e (g) if and only if r = 0. 
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Consider a more general problem. Given polynomials 

fix), gi(x), g m {x) e k[x], 

where A: is a field, when is d(x) — gcd{gi(x), ..., g m (x)} a divisor of /? The euclidean 
algorithm finds d, and the division algorithm determines whether d \ f. From another 
viewpoint, the two classical algorithms combine to give an algorithm determining whether 
/ e ig\,...,g m ) = id). 

We now ask whether there is an algorithm in k[x i, ..., x n ] = k[X] to determine, given 
f(X),gi(X),...,g m (X) e k[X], whether / e igi,---,gm)- A generalized division 
algorithm in k[X] should be an algorithm yielding 

r{X),a\{X),...,a m {X) e k[X], 


with r(X) unique, such that 


/ = a\g\ H- \-a m g m + r 

and / e (gi, ■ ■ ■, g m ) if and only if r — 0. Since (gi, ..., g m ) consists of all the linear 
combinations of the g’s, such an algorithm would say that the remainder r is the obstruction 
to/lying in (gi,..., g m ). 

We are going to show that both the division algorithm and the euclidean algorithm can 
be extended to polynomials in several variables. Even though these results are elementary, 
they were discovered only recently, in 1965, by B. Buchberger. Algebra has always dealt 
with algorithms, but the power and beauty of the axiomatic method has dominated the 
subject ever since Cayley and Dedekind in the second half of the nineteenth century. After 
the invention of the transistor in 1948, high-speed calculation became a reality, and old 
complicated algorithms, as well as new ones, could be implemented; a higher order of 
computing had entered algebra. Most likely, the development of computer science is a 
major reason why generalizations of the classical algorithms, from polynomials in one 
variable to polynomials in several variables, are only now being discovered. This is a 
dramatic illustration of the impact of external ideas on mathematics. 


Generalized Division Algorithm 

The most important feature of the division algorithm in k[x ] is that the remainder r(x) 
has small degree. Without the inequality dcg(r) < deg(g), the result would be virtually 
useless; after all, given any Q(x) e k[x], there is an equation 

fix) = Qix)g(x ) + [fix) - Qix)gix)]. 

Now polynomials in several variables are sums of monomials cx“‘ • • • x “", where c e k 
and a.j > 0 for all i. Here are two degrees that we can assign to a monomial. 
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Definition. The multidegree of a monomial cx“' ■ ■ ■ x“" e k[x i,..., x n ], where c e k 
is nonzero and a, > 0 for all i, is the n -tuple a — (a\,..., a n ); its weight is the sum 
I a | = a\ H-h a n . 

When dividing f(x) by g(x ) in k[x], we usually arrange the monomials in f(x) in 
descending order, according to degree: 

f(x) = C n X n + C n -\X n ~ { H-h C2X 2 + c\x + co. 

A polynomial in several variables, 

f(X) = fix i, ...,x n ) = £c (gl .gjx " 1 ■■■x“ n , 

can be written more compactly as 


/(X) = £]c a X“ 

a 


if we abbreviate (ofi, ..., a„) to a and x“' • • • x“" to X 01 . We will arrange the monomials 
involved in f(X ) in a reasonable way by ordering their multidegrees. 

In Example 5.69(ii), we saw that N", the set of all u-tuples a — (ai . a„) of natural 

numbers, is a monoid under addition: 

a + P — (ori,..., a n ) + (Pi, ..., p n ) = (a\ + Pi, ..., a n + p n ). 

This monoid operation is related to the multiplication of monomials: 

X a X^ — 


Recall that a partially ordered set is a set X equipped with a relation < that is reflexive, 
antisymmetric, and transitive. Of course, we may write x -< y if x < y and x ^ y, and 
we may write y > x (or v >- x) instead of x < y (or x < y). A partially ordered set X 
is well-ordered if every nonempty subset S c. X contains a smallest element; that is, there 
exists sq e S with so < s for all s e S. For example, the least integer axiom says that the 
natural numbers N with the usual inequality < is well-ordered. 

Proposition A. 3 in the Appendix proves that every strictly decreasing sequence in a 
well-ordered set must be finite. This property of well-ordered sets can be used to show 
that an algorithm eventually stops. For example, in the proof of the division algorithm 
for polynomials in one variable, we associated a natural number to each step: the degree 
of a remainder. Moreover, if the algorithm does not stop at a given step, then the natural 
number associated to the next step—the degree of its remainder—is strictly smaller. Since 
the natural numbers are well-ordered by the usual inequality <, this strictly decreasing 
sequence of natural numbers must be finite; that is, the algorithm must stop after a finite 
number of steps. 

We are interested in orderings of multidegrees that are compatible with multiplication 
of monomials—that is, with addition in the monoid N". 
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Definition. A monomial order is a well-ordering of N" such that 

a < ft implies a + y < ft + y 


for all a, ft, y e N". 

A monomial order will be used as follows. If A = (x\,..., x n ), then we define X a < 
in case a < ft: that is, monomials are ordered according to their multidegrees. 

Definition. If N” is equipped with a monomial order, then every f(X) e k[X] = 
k[x i,..., x n ] can be written with its largest term first, followed by its other, smaller, terms 
in descending order: 

f{X) = c a X a + lower terms. 

Define its leading term to be LT(/) = c a X a and its Degree to be Deg(/) = a. Call 
f(X) monic if LT(/) = X a ; that is, if c a — 1. 

Note that Deg(/) and LT(/) depend on the monomial order. 

There are many examples of monomial orders, but we shall give only the two most 
popular ones. 

Definition. The lexicographic order on N" is defined by a ^| ex fi if either a = f J > or the 
first nonzero coordinate in ft — a is positive. 17 

The term lexicographic refers to the standard ordering of words in a dictionary. For 
example, the following German words are increasing in lexicographic order (the letters are 
ordered a < b < c <■■■< z)'. 


ausgehen 

ausladen 

auslagen 

auslegen 

bedeuten 


If a -<i ex /L then they agree for the first i — 1 coordinates (for some ; > 1), that is, 
oi\ = fii,..., a ,-1 = bi- 1 , and there is strict inequality: a, < fa. 

Proposition 6.121. The lexicographic order ^| cx is a monomial order on N". 

Proof. First, we show that the lexicographic order is a partial order. The relation ^q ex 
is reflexive, for its definition shows that a ^q ex a. To prove antisymmetry, assume that 
a ^lex P and f ^i ex a. If a ^ ft, there is a first coordinate, say the ith, where they 
disagree. For notation, we may assume that a ,• < ftj. But this contradicts ft ^i ex a. 

17 The difference ft — a may not lie in N", but it does lie in Z”. 
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To prove transitivity, suppose that a -<i ex P and p ^i ex y (it suffices to consider strict 
inequality). Now a\ — Pi,..., o/j-\ — and a,- < Pi. Let y p be the first coordinate 
with P p < y p . If p < i, then 

y\ = p\ = au ..., y p -i = P p -1 = a p -i, a p — P p < y p \ 

if p >i, then 

Y\ = Pi = ai,..., Yi~i = Pi -1 = a,_i, a,- < Pi = yi- 

In either case, the first nonzero coordinate of y — a is positive; that is, a -<i ex y. 

Next, we show that the lexicographic order is a well-order. If S is a nonempty subset of 
N”, define 

Ci = {all first coordinates of n -tuples in S}, 

and define <$i to be the smallest number in Cj (note that Ci is a nonempty subset of the 
well-ordered set N). Define 

C2 = {all second coordinates of n-tuples (< 5 i, 012, ■ ■ ■, ot n ) e S}. 

Since C2 ^ 0, it contains a smallest number, 82- Similarly, for all i < n. define C,+i as all 
the (i + l)th coordinates of those n -tuples in S whose first i coordinates are ( 5 i, 82, ■ ■ ■, 5 ,-), 
and define ( 5 ,+i to be the smallest number in C, + 1. By construction, the /;-tuple 8 = 
(Si, 82,..., 8 n ) lies in S', moreover, if a — (ai, 012 ,..., a,,) e S, then 

a - 8 — (of! - <$!, a 2 - 8 2 , ..., a„ - 8„) 

has its first nonzero coordinate, if any, positive, and so 8 <| ex a. Therefore, the lexico¬ 
graphic order is a well-order. 

Assume that a ^i ex P\ we claim that 

a + y ^i ex P + Y 

for all y e N. If a — p, then a + y — p + y. Ifcr -<] ex p, then the first nonzero coordinate 
of p — a is positive. But 

(P + Y) - (& + Y) = P - a, 

and so a + y -q cx p + y. Therefore, ^i ex is a monomial order. • 

In the lexicographic order, x\ > xi > xj > ■ ■ ■. for 

(1, 0,..., 0) >- (0, 1, 0,..., 0) >-M0, 0,..., i). 

Any permutation of the variables x a (\), ..., x a («) yields a different lexicographic order 
on N". 
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Remark. If X is any well-ordered set with order <, then the lexicographic order on 
X" can be defined by a = (a 1 ,..., a „) ^i ex b — {b\, ... , b n ) in case a — b or if 
they first disagree in the i th coordinate and cij < /?,. It is a simple matter to generalize 
Proposition 6.121 by replacing N with X. ■* 

In Lemma 5.70 we constructed, for any set X, a monoid W(X): its elements are the 
empty word together with all the words • • • x p p on a set X, where p > 1 and e, = ± 1 
for all;; its operation is juxtaposition. In contrast to N", in which all words have length n, 
the monoid W(X) has words of different lengths. Of more interest here is the submonoid 
W + (X) of W(X) consisting of all the "positive” words on X: 

W + (X) — [xi ■ ■ ■ x p e W(X) : xi e X and p > 0}. 

Corollary 6.122. If X is a well-ordered set, then VV + ( X) is well-ordered in the lexico¬ 
graphic order (which we also denote by ;<i ex ). 

Proof. We will only give a careful definition of the lexicographic order here; the proof 
that it is a well-order is left to the reader. First, define 1 ;<i ex w for all w e W + (X). Next, 
given words u — x i • • • x p and v = }’\ ■ ■ ■ y q in W + (X), make them the same length by 
adjoining l’s at the end of the shorter word, and rename them u' and v' in VV + (A'). If 
m > max{p, q}, we may regard u', v', e X m , and we define u ^i ex v if u' ^i ex v' in X m . 
(This is the word order commonly used in dictionaries, where a blank precedes any letter: 
for example, muse precedes museum.) • 

Lemma 6.123. Given a monomial order on N”, any sequence of steps of the form 
f(X) —»■ f (X) — cpXf +g(20, where cpX& is a nonzero term of f{X) and Deg(g) -< f, 
must be finite. 

Proof. Each polynomial 


f(X) = ^2 c a X 01 e k[X] = k[xi,..., x n ] 

a 

can be written with the multidegrees of its terms in descending order: ai >- 012 > ■ ■ ■ > 01 p . 
Define 

multiword(/) = a\ ■ ■ • a p e >V + (N"). 

Let cpXf be a nonzero term in f(X), let e k[X) have Deg(g) -< ft, and write 

f{X) = h{X) + cpXP+t(X), 

where h(X) is the sum of all terms in f(X) of multidegree >- ft and l(X) is the sum of all 
terms in f(X) of multidegree -< ft. We claim that 

multiword(/(X) — cpX^ + g(X)) ^i ex multiword(7i + I + g) 

-<l ex multiword!/) in Vf + (X). 
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The sum of the terms in f(X) — cpX@ + g(X) with multidegree >- /I is h(X), while 
the sum of the lower terms is i(X) + g(X). But Deg(£ + g) -< fi, by Exercise 6.79 on 
page 410. Therefore, the initial terms of f(X ) and f(X) — cpXP + g(X) agree, while 
the next term of f(X) — cpXP + g(2f) has multidegree -< /S, and this proves the claim. 
Since W + (N") is well-ordered, it follows that any sequence of steps of the form f (X) -> 
f(X) — cpX^ + g(X) must be finite. • 

Here is the second popular monomial order. Recall that if a = (a i,.... a n ) e N", then 
|a| = a\ + ■ ■ ■ + a„ denotes its weight. 

Definition. The degree-lexicographic order on N" is defined by a ^diex P if either 
a — p or 

n n 

l«l = Y2 ai < ^2 Pi = \P\’ 

i =1 i =1 

or, if |a| = \fi\, then the first nonzero coordinate in ft — a is positive. 

In other words, given a — (ai,..., a n ) and ft — (J}\,..., first check weights: if 
|c<£ | < \fi\, then a ^diex if there is a tie, that is, if a and ft have the same weight, then or¬ 
der them lexicographically. For example, (1, 2, 3, 0) ^diex (0, 2, 5, 0) and (1, 2, 3, 4) -<di ex 
(1,2,5,2). 

Proposition 6.124. The degree-lexicographic order ^diex is a monomial order on N". 

Proof. It is routine to show that ^diex is a partial order on N”. To see that it is a well-order, 
let S be a nonempty subset of N ". The weights of elements in S form a nonempty subset of 
N, and so there is a smallest such, say, t. The nonempty subset of all a e S having weight 
t has a smallest element, because the degree-lexicographic order ^diex coincides with the 
lexicographic order ^| ex on this subset. Therefore, there is a smallest element in S in the 
degree-lexicographic order. 

Assume that a ^diex P and y e N". Now \a + y\ = |a| + \y\, so that |a| = \f}\ 
implies \a + y\ — + y\ and \a\ < \ft\ implies \a + y\ < \f + y\; in the latter case. 

Proposition 6.121 shows that a + y ^diex P + Y- • 

The next proposition shows, with respect to a monomial order, that polynomials in 
several variables behave like polynomials in a single variable. 

Proposition 6.125. Let < be a monomial order on N", and let f(X).g(X), h(X) e 
k[X] — k[x i,..., x n ], where k is a field. 


(i) If Deg(/) = Deg(g), then LT(g) | LT(/). 

(ii) LT (hg) = LT(7i)LT(g). 

(iii) If Deg(/) = Deg (hg), then LT (g ) | LT(/). 
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Proof, (i) If Deg(/) = a = Deg(g), then LT(/) = cX“ and LT(g) = dX a . Hence, 
LT(g) | LT(/), because c / 0 and so c is a unit in k [note that LT(/) | LT(g) as well]. 

(ii) Let h(X) — cX y + lower terms and let g(X) — bX& + lower terms, so that LT (h) = 
cX y and LT(g) = bX@. Clearly, cbX y+ P is a nonzero term of h(X)g(X). To see that it 
is the leading term, let c^X^ be a term of h{X ) with fi < y, and let b v X v be a term of 
g(X) with v < ft (with at least one strict inquality). Now Deg (c [l X tl b v X v ) = /x+ u; since 
< is a monomial order, we have /x, + v<y + v-<y+p. Thus, cbX y+ & is the term in 
h(X)g(X) with largest multidegree. 

(iii) Since Deg(/) = Deg (Jig), part (i) gives LT (Jig) | LT(/), and LT(/t)LT(g) = LT (hg), 

by part (ii); hence, LT(g) | LT(/). • 

Definition. Let < be a monomial order on N" and let f(X), g(X) e k\X], where k[X] = 
k[x i,... , x n \. If there is a nonzero term cpX^ in f(X) with LT(g) | cpX@ and 

crX? 

h{X) = f{X)-^—g{X), 

LT (g) 

g 

then the reduction f —*■ h is the replacement of / by h. 

Reduction is precisely the usual step involved in long division of polynomials in one 
variable. Of course, a special case of reduction is when cpX^ — LT(/). 

Proposition 6.126. Let < be a monomial order on N", let fiX), g(X) e k[X] — 
k[x\, ..., x„], and assume that f -4- h; that is, there is a nonzero term cpX@ in f(X) 
with LT(g) | cpX& and h(X) = f (X) — g(X). Then 

Deg (mg] 8(X) ) - Deg(/) - 

Moreover, if ft = Deg(/) [i.e., ifcpXP — LT(/)], then either 
h(X) — 0 or Deg (h) < Deg(/), 

and if f -< Deg(/), then Deg(/z) = Deg(/). 

Proof Let us write 


f(X ) = LT(/) + c k X k + lower terms, 

where c K X K — LT(/ — LT(/)); since cpXP is a term of f[X), we have f < Deg(/). 
Similarly, if LT(g) = a y X y , so that Deg(g) = y, let us write 


g(X ) = a y X y + axX x + lower terms, 
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where ci\X l = LT(g — LT(g)). Hence, 


h(X) = f(X) 

= f(X) 


cp 


X? 


LT(g) 

cpXP 

LT( t? ) 


g(X) 

[lT(*) + a^X A 
X? 


■] 


= [/(x)-c (J x-]-a_[ 0 ^ + ...]. 

Now LT(g) | cpXP says that /3 — ye N". We claim that 


Degf-^t 
S v LT( .e) L 


mg) 

cpXt> 


axX A 


■ )=k + p-y; 


that is, X + ft — y — Deg (— ij^g ) OkX x ) is the largest multidegree occurring. Suppose that 
a, ] X n is a lower term in g(X) (i.e., i] < X); since < is a monomial order, 

r] + (P - y) < y + (X - y) = X. 

Now X < y implies X + (/3 — y) -< y + (/3 — y) = / 3 , and so 


Deg ( _ <fi ~ Deg(/) ' 


-LT(g). 

Therefore, if h(X) ^ 0, then Exercise 6.79 on page 410 gives 


(6) 


XP 


LT(g)J 


g(X) 


Deg (h) < max { Deg (f(X) - cpXP), Deg (- 1 C/J 

Now if ft = Deg(/), then cpXP = LT(/), 

f{X) — cpXP — f(X) — LT(/) = c k X k + lower terms, 

and, hence, Deg(/(X) — CpXP) — k < Deg(/) in this case. If fi < Deg(/), then 
Deg (f(X) - cpXP) = Deg(/), while Deg (- [gg] g{X)) < Deg(/), by Eq. (6), 
so Deg(/z) = Deg(/) in this case. 

The last inequality is clear, for 


and 


cpXP R cpXP r , 

P : g(X) = cpXP + m_[axX x 


I 


LT(g) " LT(g)l 

Since the latter part of the polynomial has Degree X + /3 — y -< /?, we see that 

iXP 


Deg fe®) = ^- Deg(/) - 
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Definition. Let {gi (X),..., g m (X)} be a set of polynomials in k[X], A polynomial r(X) 
is reduced mod {gi,..., g m } if either r(X) = 0 or no LT(g,) divides any nonzero term of 
r(X). 

Here is the division algorithm for polynomials in several variables. Because the algo¬ 
rithm requires the “divisor polynomials” {gi,..., g m } to be used in a specific order (after 
all, an algorithm must give explicit directions), we will be using an m -tuple of polynomi¬ 
als instead of a subset of polynomials. We denote the m-tuple whose ith entry is g, by 
[gi,..., g m ], because the usual notation (gj,..., g,„) would be confused with the ideal 
(gt, • • •, gm) generated by the g,-. 

Theorem 6.127 (Division Algorithm in k[X]). Let <be a monomial order on N", and 
let £[X] = k[x i,..., x n ]. If f(X) e k[X] and G — [gi(X), ..., g,„(X)] is an m-tuple 
of polynomials in k[X\ then there is an algorithm giving polynomials r(X),a i(X), .. 
a m (X) e £[X] with 

f =a\gi H-h a m g m +r, 

where r is reduced mod {gj, ..., g m ], and 

Deg(a,g ; ) ^ Deg(/) for all i. 

Proof Once a monomial order is chosen, so that leading terms are defined, the algorithm 
is a straightforward generalization of the division algorithm in one variable. First, reduce 
mod gi as many times as possible, then reduce mod g 2 as many times as possible, and then 
reduce again mod gi; more generally, once a polynomial is reduced mod [gi,..., g,] for 
any i, then reduce mod [gi ..., g ( , g,+i]. Here is a pseudocode describing the algorithm 
more precisely. 

Input: /(X) = cpX?, [gi.gm] 

Output: r,ai,... ,a m 
r := /; a t := 0 

WHILE / is not reduced mod {gj,..., g„,} DO 
select smallest i with LT(g,) | cpX^ for some f> 
f - [cpXP/CTig^gi := / 
at + [c^X^/LTig,)] := at 

END WHILE 

At each step h j —>■ h j +1 of the algorithm, we have 

multiword (h j) > i ex multiword (h j + \) 

in W+(N"), by Lemma 6.123, and so the algorithm does stop, because <| cx is a well-order 
on W+(N"). Obviously, the output r(X) is reduced mod {gi,..., g,„}, for if it has a term 
divisible by some LT(g, ), then one further reduction is possible. 

Finally, each term of a, (X) has the form c l gX^/LT(g 1 ) for some intermediate out¬ 
put h(X) (as we see in the pseudocode). It now follows from Proposition 6.126 that 
Deg (atgi) < Deg(/). • 
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Definition. Given a monomial order on N", a polynomial f(X) e £[X], and an m-tuple 
G — [yi,..., g m ], we call the output r(X) of the division algorithm the remainder of 
f(X) mod G. 

Note that the remainder r of f mod G is reduced mod {gi, ..., g „,} and / — r e I = 
(g i,, g m ). The algorithm requires that G be an m-tuple, because of the command 

select smallest i with LT(g,-) | cpX@ for some /I 

specifying the order of reductions. 

The next example shows that the remainder may depend not only on the set of poly¬ 
nomials {gi, ..., g m } but also on the ordering of the coordinates in the m-tuple G = 

[gi,- g m ]- That is, if a e S m is a permutation and G a = [go-(i),..., ga(m)], then 

the remainder r a of / mod G, 7 may not be the same as the remainder r of / mod G. Even 
worse, it is possible that r 0 and r a — 0, so that the remainder mod G is not the ob¬ 
struction to / being in the ideal (g|,... , g m ). We illustrate this phenomenon in the next 
example, and we will deal with it in the next subsection. 

Example 6.128. 

Let fix, y, z ) = x 2 y 2 + xy, and let G = [gj, g 2 , £ 3 ], where 

gi = y 2 + z 2 
82 = x 2 y + yz 
g 3 = z 3 +xy. 

We use the degree-lexicographic order on N 3 . Now y 2 = LT(gi) | LT(/) = x 2 y 2 , and so 
f h, where 

h = f - '^—y-(y 2 + z 2 ) — -x 2 z 2 + xy. 

y 

The polynomial — x 2 z 2 + xy is reduced mod G, because neither —x 2 z 2 nor xy is divisible 
by any of the leading terms LT(gi) = y 2 , LT(g 2 ) = x 2 y, or LT(g 3 ) = z 3 . 

Let us now apply the division algorithm using the 3-tuple G' — [,^ 2 . g 1 . # 3 ]. The first 

82 

reduction gives / —> h\ where 

2 2 

h 1 = f - *-^-(x 2 y + yz) = ~y 2 z + xy. 
x L y 

Now It is not reduced, and reducing mod g] gives 

h' - -’j-(y 2 + z 2 ") = z, 3 + xy. 

y 2 

But ’■ + xy = g;g and so + xy —>■ 0. Thus, the remainder depends on the ordering of 
the divisor polynomials gi in the m-tuple. 

Lor a simpler example of different remainders (but with neither remainder being 0), see 
Exercise 6.78. ◄ 
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Exercises 


6.75 


6.76 

6.77 

6.78 

6.79 

6.80 

6.81 

6.82 

6.83 


6.84 


(i) Let (X, <) and (Y, <’) be well-ordered sets, where X and Y are disjoint. Define a binary 
relation < on X U Y by 

x\ < X 2 if xi ,*2 e X andxi < X 2 , 
y\ < yi if v’i , y2 e Y and y\ <' yj, 

x < y if x e X and y e Y. 

Prove that (X U Y , <) is a well-ordered set. 

(ii) If r < n, we may regard N r as the subset of N™ consisting of all n -tuples of the form 

(« 1 , ..., n r , 0.0), where n , e N for all i < r. Prove that there exists a monomial 

order on N " in which a < b whenever a e N r and fi el" — N'. 

Hint. Consider the lex order on k[x \, .... x„] in which x\ < X 2 < ■ ■ ■ < x n . 

(i) Write the first 10 monic monomials in k[x, y] in lexicographic order and in degree- 
lexicographic order. 

(ii) Write all the monic monomials in k\x, y, z] of weight at most 2 in lexicographic order 
and in degree-lexicographic order. 

Give an example of a well-ordered set X containing an element u having infinitely many 
predecessors; that is, [x e X : x < u j is infinite. 

Let G = [x — y, x — z] and G' = [x — z, x — y]. Show that the remainder of x mod G (in 
degree-lexicographic order) is distinct from the remainder of x mod G'. 

Let ^ be a monomial order on N", and let f{X), g(X) e k[X ] = k\x \, ..., x n \ be nonzero. 

(i) Prove that if / + g ^ 0, then Deg(/ + g) < max{Deg(/), Deg(g)), and that strict 
inequality can occur only if Deg(/) = Deg(g). 

(ii) Prove that Deg(/g) = Deg(/) + Deg(g), and Deg(/ m ) = m Deg(/) for all m > 1. 
Use the degree-lexicographic order in this exercise. 

(i) Find the remainder of x 7 v 2 + x 3 y 2 — y + 1 mod [xy 2 — x, x — y 3 ]. 

(ii) Find the remainder of x 7 v 2 + x 3 y 2 — y + 1 mod [x — y 3 , xy 2 — x]. 

Use the degree-lexicographic order in this exercise. 

(i) Find the remainder of x 2 y + xy 2 + y 2 mod [y 2 — 1, xy — 1], 

(ii) Find the remainder of x 2 y + xy 2 + y 2 mod [xy — 1, y 2 — 1]. 

Let c a X a be a nonzero monomial, and let f(X ), g(X) e k[X} be polynomials none of whose 
terms is divisible by c a X a . Prove that none of the terms of f(X) — g(X) is divisible by c a X a . 
An ideal I in k[X] that is generated by monomials, say, I = (X 01 ^ 1 , ..., X a ^), is called a 
monomial ideal. 

(i) Prove that f(X ) e / if and only if each term of f(X) is divisible by some X a ^\ 

(ii) Prove that if G — [gj, ..., g m ] and r is reduced mod G. then r does not lie in the 
monomial ideal (LT(gi), .... LT(g m )). 

Let f(X) = Yla c o/X a e k[X] be symmetric, where k is a field and X = (xj, ..., x n ). 
Assume that N" is equipped with the degree-lexicographic order and that Deg(/) = fi = 
(Pi . fin)- 

(i) Prove that if c a x ** • • • x a " occurs with nonzero coefficient c a , then every monomial 
x* j • • ■ x„"j also occurs in f(X) with nonzero coefficient, where a e S n . 
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(ii) Prove that 0j > 02 > • • • > P n . 

(iii) If e\,... , e„ are the elementary symmetric polynomials, prove that 

Deg(e;) = (1, .... 1, 0.0), 

where there are i l’s. 

(iv) Let (yi . Yn) = (01 “ft, 02-03. • ••. Pn-\~Pn,Pn)- Prove that if g(x\, .. ,,x n ) = 

xp ■ ■ ■ x Yn , then g(ee„) is symmetric and Deg(g) = 0. 

(v) Fundamental Theorem of Symmetric Polynomials. Prove that if k is a field, then 
every symmetric polynomial f(X) e k[X ] is a polynomial in the elementary symmetric 
functions e\, ... ,e n . (Compare with Theorem 4.37.) 

Hint. Prove that h(X) = f(X ) — cpg(e \,..., e n ) is symmetric, and that Deg(/7) < 0. 


Buchberger’s Algorithm 

For the remainder of this section we will assume that N” is equipped with some monomial 
order (the reader may use the degree-lexicographic order), so that LT(/) is defined and the 
division algorithm makes sense. 

We have seen that the remainder of / mod [gi,..., g m ] obtained from the division al¬ 
gorithm can depend on the order in which the g; are listed. Informally, a Grobner basis 
{gi, ..., g m } of the ideal I = (g i,, g m ) is a generating set such that, for every m-tuple 
G a — [g<T(i),..., go-(m)] formed from the g,-, where a e S,„ is a permutation, the remain¬ 
der of / mod Grj is always the obstruction to whether / lies in I. We define Grobner bases 
using a property that is more easily checked, and we then show, in Proposition 6.129, that 
they are characterized by the more interesting obstruction propery just mentioned. 

Definition. A set of polynomials {gj,..., g m ) is a Grobner basis 1 x of the ideal I = 
(gi,..., g m ) if, for each nonzero / e /, there is some g, with LT(g,) | LT(/). 

Note that a Grobner basis is a set of polynomials, not an m-tuple of polynomials. Ex¬ 
ample 6.128 shows that 

[y 2 + z 2 , x 2 y + yz, z 3 + xy} 

is not a Grobner basis of the ideal (y 2 + z , x 2 y + yz, Z 3 + xy). 

Proposition 6.129. A set {gi,..., g,„} of polynomials is a Grobner basis of an ideal 
I = (gi,..., g m ) if and only if for each m-tuple G a — [g<r(i),.... g<r(m)], where a e S m , 
every f e I has remainder 0 mod G a . 

Proof. Assume there is some permutation a e S m and some / e / whose remainder 
mod G a is not 0. Among all such polynomials, choose / of minimal Degree. Since 
{gi, ..., g m } is a Grobner basis, LT(g,) | LT(/) for some i ; select the smallest aii) for 

18 B. Buchberger has written in his article in Buchberger-Winkler, Grobner Bases and Applications. “The early 
paper of Grobner in 1954, although not yet containing the essential ingredients of Grobner basis theory, pointed 
in the right direction and motivated me, in 1976, to assign the name of W. Grobner (1899-1980) to the theory.” 
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which there is a reduction / h , and note that h e I. Since Deg(/z) -< Deg(/), by 
Proposition 6.126, the division algorithm gives a sequence of reductions h = hi) —> h \ —> 
hi -»•••-» h p — 0. But the division algorithm for / adjoins / -> h at the front, 
showing that 0 is the remainder of / mod G a , a contradiction. 

Conversely, assume that every / e I has remainder 0 mod G a but that {gi,..., g m } is 
not a Grobner basis of / = (gi ,..., g m ). If there is a nonzero / e I with LT(g,) \ LT(/) 
for every z, then in any reduction / \ h, we have LT(/z) = LT(/). Hence, if G = 
[gi,..., g m ], the division algorithm mod G gives reductions / -> hi —*■ I 12 —> ■ ■ ■ -* 
h p = r in which LT(r) = LT(/). Therefore, r / 0; that is, the remainder of / mod G is 
not zero, and this is a contradiction. • 

Corollary 6.130. If {g 1 ,..., g m ] is a Grobner basis of the ideal I — (g 1 ,..., g m ), and 
if G = [gi,..., g m \ is any m-tuple formed from the gi, then for every f(X) e k[X\ there 
is a unique r(X) e k[X], which is reduced mod {gi,..., g m ], such that f — r e I; in 
fact, r is the remainder of f mod G. 

Proof The division algorithm gives a polynomial r , reduced mod {gi, ..., g m }, and poly¬ 
nomials a 1 ,..., a m with / = aigi+- • -+a m g, n +r; clearly, f—r = aigi+■ ■ ■+a m g m e I. 

To prove uniqueness, suppose that r and f are reduced mod {gi,..., g,„} and that 
f — r and f — r' lie in /, so that (/ — r') — (/ — r) = r — r' e /. Since r and r' are 
reduced mod {g 1 , ..., g ;;j }, none of their terms is divisible by any LT(g,). If r — r' 0, 
then Exercise 6.82 on page 410 says that no term of r — r' is divisible by any LT(g,); in 
particular, LT(r — f ) is not divisible by any LT(g ; ), and this contradicts Proposition 6.129. 
Therefore, r = r'. • 

The next corollary shows that Grobner bases resolve the problem of different remainders 
in the division algorithm arising from different m -tuples. 

Corollary 6.131. Let {gi, ..., g m } be a Grobner basis of the ideal I — (gj. g m ), 

and let G = [gi, ..., g m ]. 

(i) If f{X) e k[X] and G a = [go-(i), ..., g a (m)\ where a e S m is a permutation, then 
the remainder of f mod G is equal to the remainder of f mod G a . 

(ii) A polynomial f e / if and only if f has remainder 0 mod G. 

Proof (i) If r is the remainder of / mod G, then Corollary 6.130 says that r is the unique 
polynomial, reduced mod {gi, ..., g m }, with / — re/; similarly, the remainder r n of / 
mod G a is the unique polynomial, reduced mod {gi, ..., g m |, with f — r a e /. The 
uniqueness assertion in Corollary 6.130 gives r — ra¬ 
in) Proposition 6.129 shows that if / e /, then its remainder is 0. For the converse, if r is 
the remainder of / mod G, then / = q + r, where q e /. Hence, if r = 0, then / e /. • 

There are several obvious questions. Do Grobner bases exist and, if they do, are they 
unique? Given an ideal I in k[X ], is there an algorithm to find a Grobner basis of /? 
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The notion of S-polynomial will allow us to recognize a Grobner basis, but we first 
introduce some notation. 

Definition. If a = (ai,..., a n ) and fl = (fi \...., /!„) are in N”, define 


a V P — fj., 


where /x, = max{a ; -, /S,} and /i = ..., p, n ). 

Note that X aW P is the least common multiple of the monomials X a and X@. 

Definition. Let f(X), g(X) e k\ X]. where LT(/) = a a X a and LT(g) = bp. Define 


L(/,g) = X“ v ^. 


The S-polynomial S(f, g) is defined by 


L(f,g) L(f, g) 
LT(/) J LT(g) 8 ' 


that is, if pu — a V f), then 

S(f, g) = a~ l X^~ a f(X) - b~ p X X^-Pg{X). 

Note that S(f, g) — —S(g, /). 

Example 6.132. 

(i) If f(x,y ) = 3 x 2 y and g(x, y) — 5xy 3 — y (in degree-lexicographic order), then 
L(f, g) = x 2 y 3 and 


S(f, 8 ) = ^3r 3x2 y - .- V -y)= ^xy. 


3 x 2 y 


5 xy 3 


(ii) If f(X) and g(X) are monomials, say, f(X) — a a X a and g(X) — bpXP, then 


S(f, 8) = 


a„X a 


-a„X a 


bpX? 


b p X p =0. ◄ 


The following technical lemma indicates why S-polynomials are relevant. It says that if 
Deg(^ ; Qjgj) -< 8, where the aj are monomials, while Deg (cijgj) = 8 for all j, then any 
polynomial of multidegree -< <5 can be rewritten as a linear combination of 5-polynomials, 
with monomial coefficents, each of whose terms has multidegree strictly less than <S. 
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Lemma 6.133. Given gi(X ),..., gt(X ) e k[X] and monomials CjX a( i\ let h(X) = 
T.UcjX^gjiX). 

Let 8 be a multidegree. IfDeg(h) -< 8 and Deg (c j X ai, i ] gj (X)) — 8 for all j < l, then 
there are d / € k with 

h{x) = Y d d i xi ~ iiU)s igj, gj+i), 
j 

where /x(j) = D eg(gj) V Deg(g ;+ i), and for all j < i, 

Deg (X s -^S(gj,g j+1 ))<8. 

Proof. Let LT(gy) = bjX so that LT (cjX a ^gj(X)) — CjbjX s . The coefficient of 
X s in h(X) is thus '^ZjCjbj. Since Deg(Vi) -< 8, we must have ^ Zj c jbj = 0. Define 
monic polynomials 

u j (X) = bj 1 X a Wgj(X). 

There is a telescoping sum 

l 

h(X) = J2^x a(J) gj(X) 

7=1 

e 

= Y c i b i u i 

7=1 

= c\b\{u\ - u 2 ) + (c\b\ + c 2 b 2 )(u 2 - M 3 ) H- 

+ (c\b\ H-h < 7 . \b, ]){ut 1 - ue) 

+ (cibi H-h cibt)ut. 

The last term {c\b\ H-1 -c^bi)ur = 0, for Jfj Cjbj = 0. Since 8 — Deg (cjX 01 ^gj(X)), 

we have a(j) + f(j) = 8, so that \ X s for all j. Hence, for all j < l , we have 

lcm{X^) ; X^J +l) } = X^vW+D | X 5 ; that is, if we write /Mj) = f(j) V f(j + 1), 
then 8 — ji(j) e N". But 

x‘-^s (gJ . SJ+l) = x‘-^fXX sl(x) _ 

X^ 

TTY T~TY \£/+l(^0 

LT (gj ) LT(gy_|_i) 

= bj'x^gj - bjl l x a u +1) gj+1 

— U j U j- f_l. 

Substituting this equation into the telescoping sum gives a sum of the desired form, where 
dj — c\b\ H - f Cjbj\ 

h(X) = ci biX s ~^S( gl ,g 2 ) + (ci In + c 2 b 2 )X s ~^S(g 2 , g3 ) + --- 

+ (ci/71 + • • • + c l - l h-i)X s -^- l) S(gi- l , g e ). 
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Finally, since both uj and Uj+\ are monic with leading term of multidegree <5, we have 
Deg (if j — u j+ 1 ) -< 8. But we have shown that uj — u j+\ — X s ^ ll(g> S(gj, g j+i), and so 
Deg(.X s -^S(gj,g j+1 )) <8, as desired. • 

By Proposition 6.129, {gi, ..., g m ] is a Grobner basis of I — (g i, ..., g m ) if every 
/ e I has remainder 0 mod G (where G is any wi-tuple formed by ordering the g,). 
The importance of the next theorem lies in its showing that it is necessary to compute the 
remainders of only finitely many polynomials, namely, the 5-polynomials, to determine 
whether {gi,..., g m } is a Grobner basis. 

Theorem 6.134 (Buchberger). A set {gi, ..., g m } is a Grobner basis of an ideal I = 
(gl, ..., g m ) if and only if S(g p , g q ) has remainder 0 mod G for all p, q, where G = 

[^l > • • • ’ gm ]• 

Proof. Clearly, S(g p . g q ), being a linear combination of g p and g q , lies in I. Hence, if 
G = {gi, ..., g m } is a Grobner basis, then S(g p , g q ) has remainder 0 mod G, by Proposi¬ 
tion 6.129. 

Conversely, assume that S(g p , g q ) has remainder 0 mod G for all p,q\ we must show 
that every / e I has remainder 0 mod G. By Proposition 6.129, it suffices to show that 
if / e /, then LT(g,) | LT(/) for some i. Since / e I — (gi,..., g,„), we may write 
f — Hi higi, and so 

Deg(/) ■< max{Deg(/f,g;)}. 

i 

If there is equality, then Deg(/) = Dcg(/i,g,) for some i, and so Proposition 6.125 gives 
LT(g ; ) | LT(/), as desired. Therefore, we may assume strict inequality: Deg(/) -< 
max, {Deg {hi g,)}. 

The polynomial / may be written as a linear combination of the g; in many ways. Of 
all the expressions of the form / = Hi hjgi, choose one in which <5 = max, {Deg(/f,g, )} 
is minimal (which is possible because < is a well-order). If Deg(/) = 8, we are done, as 
we have seen; therefore, we may assume that there is strict inequality: Deg(/) -< S. Write 

/= X! h J g J + 11 htgl ' 

j i 

Deg(/i jgj)=S Deg{li t g t )<8 

If Deg(^ ; hjgj ) = S, then Deg(/) = 8, a contradiction; hence, Deg(^ ; - hjgj) -< 8. But 
the coefficient of X s in this sum is obtained from its leading terms, so that 

Deg(^]LT(/i ; )g ; ) -<«. 

j 

Now Hj LT (hj)gj is a polynomial satisfying the hypotheses of Lemma 6.133, and so there 
are constants d ; and multidegrees p.(j) so that 


E LT (/*;)g; = HdjX s -M">S(gj,gj +1 ), 


(8) 
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where Deg S(gj , g;+i)j -< 8 . 19 

Since each S(gj , gj+i) has remainder 0 mod G, the division algorithm gives ajj(X) e 
k[X] with S(gj , g j+ 1 ) = a ji Si •> where Deg(ajigi) < Deg(S(gj, g j+ i)) for all j. i. It 
follows that 

X 5 -^S(gj,g j+l ) = J2 xS ~ llU)a jigi- 

i 

Therefore, Lemma 6.133 gives 

Deg (X s -^ajigi) < Deg (X s ~^S(gj, g j+l )) -< 8. (9) 

Substituting into Eq. (8), we have 

E LT (hj)gj = J2 d j x *~ ,l(j)s (8j’8j+i) 

j j 

= E J /(E^ 0 V^ 

j i 

= E(E (/ v x " Ii[j)a .n)si- 

i j 

If we denote JE djX 8 ~^^aji by h' t , then 

Elt(M W = Ea{«. (10) 

j i 

where, by Eq. (9), Dcgi/i'g,) -< <5 for all i. 

Finally, we substitute the expression in Eq. (10) into Eq. (7): 

/ = E h jsj+ E htgi 

j t 

Deg (hj gj )=8 Deg(li e gl)<S 

= LT(h j)sj+ E [ h j-m h j)]gj+ E h 

i j i 

Deg (hjgj)=S Deg(hjgj)=8 Deg th(ge)<8 

= E /l ^ + E u i j-m h j)]gj+ h ^- 

i j i 

Deg(lijgj)=S Deg (h ege )<8 

We have rewritten / as a linear combination of the gj in which each term has multidegree 
strictly smaller than 8, contradicting the minimality of 8. This completes the proof. • 


1 tJ The reader may wonder why we consider all S-polynomials Stgp, gq) instead of only those of the form 
S(gj, gi+ 1 ). The answer is that the remainder condition is applied only to those hjgj for which D egQijgj) = 8, 
and so the indices viewed as V s need not be consecutive. 
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Corollary 6.135. If I — (/i,..., f s ) in k[X], where each fj is a monomial (that is, if I 
is a monomial ideal), then {/l, ..., f s ] is a Grobner basis of I. 

Proof. By Example 6.132(ii), the 5-polynomial of any pair of monomials is 0. • 

Here is the main result: A Grobner basis of (fi,..., f s ) can obtained by adjoining 
remainders of 5-polynomials. 

Theorem 6.136 (Buchberger’s Algorithm). Every ideal / = (/),..., f s ) in k[X] has 
a Grobner basis 20 that can be computed by an algorithm. 

Proof. Here is a pseudocode for an algorithm. 

Input : B — {fi,..., f s } G = [f\,..., f s ] 

Output : a Grobner basis B = {g\,..., g „,} containing {fi, ■ ■ ■, f s ] 

B :={/i,..., /,} G:=[f u ...,fsi 

REPEAT 

B’ := B G' :=G 

FOR each pair g, g' with g g e B' DO 

r := remainder of S(g, g') mod G' 

IF r ^4 0 

THEN B B U {r} and G' = [gi,..., g m ,r] 

UNTIL B = B' 

Now each loop of the algorithm enlarges a subset B C. I — {g\,..., g m ) by adjoining the 
remainder mod G of one of its 5-polynomials S(g, g'). As g. g' e I, the remainder r of 
S(g, g') lies in I, and so the larger set B U (r) is contained in I. 

The only obstruction to the algorithm’s stopping at some B' is if some S(g, g') does not 
have remainder 0 mod G'. Thus, if the algorithm stops, then Theorem 6.134 shows that B' 
is a Grobner basis. 

To see that the algorithm does stop, suppose a loop starts with B' and ends with B. 
Since B' C B, we have an inclusion of monomial ideals 

(LT(g'):g' 6fi')c(LTfe) : ^8). 

We claim that if B' C B, then there is also a strict inclusion of ideals. Suppose that r is a 
(nonzero) remainder of some 5-polynomial mod B ', and that B = B' U {r}. By definition, 
the remainder r is reduced mod G', and so no term of r is divisible by LT(g') for any g' e 
B'\ in particular, LT(r) is not divisible by any LT(,sp'). Hence, LT(r) f (LT)#'): g' e B'), 
by Exercise 6.83 on page 410. On the other hand, we do have LT(r) e (LT(g): g e B). 
Therefore, if the algorithm does not stop, there is an infinite strictly ascending chain of 
ideals in k[X~\, and this contradicts the Hilbert basis theorem, for k[X \ has the ACC. • 

“°A nonconstructive proof of the existence of a Grobner basis can be given using the proof of the Hilbert basis 
theorem; for example, see Section 2.5 of Cox-Little-O’Shea, Ideals, Varieties, and Algorithms (they also give a 
constructive proof in Section 2.7). 
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Example 6.137. 

The reader may show that B' — {y 2 + z 2 , x 2 y + yz, z 3 + xy] is not a Grobner basis 
because S(y 2 + z 2 . x 2 y + yz) — x 2 z 2 — y 2 Z does not have remainder 0 mod G'. However, 
adjoining x 2 z 2 — y 2 Z does give a Grobner basis B because all the 5-polynomials in B 
[there are (^) = 6 of them] have remainder 0 mod B'. < 

Theoretically, Buchberger’s algorithm computes a Grobner basis, but the question arises 
how practical it is. In very many cases, it does compute in a reasonable amount of time; 
on the other hand, there are examples in which it takes a very long time to produce its 
output. The efficiency of Buchberger’s algorithm is discussed in Section 2.9 of Cox-Little- 
O’Shea, Ideals, Varieties, and Algorithms. 

Corollary 6.138. 

(i) If I — (/j, .... f t ) is an ideal in k[X], then there is an algorithm to determine 
whether a polynomial h(X) e k[A] lies in I. 

(ii) If I — (/i, ..., ft) C k[X], then there is an algorithm to determine whether a 
polynomial g(X) e k[X) lies in yT. 

(iii) If I = (/i, ...,/)) and I' = (/[, ..., f' s ) are ideals in k[X\ then there is an 
algorithm to determine whether I = I'. 

Proof (i) Use Buchberger’s algorithm to find a Grobner basis B of /, and then use the 
division algorithm to compute the remainder of h mod G (where G is any m-tuple arising 
from ordering the polynomials in B). By Corollary 6.131(h), h e I if and only if r — 0. 

(ii) Use Exercise 6.66 on page 398 and then use Buchberger’s algorithm to find a Grobner 

basis of (/i-- ft, 1 - yg) in k[X, y], 

(iii) Use Buchberger’s algorithm to find Grobner bases {gi,..., g m } and {gj,..., g' m } of 

I and / ', respectively. By part (i), there is an algorithm to determine whether each g'. e /, 
and /' C I if each g'. e I. Similarly, there is an algorithm to determine the reverse 
inclusion, and so there is an algorithm to determine whether I = V. • 

A Grobner basis B = {gi,..., g,„} can be too large. For example, it follows from the 
very definition of Grobner basis that if / e /, then B U {/} is also a Grobner basis of /; 
thus, we may seek Grobner bases that are, in some sense, minimal. 

Definition. A basis {gi, ..., g m } of an ideal I is reduced if 

(i) each g ; - is monic; 

(ii) each g ; - is reduced mod {gi,..., gi, -g m }. 


Exercise 6.90 on page 421 gives an algorithm for computing a reduced basis for every 
ideal (fi,, ft). When combined with the algorithm in Exercise 6.93 on page 422, 
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it shrinks a Grobner basis to a reduced Grobner basis. It can be proved that a reduced 
Grobner basis of an ideal is unique. In the special case when each f (X) is linear, that is, 


fi(X) = anxi H-b a in x n . 

then the common zeros Var(/i ,..., f t ) are the solutions of a homogeneous system of t 
equations in n unknowns. If A — [a, ; ] is the t x n matrix of coefficients, then it can be 
shown that the reduced Grobner basis corresponds to the row-reduced echelon form for 
the matrix A (see Section 10.5 of Becker-Weispfenning, Grobner Bases). Another special 
case occurs when when f\,..., f t are polynomials in one variable. The reduced Grobner 
basis obtained from { f \,...,/,} turns out to be their gcd, and so the euclidean algorithm 
has been generalized to polynomials in several variables. 

Corollary 6.138 does not begin by saying “If I is an ideal in k[X]”; instead, it specifies 
a basis: / = (f \,..., f t ). The reason, of course, is that Buchberger’s algorithm requires a 
basis as input. For example, if / = (hi,..., h s ), then the algorithm cannot be used directly 
to check whether a polynomial f(X ) lies in the radical sfj , for we do not have a basis of 
\[J. The book of Becker-Weispfenning, Grobner Bases, gives an algorithm computing a 
basis of \[J (page 393) when k satisfies certain conditions. There is no algorithm known 
that computes the associated primes of an ideal, although there are algorithms to do some 
special cases of this general problem. As we mentioned at the beginning of this section, if 
an ideal I has a primary decomposition I — Q\ fl- • - r\Q r , then the associated prime P, has 
the form *J(I : c,) for any c; e fj j+i Qj and G f Qi ■ There is an algorithm computing 
a basis of colon ideals (Becker-Weispfenning, Grobner Bases, page 266). Thus, we could 
compute Pi if there were an algorithm finding elements c,. For a survey of applications of 
Grobner bases to various parts of mathematics, the reader should see Buchberger-Winkler, 
Grobner Bases and Applications. 

We end this chapter by showing how to find a basis of an intersection of ideals. 

Given a system of polynomial equations in several variables, one way to find solutions 
is to eliminate variables (van der Waerden, Modern Algebra II, Chapter XI). Given an ideal 
I C k[X], we are led to an ideal in a subset of the indeterminates, which is essentially the 
intersection of Var(7) with a lower-dimensional plane. 

Definition. Let k be a field and let I C k[X, Y] be an ideal, where k[X, Y] is the poly¬ 
nomial ring in disjoint sets of variables AUK. The elimination ideal is 

I x = IDk[X]. 


For example, if I — (x 2 , xy), then a Grobner basis is \x 2 . xy } (they are monomials, so 
that Corollary 6.135 applies), and I x = (x 2 ) C £[x], while I y = {0}. 

Proposition 6.139. Let k be afield and let k[X\ — k[x \,.... x n ] have a monomial order 
for which x\ >- xi >■ • • • >- x„ (for example, the lexicographic order) and, for fixed p > 1, 
let Y — x p , .. ., x n . If I C k[X) has a Grobner basis G — {gi, ..., g m }, then G fl Iy is a 
Grobner basis for the elimination ideal Iy = I fl k[x p , ..., x n ]. 
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Proof. Recall that {gi, ..., g m } being a Grobner basis of / = (gi,..., g m ) means that for 
each nonzero / e /, there is gj with LT(g,-) | LT(/). Let f (x p ,..., x„) e ly be nonzero. 
Since ly c /, there is some gi ( X ) with LT (gj ) | LT(/); hence, LT(g ; ) involves only 
the “later” variables x p ,... ,x n . Let Deg(LT(g,)) = ft. If g, has a term c a X a involving 
“early” variables x* with i < p, then a > ft, because xi >-•••>- x p >-•••>- x„. This is a 
contradiction, for ft. the Degree of the leading term of gj, is greater than the Degree of any 
other term of gj. It follows that gj e k[x p ,..., x n ]. Exercise 6.92 on page 422 now shows 
that G (T k[x p ,..., x n ] is a Grobner basis for ly — I fl k[x p , ..., x n ]. • 

We can now give Grobner bases of intersections of ideals. 

Proposition 6.140. Let k be a field, and let I\,..., I t be ideals in k[X], where X = 
x\, ..., x n . 

(i) Consider the polynomial ring k[X, y\, ..., y t ] having a new variable yj for each j 
with 1 < j < t. If J is the ideal in k[X, y\,..., y f ] generated by 1 — (yi + • • • + y t ) 
and yj I j, for all j, then H /=i Ij = Jx- 

(ii) Given Grobner bases of I\, ..., I t , a Grobner basis off' |J =1 Ij can be computed. 

Proof, (i) If / = f(X) e Jx = J fl k[X\, then / e /, and so there is an equation 

f(X) - g(X, F)(l -£yj) + Y J hj{X, y x ,..., y t )y jqj {X), 

j 

where g, hj e k[X, Y] and q j e I j. Setting yj = 1 and the other y’s equal to 0 gives / = 
h j(X, 0,..., 1,..., 0 )qj(X). Note that h j(X, 0,..., 1,..., 0) e £[2f], and so / e Ij. 
As j was arbitrary, we have / e P| 0’ anc * so Jx ^ Pi Ij- 
For the reverse inclusion, if f & f] I j, then the equation 

/ = /(!- J2 y j )+ J2yjf 

j 

shows that / e Jx, as desired. 

(ii) This follows from part (i) and Proposition 6.139 if we use a monomial order in which 
all the variables in X precede the variables in Y. • 


Example 6.141. 

Consider the ideal I — (x) fl (x 2 , xy, y 2 ) C £[x, y], where k is a field, that we considered 
in Example 6.117(h). Even though it is not difficult to find a basis of I by hand, we shall 
use Grobner bases to illustrate Proposition 6.140. Let u and v be new variables, and define 

2 9 

J = (1 — u — v, mx, vx , vxy, vy ) c k[x, y, u, u]. 

The first step is to find a Grobner basis of /; we use the lex monomial order with x < y < 
it -< v. Since the 5-polynomial of two monomials is 0, Buchberger’s algorithm quickly 
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gives a Grobner basis 21 G of /: 

G — {i> + u — 1, x 2 , yx, ux, uy 2 — y 2 }. 

It follows from Proposition 6.139 that a Grobner basis of 7 is GGk[x, v]: all those elements 
of G that do not involve the variables u and v. Thus, 

/ = (x) fl (x 2 , xy, y 2 ) — (x 2 , xy). ◄ 


We mention that Grobner bases can be adapted to noncommutative rings. A. I. Shirsov 
began investigating whether there are analogs holding for rings of polynomials in several 
noncommuting variables, with the aim of implementing algorithms to solve problems in 
Lie algebras. 

Exercises 

Use the degree-lexicographic monomial order in the following exercises. 

6.85 Let 7 = (y — x 2 , z — x 3 ). 

(i) Order x -< y -< z, and let ;<i ex be the corresponding monomial order on N 3 . Prove that 
[y — x 2 , z — x 3 ] is not a Grobner basis of 7. 

(ii) Order y -c z -< x, and let ;<i ex be the corresponding monomial order on N 3 . Prove that 
[y — x 2 , z — x 3 ] is a Grobner basis of 7. 

6.86 Find a Grobner basis of 7 = (x 2 — 1, xy 2 — x). 

6.87 Find a Grobner basis of 7 = (x 2 + y, x 4 + 2x 2 y + y 2 + 3). 

6.88 Find a Grobner basis of 7 = (xz, xy — z, yz — x). Does x 3 + x + 1 lie in 7? 

6.89 Find a Grobner basis of 7 = (x — y, y — x, x~y — xy). Does x + x + 1 lie in 7? 

6.90 Show that the following pseudocode gives a reduced basis Q of an ideal 7 = (f \, ..., f t ). 

Input: P = [/i, ..., /,] 

Output: Q = [q\ . q s \ 

Q — P 

WHILE there is q e Q which is 

not reduced mod Q — [q] DO 
select q e Q which is not reduced mod Q — {q} 

Q-=Q-lq) 

h the remainder of q mod Q 
IF h 7 ^ 0 THEN 
Q:=QU{h) 

END IF 
END WHILE 
make all q e Q rnonic 

21 This is actually the reduced Grobner basis given by Exercise 6.93 on page 422. 
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6.91 If G is a Grobner basis of an ideal I, and if Q is the basis of / obtained from the algorithm in 
Exercise 6.90, prove that Q is also a Grobner basis of I. 

6.92 Let / be an ideal in k[X], where k is a field and k[X] has a monomial order. Prove that if a set 
of polynomials (gj, ..., g m ) Q I has the property that, for each nonzero / e I, there is some 
gi with LT(g,) | LT(/), then I = (g\, ■ ■ ■, g m )- Conclude, in the definition of Grobner basis, 
that one need not assume that I is generated by gj, ..., g m . 

6.93 Show that the following pseudocode replaces a Grobner basis G with a reduced Grobner ba¬ 
sis H. 

Input: G = {gi,...,gm} 

Output: H 
H := 0; F -G 
WHILE F / 0 DO 
select f' from F 
F:=F-{f ! 

IF LT(/) f LT(/') for all f e F AND 
LT (h) f LT (/') for all he H THEN 
H-.= HU{f'} 

END IF 
END WHILE 

apply the algorithm in Exercise 6.90 to H 
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Modules and Categories 


We now introduce R-modules , where R is a commutative ring; formally, they generalize 
vector spaces in the sense that scalars are allowed to be in R instead of a field. If R is a 
PID, then we shall see, in Chapter 9, that classification of finitely generated R -modules 
simultaneously gives a classification of all finitely generated abelian groups as well as a 
classification of all linear transformations on a finite-dimensional vector space by canonical 
forms. After introducing noncommutative rings in Chapter 8, we will define modules over 
these rings, and they will be used, in an essential way, to prove that every finite group of 
order p m q n , where p and q are primes, is a solvable group. 

Categories and functors first arose in algebraic topology, where topological spaces and 
continuous maps are studied by means of certain algebraic systems (homology groups, 
cohomology rings, homotopy groups) associated to them. Categorical notions have proven 
to be valuable in purely algebraic contexts as well; indeed, it is fair to say that the recent 
great strides in algebraic geometry could not have occurred outside a categorical setting. 


7.1 Modules 

An /^-module is just a “vector space over a ring R"; that is, in the definition of vector 
space, allow the scalars to be in R instead of in a field. 

Definition. Let R be a commutative ring. An R-module is an (additive) abelian group M 
equipped with a scalar multiplication R x M —> M, denoted by 

( r , m) i —> rm , 

such that the following axioms hold for all m, m' e M and all r. r', 1 e R: 

(i) r(m + in') = rm + rm 

(ii) (r + r')m — rm + r'm\ 

(iii) (rr')m = r{r'm)\ 

(iv) bn — m. 
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Remark. This definition also makes sense for noncommutative rings R, in which case 
M is called a left R-module. ◄ 


Example 7.1. 

(i) Every vector space over a field k is a /.'-module. 

(ii) By the laws of exponents. Proposition 2.23, every abelian group is a Z-module. 

(iii) Every commutative ring R is a module over itself if we define scalar multiplication 
R x R R to be the given multiplication of elements of R. More generally, every ideal 
I in R is an /^-module, for if i e / and r e R. then ri e I. 

(iv) If S is a subring of a commutative ring R , then R is an 5-module, where scalar multi¬ 
plication S x R -> R is just the given multiplication (s,r) i-o* sr. For example, if k is a 
commutative ring, then k[X] is a k-module. 

(v) Let T: V —»• V be a linear transformation, where Visa finite-dimensional vector space 
over a field k. The vector space V can be made into a k[x]-module if scalar multiplication 
k[jr] x V —> V is defined as follows: If f(pc) = Y^''Lo c ‘ x ‘ ^ es ' n then 

m m 

f{x) v — £>')., = 7>r‘0>), 

i= 0 i=0 

where T° is the identity map 1 y , and T' is the composite of T with itself i times if i > 1. 
We denote V viewed as a k[x]-module by V 1 . 

Here is a special case of this construction. Let A be an n x n matrix with entries in k, 
and let T : k n —> k" be the linear transformation T(w) = Aw , where w is ail n x 1 column 
vector and Aw is matrix multiplication. Now the vector space k" is a k[x]-module if we 
define scalar multiplication k[x] x k n -> k n as follows: If f(pc) — Y1 T=o CiX ‘ e then 

m m 

f{x)w — (Y,CiX l )w = CjA‘w, 
i= 0 i= 0 

where A 0 = I is the identity matrix, and A 1 is the / th power of A if i > 1. We now 
show that (k n ) T — (k n ) A . Both modules are comprised of the same elements (namely, 
all /;-tuples), and the scalar multiplications coincide: In (k") T , we have xw — T(w)\ in 
(, k n ) A , we have xw — Aw, these are the same because T(w) — Aw. ◄ 

Here is the appropriate notion of homomorphism. 

Definition. If R is a ring and M and N are /^-modules, then a function /: M -* N is an 
R-homomorphism (or R-map) if, for all m, m' e M and all re R, 

(i) /(m + m') = f(m) + 

(ii) firm) = rf(m). 
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If an R -homomorphism is a bijection, then it is called an R-isomorphisnr, A 1 -modules 
M and N are called isomorphic , denoted by M = N, if there is some //-isomorphism 
f: M —*■ N. 

Note that the composite of //-homomorphisms is an //-homomorphism and, if / is an 
//-isomorphism, then its inverse function / -1 is also an //-isomorphism. 

Example 7.2. 

(i) If R is a field, then //-modules are vector spaces and //-maps are linear transformations. 
Isomorphisms here are nonsingular linear transformations. 

(ii) By Example 7.1 (ii), Z-modules are just abelian groups, and Lemma 2.52 shows that 
every homomorphism of (abelian) groups is a Z-map. 

(iii) If M is an //-module and r e //, then multiplication by r (or homothety by r ) is the 
function p r : M -> M given by m i-> rm. 

The functions //,- are //-maps because R is commutative: If a <= R and m e M. then 
p, r (am) = ram while ap, (m) = arm. 

(iv) Let T: V —> V be a linear transformation on a vector space V over a field k, let 
vi,..., v n be a basis of V, and let A be the matrix of T relative to this basis. We now show 
that the two fe[x]-modules V 1 and (k n ) A are isomorphic. 

Define ip: V —> k n by <p(vi) = e;, where e \,..., e n is the standard basis of k"; the 
linear transformation <p is an isomorphism of vector spaces. To see that <p is a /c[v]-map, it 
suffices to prove that (p(f(x)v) = f(x)(p(v) for all f(x) e A:[x] and all v e V. Now 

q>(xvi) = <p(T(vj )) 

= ( p(J2 a n v j) 

= ^UjiViVj) 

= J2 a i ie j' 

which is the ;th column of A. On the other hand, 

xcpivj) — A(p(vj) — Aej, 

which is also the /th column of A. It follows that cp(xv) — xcp(v) for all v e V, and we 
can easily prove, by induction on deg(/), that (p{f{x)v ) = f(x)(p(v) for all f(x) e k[x ] 
and all v e V. ◄ 

The next proposition generalizes the last example. 

Proposition 7.3. Let V be a vector space over a field k, and let T , S : V —> V be 
linear transformations. Then the k[x]-modules V T and V s in Example 7.1(v) are A:[x]- 
isomorphic if and only if there is a vector space isomorphism cp: V —> V with 

S = tpTtp~ l . 
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Proof. If (p: V T V s is a k[x]-isomorphism, then cp: V —> V is an isomorphism of 
vector spaces with 

(p(f(x)v) = f(x)<p(v) 

for all v e V and all f(x) e k[x). In particular, if fix) = x, then 

(p{xv) — xcpiv). 

But the definition of scalar multiplication in V T is xv — T(v). while the definition of 
scalar multiplication in V s is xv — S(v). Hence, for all v e V, we have 

cp(T(v)) = S(<p(v )). 

Therefore, 

<pT = Sep. 

As tp is an isomorphism, we have the desired equation S — (pT(p~ l . 

Conversely, we may assume <p(f(x)v) — f(x)<piv) in the special cases deg(/) < 1: 

<p(xv) = (pT(v ) = Scpi v) = xcp(v). 

Next, an easy induction shows that <p(x n v) = x n (p{v), and a second easy induction, on 
deg (f), shows that cpiffx)v) — fix)cpiv). • 

It is worthwhile making a special case of the proposition explicit. The next corollary 
shows how comfortably similarity of matrices fits into the language of modules (and we 
will see, in Chapter 9, how this contributes to finding canonical forms). 

Corollary 7.4. Let k be afield, and let A and B be n x n matrices with entries in k. Then 
the k[x]-modules ik n ) A and fk n ) B in Example 7.1(v) are k[x]-isomorphic if and only if 
there is a nonsingular matrix P with 

B = PAP~ l . 

Proof Define T : k" -> k" by T (y) = Ay, where y e k" is a column; by Example 7. l(v), 
the A:[x]-module ik n ) T — ik n ) A . Similarly, define S: k" -> k n by Siy) = By, and denote 
the corresponding k[x]-module by ik") B . The proposition now gives an isomorphism 
cp: V T V s with 

(piAy) = B(piy). 

By Proposition 3.94, there is an n x n matrix P with <p(y) = Py for all y e k" (which is 
nonsingular because <p is an isomorphism). Therefore, 

PAy = BPy 

for all y e k" , and so 

PA = BP\ 

hence, B = PAP~ l . 

Conversely, the nonsingular matrix P gives an isomorphism <p\ k n —»• k n by <piy) — Py 
for all y e k". The proposition now shows that q>\ ik n ) A -> ik n ) B is a k[x]-module 
isomorphism. • 

Homomorphisms can be added. 
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Definition. If M and N are /^-modules, then 

Horn r(M, N ) = {all /J-homomorphisms M N}. 

If /, g e Hom^(M, N ), then define / + g: M —> N by 

f + g:m f(m) + g(m). 

Proposition 7.5. If M and N are R-modules, where R is a commutative ring, then 
Hom^(M, N) is an R-module, where addition has just been defined, and scalar multi¬ 
plication is given by 

rf: m H> firm). 

Moreover, there are distributive laws: If p: M' -» M and q: N -> N', then 
(/ + g)P = fP + gp and q if + g) = qf + qg 
for all f,ge Hom^(M, N ). 

Proof. Verification of the axioms in the definition of /^-module is straightforward, but we 
present the proof of 

(rr')f = rir'f) 

because it uses commutativity of R. 

If m e M, then )rr')f : m i-^ firr'm). On the other hand, rfr' f): m ir' f )irm) — 

fir'rm). Since R is commutative, rr' = r'r, and so irr')f — rir'f). • 

Example 7.6. 

In linear algebra, a linear functional on a vector space V over a field k is a linear trans¬ 
formation cp: V k [after all, k is a (one-dimensional) vector space over itself]. For 
example, if 

V — {continuous /: [0, 1] —»■ R}, 

then integration, / i-> f Q l fit) dt, is a linear functional on V. 

If V is a vector space over a field k, then its dual space is the set of all linear functionals 
on V: 

V* =Hom k iV,k). 

By the proposition, V* is also a ^-module; that is, V* is a vector space over k. < 

We now show that constructions made for abelian groups and for vector spaces can also 
be made for modules. A submodule S is an R -module contained in a larger /^-module M 
such that if s, s' e S and r e R, then ,v + s' and rs have the same meaning in S as in M. 

Definition. If M is an R -module, then a submodule N of M, denoted by N C M, is an 
additive subgroup N of M closed under scalar multiplication: rn e N whenever n e N 
and r e R. 
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Example 7.7. 

(i) Both {0} and M are submodules of a module M. A proper submodule of M is a 
submodule ACM with A /M. In this case, we may write ACM. 

(ii) If a commutative ring R is viewed as a module over itself, then a submodule of R is an 
ideal; I is a proper submodule when it is a proper ideal. 

(iii) A submodule of a Z-module (i.e., of an abelian group) is a subgroup, and a submodule 
of a vector space is a subspace. 

(iv) A submodule W of V T , where T : V -> V is a linear transformation, is a subspace W 
of V with T(W ) C W (it is clear that a submodule has this property; the converse is left as 
an exercise for the reader). Such a subspace is called an invariant subspace. 

(v) If M is an /^-module and r e R. then 

rM — {rm: m e M} 


is a submodule of M. 

Here is a related construction. If J is an ideal in R and M is an /?-module, then 
JM = { jiiTij: ji e J and mi e M} 

i 

is a submodule of M. 

(vi) If S and T are submodules of a module M, then 

S + T = {s + f:seS and t e T] 
is a submodule of M which contains S and T. 

(vii) If {Sj: i e 1} is a family of submodules of a module M, then H, <= / Si is a submodule 
of M. 

(viii) If M is an R -module and m e M, then the cyclic submodule generated by in. denoted 
by (m), is 

( m) — {rm : r e R}. 

More generally, if A is a subset of an R -module M, then 

{X) — J ^2 r i x i • r i e R an d x i e X } - 

finite 

the set of all R-linear combinations of elements in X. We call (A) the submodule gener¬ 
ated by X. See Exercise 7.2 on page 440. ◄ 


Definition. A module M is finitely generated if M is generated by a finite set; that is, if 
there is a finite subset A = {xi,..., x„] with M = (A). 

For example, a vector space is finitely generated if and only if it is finite-dimensional. 
We continue extending definitions from abelian groups and vector spaces to modules. 
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Definition. If /: M —»• N is an //-map between //-modules, then 
kernelf — ker/ = [m e M: f(m) = 0} 


and 

image f — im/ = {n e N: there exists m e M with n = f(m)}. 

It is routine to check that ker / is a submodule of M and that im / is a submodule of 
N. Suppose that M — (X)\ that is, M is generated by a subset X. Suppose further that N 
is a module and that /, g: M zV are //-homomorphisms. If / and g agree on X [that 
is, if fix) = g{x) for all x e X], then f — g. The reason is that / — g: M —»■ N, defined 
by / — g : m i-> f(m) — g{m), is an //-homomorphism with X C ker(/ — g). Therefore, 
M = (X) C ker(/ — g), and so / — g is identically zero; that is, f — g. 

Definition. If N is a submodule of an //-module M, then the quotient module is the 
quotient group M/N (remember that M is an abelian group and N is a subgroup) equipped 
with the scalar multiplication 


rim + N ) = rm + N. 

The natural map. jt : M —> M/N, given by m m + N, is easily seen to be an //-map. 

Scalar multiplication in the definition of quotient module is well-defined: If m + N = 
m' + N, then m — m' e N, hence rim — m') e N (because N is a submodule), and so 
rm — rm' e N and rm + N = rm' + N. 

Theorem 7.8 (First Isomorphism Theorem). If f: M —> N is an R-map of modules, 
then there is an R-isomorphism 


(p: M/ ker / -> im / 


given by 


(p: m + ker / fim). 


Proof. If we view M and N only as abelian groups, then the first isomorphism theorem 
for groups says that (p: M / ker / -> im / is an isomorphism of abelian groups. But <p 
is an //-map: (pirfn + N)) = q>irm + N) = firm ); since / is an //-map, however, 
firm) — rf im) = repim + N), as desired. • 


The second and third isomorphism theorems are corollaries of the first one. 


Theorem 7.9 (Second Isomorphism Theorem). IfS and T are submodules of a module 
M, then there is an R-isomorphism 

S/iS n T) -> iS + T)/T. 

Proof. Let u : M —» M/T be the natural map, so that ker7r = T ; define /; = jt\ S, so 
that h: S —»■ M/T. Now 


ker/z — S HT 
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and 

im h = (S + T)/T 

[for (S + T)/T consists of all those cosets in M/T having a representative in 5]. The first 
isomorphism theorem now applies. • 

Theorem 7.10 (Third Isomorphism Theorem). IfT C S C M is a tower of submod¬ 
ules, then there is an R-isomorphism 

(M/T)/(S/T) —»■ M/S. 

Proof. Define the map g: M/T M/S to be coset enlargement ; that is, 

g: m + T i —y m T S. 

Now g is well-defined: If m + T — in' + T. then m — in' e T C S and m + S — in' + S. 
Moreover, 

kerg = S/T 
and 

im g = M/S. 

Again, the first isomorphism theorem completes the proof. • 

If /: M -> N is a map of modules and if S C ,V, then the reader may check that 

f-\S) = {meM-. f(m) e S] 
is a submodule of M containing ker /. 

Theorem 7.11 (Correspondence Theorem). IfT is a submodule of a module M, then 
there is a Injection 

(,o\ {intermediate submodules TC.SC. M} —> {submodules of M/T) 

given by 

S S/T. 

Moreover, S c S' in M if and only if S/T C S'/T in M/T. 
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M 



Proof. Since every module is an additive abelian group, every submodule is a subgroup, 
and so the correspondence theorem for groups. Theorem 2.76, shows that <p is an injection 
that preserves inclusions: S C S' in M if and only if S/T C S' jT in ,47/7’. The remainder 
of this proof is a straightforward adaptation of the proof of Proposition 6.1; we need check 
only that additive homomorphisms are now A 1 -maps. • 


Proposition 7.12. An R-module M is cyclic if and only if M = R/l for some ideal I. 

Proof. If M is cyclic, then M = (m) for some m e M. Define /:/?-» M by f(r) = 
rm. Now / is surjective, since M is cyclic, and its kernel is some ideal 1. The first 
isomorphism theorem gives R/I = M. 

Conversely, R/I is cyclic with generator 1 + 7, and any module isomorphic to a cyclic 
module is itself cyclic. • 


Definition. A module M is simple (or irreducible ) if M f {0} and M has no proper 
nonzero submodules; that is, the only submodules of M are {0} and M. 


Example 7.13. 

By Proposition 2.107, an abelian group G is simple if and only if G = I p for some 
prime p. ■* 


Corollary 7.14. An R-module M is simple if and only if M = R/I, where I is a maximal 
ideal. 


Proof. This follows from the correspondence theorem. • 
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Thus, the existence of maximal ideals guarantees the existence of simple modules. 

The notion of direct sum, already discussed for vector spaces and for abelian groups, 
extends to modules. Recall that an abelian group G is an internal direct sum of subgroups 
S and T if S + T — G and S D T — {0}, while an external direct sum is the abelian 
group whose underlying set is the cartesian product S x T and whose binary operation is 
pointwise addition; both versions give isomorphic abelian groups. The internal-external 
viewpoints persist for modules. 

Definition. If S and T are R -modules, where R is a commutative 1 ring, then their direct 
sum , denoted 2 by S U T, is the cartesian product S x T with coordinatewise operations: 

(s , t ) T (s , t ) = (.v T s , t -f- 1 ); 
r(s, t) — ( rs , rt), 

where s, s' e S, t , t' e T, and r e R. 

There are injective R-maps /,<,: S —»■ S U T and Xj : T —> S U T given, respectively, 
by : s i->- (s, 0 ) and A 7 -: t h-> ( 0 , t). 

Proposition 7.15. The following statements are equivalent for R-modules M, S, and T. 

(i) S u T = M. 

(ii) There exist injective R-maps i: S —»■ M and j : T —> M such that 

M = imi + im j and im i n im j — {0}. 

(iii) There exist R-maps i: S -> M and j : T —>• M such that, for every m e M, there 
are unique s e S and t e T with 


m = is + jt. 

(iv) There are R-maps i: S —> M, j: T —> M, p : M —> S, and q : M —> T such that 
pi — Is, qj — It, pj — 0, qi — 0, and ip + jq — Im- 


Remark. The maps i and j are called injections , and the maps p and q are called projec¬ 
tions. The equations pi — 1 5 and qj = I 7 - show that the maps i and j must be injective 
(so that im / = S and im j = T) and the maps p and q must be surjective. < 


1 Modules over noncommutative rings are defined in the next chapter. 

2 Other common notations are S ®T and S x T. 
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Proof, (i) =>• (ii) Let (p : S U T -> M be an isomorphism, and define i — q>X$ [where 
A,?: j h> ( s , 0)] and j = <pXj [where Xj: t m* (0, f)]- Both i and j are injections, being 
the composites of injections. If m e M , there is a unique ordered pair (.v, t ) e S U T with 
m — tpiis, t)). Hence, 

m — <p((s, t)) — <p((s, 0) + (0, t)) = <pXs(s) + (pXj(t) — is + jt e im/ + im j. 

If x e im/ fl im j, then is — jt for s e S and t e T\ that is, q>X${s) = cpXrit ). Since tp is 
an isomorphism, we have (s, 0) = X$(s) = X-pf) — (0, t) in S u T. Therefore, s = 0 = f, 
x = 0, and im i n im j — {0}. 

(ii) =>■ (iii) Given m e M. an expression of the form m = is + jt exists, by part (ii), and 
so we need prove only uniqueness. If also m = is' + jt', then i(s — s') = jit' — t) e 
im i n im j = {0}. Therefore, i (s — s') — 0 and j (t — t') — 0. Since i and j are injections, 
we have s = s' and t — t'. 

(iii) =>■ (iv) If m e M, then there are unique s e S and t e T with m — is + jt. The 
functions p and q, defined by 

pirn) — s and q(m) — t, 

are thus well-defined. It is routine to check that p and q are A 1 -maps and that the first four 
equations in the statement hold (they follow from the definitions of p and q). For the last 
equation, if m e M, then m = is + jt, and ip(m) + jq(m) = is + jt — m. 

(iv) =>• (i) Define <p\ S U T —> M by <p : (s, t) i-^ is + jt. It is easy to see that <p is an 
/Gmap; <p is surjective because 1 m = ip + ./<?■ To see that cp is injective, suppose that 
<pi(s, t)) = 0, so that is — —jt. Now s = pis — —pjt — 0 and —t = —qjt — qis — 0, 
as desired. • 

Internal direct sum is probably the most important instance of a module isomorphic to 
a direct sum. 

Definition. If S and T are submodules of a module M, then M is their internal direct 
sum if M L S LJ 7’ with i: S -> M and j : T —> M the inclusions. We denote an internal 
direct sum by 

M — S ® T. 

In this chapter only, we will use the notation S U T to denote the external direct sum 
(underlying set the cartesian product of all ordered pairs) and the notation M — S © T to 
denote the internal direct sum (S and T submodules of M as just defined). Later, we shall 
write as the mathematical world writes: The same notation S © T is used for either version 
of direct sum. 

Here is a restatement of Proposition 7.15 for internal direct sums. 

Corollary 7.16. The following conditions are equivalent for an R-module M with sub- 
modules S and T. 


(i) M = S@T. 
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(ii) S + T — M and SHT = {0}. 

(iii) Each m e M has a unique expression of the form m = s + t for s e S and t e T. 
Proof This follows at once from Proposition 7.15 by taking i and j to be inclusions. • 

Definition. A submodule S of a module M is a direct summand of M if there exists a 
submodule T of M with M = S © T. 

The next corollary will connect direct summands with a special type of homomorphism. 

Definition. If S is a submodule of an /?-module M, then S is a retract of M if there exists 
an R -homomorphism p: M —> .S', called a retraction, with p (,v) = s for all ,v e S. 

Retractions in nonabelian groups arose in Exercise 5.72 on page 318. 

Corollary 7.17. A submodule S of a module M is a direct summand if and only if there 
exists a retraction p : M —> S. 

Proof In this case, we let i: S —»• M be the inclusion. We show that M — S © T , where 
T — ker p. If m e M, then m = (m — pm) + pm. Plainly, pm e imp = S. On the other 
hand, p(m — pm) — pm — ppm = 0, because pm e S and so ppm = pm. Therefore, 
M = S + T. 

If m e S, then pm = m; if m e T — ker p, then pm = 0. Hence, if m e S fl T, then 
m — 0. Therefore, SOT — {0}, and M — S © T. 

For the converse, if M — S © T, then each m e M has a unique expression of the form 
m — s + t, where s e S and t e T, and it is easy to check that p: M -» S, defined by 

p: s + t s, is a retraction M -» S. • 

Corollary 7.18. If M = S @T and S c A C M, then A = S © (A fl T). 

Proof. Let p: M -» S be the retraction s + t s. Since S C A , the restriction 

p\ A: A —> S is a retraction with ker p\A = A fl T. • 

The direct sum construction can be extended to finitely many submodules. There is an 
external and internal version. 

Definition. Let .S’|. S„ be /^-modules. Define the external direct sum 

Si u • • • u S„ 


to be the /^-module whose underlying set is the cartesian product S\ x ■ ■ ■ x S n and whose 
operations are 
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Let M be a module, and let .S'], ..., S n be submodules of M. Define M to be the internal 
direct sum 

M = Si © • • • © S n 

if each m e M has a unique expression of the form m = s 1 + • • • + s n , where ,v,- e Sj for 
all i — 1 ,... ,n. 

We let the reader prove that both internal and external versions, when the former is 
defined, are isomorphic. 

For example, if V is an n -dimensional vector space over a field k, and if vi, ..., v n is a 
basis, then 

V = (vi) © • • • © (v n ). 

If Si,..., S„ are submodules of a module M, when is (Si . S n ), the submodule 

generated by the Sj, equal to their direct sum? A common mistake is to say that it is 
enough to assume that Sj fl Sj — {0} for all i ^ j, but Example 5.3 on page 251 shows 
that this is not enough. 

Proposition 7.19. Let M — .S’] + • • • + S n , where the Sj are submodules', that is, each 
m € M has a (not necessarily unique ) expression of the form 


m — s i H-b s„. 


where Sj € Sj for all i. Then M = © • • • © S n if and only if for each i, 

Sj PI (Si + • • • + Sj + • • • + S n ) = {0}, 
where Sj means that the term Sj is omitted from the sum. 

Proof. A straightforward adaptation of Proposition 5.4. See Exercise 7.79 on page 519 
for the generalization of this proposition for infinitely many submodules. • 

Here is the last definition in this dictionary of modules. 

Definition. A sequence of /Lumps and /?-modules 

, , Jn+ 1 . . fn , . 

• • • > M n +i > M n > M n —i > • • • 

is called an exact sequence 3 if im f n+ i = ker /„ for all n. 

f 8 

Observe that there is no need to label an arrow 0 -> A or B —> 0 for, in either case, 
there is a unique map, namely, /: 0 i->- 0 or the constant homomorphism g(b) = 0 for all 
beB. 4 

Here are some simple consequences of a sequence of homomorphisms being exact. 

3 This terminology comes from advanced calculus, where a differential form co is called closed if dco = 0 and 
it is called exact if a> = dh for some function h (see Proposition 9.146 on page 753). The term was coined by the 
algebraic topologist W. Hurewicz. It is interesting to look at the book by Hurewicz-Wallman, Dimension Theory, 
which was written just before this coinage. We can see there many results that would have been much simpler to 
state had the word exact been available. 

4 In diagrams, we usually write 0 instead of {0}. 
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Proposition 7.20. 

(i) A sequence 0 —> A 4 B is exact if and only if f is injective. 

g 

(ii) A sequence B —> C —> 0 is exact if and only if g is surjective. 

h 

(iii) A sequence 0 —> A —> B —> 0 is exact if and only ifh is an isomorphism. 

Proof, (i) The image of 0 —»• A is {0}, so that exactness gives ker / = {0}, and so / is 

injective. Conversely, given /: A — > B. there is an exact sequence ker / —»■ A —4 B. If 
/ is injective, then ker / = {0}. 

(ii) The kernel of C —> 0 is C. so that exactness gives im g — C, and so g is surjective. 

g 

Conversely, given g. B —»• C, there is an exact sequence B —> C -* C/ im g (see 
Exercise 7.13). If g is suijective, then C = im g and C/img = {0}. 

h 

(iii) Part (i) shows that h is injective if and only if 0 —A —^ B is exact; part (ii) shows 

h 

that h is surjective if and only if A —> B —> 0 is exact. Therefore, h is an isomorphism if 

h 

and only if the sequence 0 —> A —> B ^ 0 is exact. • 

We can restate the isomorphism theorems in the language of exact sequences. 
Definition. A short exact sequence is an exact sequence of the form 

0->a4b4c->0. 

We also call this short exact sequence an extension of A by C. 

Some authors call this an extension of C by A; some authors say that the middle module 
B is an extension. 


Proposition 7.21. 

f g 

(i) IfO ->A-+B-+C—>0isa short exact sequence, then 

A = im/ and B/imf = C. 

(ii) IfT C S C M is a tower of submodules, then there is an exact sequence 

0 -» S/T 4 M/T 4 M/S -* 0. 


Proof, (i) Since / is injective, it is an isomorphism A —>■ im/. The first isomorphism 
theorem gives B / kcry = i m g. By exactness, however, ker g = im / and i m g = C; 
therefore, B / im / = C. 

(ii) This is just a restatement of the third isomorphism theorem. Define / : S/T —> M/T 
to be the inclusion, and define g: M/T —» M/S be “coset enlargement:” g: m + T 
m + S. As in the proof of Theorem 7.10, g is suijective, and kerg = S/T = im /. • 
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In the special case when A is a submodule of B and /: A —»■ B is the inclusion, then 
exactness ofO—> a4_b 4-C->0 gives B/A = C. 

Definition. A short exact sequence 

0-* A fi -4 C 0 
is split if there exists a map j : C —» B with pj — 1 c- 

Proposition 7.22. If an exact sequence 

o->a4b4c->o 


is split, then B = A U C. 

Remark. Exercise 7.17 on page 441 characterizes split short exact sequences. < 

Proof. We show that B = im i © im j, where j : C —> B satisfies pj = 1 . If h e B. 
then pb e C and b — jpb e ker p, for p(b — jpb ) = pb — pj (pb) — 0 because pj — 1 c- 
By exactness, there is a e A with ia — b — j pb. It follows that B — im i+ im j. It remains 
to prove im i n imy = {0}. If ia — x = jc, then px — pia — 0, because pi — 0, whereas 
px — pjc — c, because pj — 1 c- Therefore, x — jc — 0, and so B = A u C. • 

The converse of the last proposition is not true. Let A = (a), B = (b), and C = ( c) be 
cyclic groups of orders 2, 4, and 2, respectively. If i: A —*■ B is defined by i (a) — 2b and 

p : B -> C is defined by p(b) = c, then 0 —> A — U- B — U- C -* 0 is an exact sequence 
which is not split: im; = (2b) is not even a pure subgroup of B. By Exercise 7.12 on 
page 440, for any abelian group M, there is an exact sequence 

0^a4buM AcuM-^0, 

where i'(a) — (2b, 0) and p'(b, m) — ( c, m), and this sequence does not split either. If 
we choose M = I 4 [jv] U IBM (the direct summands are the polynomial rings over I 4 and 
I 2 , respectively), then A LI (C U M) = (For readers who are familiar with infinite 

direct sums, which we introduce later in this chapter, M is the direct sum of infinitely many 
copies of I 4 LI I 2 .) 

Here is a characterization of noetherian rings using these ideas. 

Proposition 7.23. 

(i) A commutative ring R is noetherian if and only if every submodule of a finitely 
generated R-module M is itself finitely generated 

(ii) If R is a PID and if M can be generated by n elements, then every submodule of M 
can be generated by n or fewer elements. 
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Remark. Proposition 7.23(ii) is not true more generally. For example, if R is not a 
PID, there there is some ideal I that is not principal. Thus, R has one generator while its 
submodule I cannot be generated by one element. -4 

Proof, (i) Assume that every submodule of a finitely generated /?-module is finitely gen¬ 
erated. In particular, every submodule of R , which is a cyclic /^-module and hence finitely 
generated, is finitely generated. But submodules of R are ideals, and so every ideal is 
finitely generated; that is, R is noetherian. 

We prove the converse by induction on n > 1, where M — (x i. ..., x n ). If n — 1, then 
M is cyclic, and so Proposition 7.12 gives M = R/1 for some ideal /. If S C M. then 
the correspondence theorem gives an ideal J with I C / C R and S = J/I. But R is 
noetherian, so that /, and hence J/I, is finitely generated 

If n > 1 and M = {x \, ..., x n , v„+i}, consider the exact sequence 

0 M' -U M -A M" -> 0, 

where M' = {x\,..., x„), M" = M/M', i is the inclusion, and p is the natural map. Note 
that M" is cyclic, being generated by x n +\ + M'. If S C M is a submodule, there is an 
exact sequence 

shm' -+ s -> s/(s n m ') -> o. 

Now SDM’ C M', and hence it is finitely generated, by the inductive hypothesis. Further¬ 
more, S/(S D M') = (S + M')/M' C M/M', so that S/(S fl M') is finitely generated, by 
the base step. Using Exercise 7.15 on page 441, we conclude that S is finitely generated 

(ii) We prove the statement by induction on n > 1. If M is cyclic, then M = R/I\ if 
Sell, then S = J /1 for some ideal J in R containing 1. Since R is a PID, J is principal, 
and so J/I is cyclic. 

For the inductive step, we refer to the exact sequence 

o s/(s nM')-> o 

in part (i), where M = (x\, ... ,x n , x„+i) and M' — (x\,..., x n ). By the inductive 
hypothesis, XflM' can be generated by n or fewer elements, while the base step shows 
that S/(S fl M') is cyclic. Exercise 7.15 on page 441 shows that S can be generated by 
n + 1 or fewer elements. • 

The next proposition, whose proof uses Proposition 7.23(h), shows that the sum and 
product of algebraic integers are themselves algebraic integers. If a and f> are algebraic 
integers, it is not too difficult to give monic polynomials having a + f and af> as roots, but 
it takes a bit of work to find such polynomials having all coefficients in Z (see Pollard, The 
Theory of Algebraic Numbers, page 33). 

Proposition 7.24. Let a e C and define Z[a] = {g(a): g(x) e Z[x]}. 

(i) Z[a\ is a subring ofC. 
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(ii) A complex number a is an algebraic integer if and only ifZ[a] is a finitely generated 
additive abelian group. 

(iii) The set of all the algebraic integers is a subring ofC. 

Proof, (i) Since 1 = g(a), where g(x) = 1 is a constant polynomial, we have 1 e Z[a]. 
If f(a),g(a) e Z[a], then so is f(a) + g(a ) = h(a), where h(x ) = fix) + g(x). 
Similarly, fia)g(a) e Z[a], and so Z[a\ is a subring of C. 

(ii) If a is an algebraic integer, there is a monic polynomial fix) e Z[x\ having a as 
a root. We claim that if deg(/) = n, then Z[a\ = G, where G is the set of all linear 
combinations mo + mice + • • • + m„-\a n ~ x with m; e Z. Clearly, G C Z[a], For the 
reverse inclusion, each element u e Z[a] has the form u — g(a), where g(x) e Z[x], 
Since fix) is monic, the division algorithm (Corollary 3.22) gives q{x), r(x) e Z[x] with 
g(x ) = q(x)fix) + r(x), where either r(x) — 0 or deg(r) < deg(/) = n. Therefore, 

u = g(a) — q(a)f(a) + r(a) — r(a ) e G. 

Thus, the additive group of Z[a] is finitely generated. 

Conversely, if the additive group of the commutative ring Z[a] is finitely generated, 
that is, Z[a] = (gi,..., g m ) as an abelian group, then each gj is a Z-linear combination of 
powers of a. Let m be the largest power of a occurring in any of these g’s. Since Z[a] is a 
commutative ring, a"’ +l e Z[a ]; hence, a m+1 can be expressed as a Z-linear combination 
of smaller powers of a; say, a m+l = Yl'lLo^' 01 ' > where bj e Z. Therefore, a is a root of 
fix) — x"' +1 — Yl'iL o bjX 1 , which is a monic polynomial in Z[x], and so a is an algebraic 
integer. 

(iii) Suppose that a and ft are algebraic integers; let a be a root of a monic fix) e Z[x ] 
of degree n, and let f be a root of a monic g(x) e Z[x] of degree m. Now Z[af] is an 
additive subgroup of G — (a 1 fi J : 0 < i < n, 0 < j < m ). Since G is finitely generated, so 
is its subgroup Z[aj3], by Proposition 7.23(h), and so af is an algebraic integer. Similarly, 
Z[a + ft] is an additive subgroup of [a 1 f) ] : i + j < n + m — l|, and so a + f> is also an 
algebraic integer. • 

This last theorem gives a technique for proving that an integer a is a divisor of an 
integer b. If we can prove that b/a is an algebraic integer, then it must be an integer, for it 
is obviously rational. This will actually be used in Chapter 8 to prove that the degrees of 
the irreducible characters of a finite group G are divisors of |G|. 

Exercises 

7.1 Let R be a commutative ring. Call an (additive) abelian group M an almost R-module if there 
is a function R x M —y M satisfying all the axioms of an ILmodule except axiom (iv): We 
do not assume that 1 m = m for all m e M. 

Prove that 


M = M\ © Mo, 
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where 

M\ = [m e M: bn — m) and Mq = (m e M: rm = 0 for all r e R) 
are subgroups of M that are almost R-modules; in fact, M\ is an R-module. 

7.2 If X is a subset of a module M, prove that ( X ), the submodule of M generated by X, is equal 
to P| 5. where the intersection ranges over all those submodules SQM containing X. 

7.3 Prove that if /: M —»■ N is an R-map and K is a submodule of M with K Q ker /, then / 

induces an R-map f: M/K -*■ N by /: m + K i-*- /(m). 

7.4 Let R be a commutative ring and let J be an ideal in R. Recall that if M is an R-module, 
then JM = {JL y,■/«,-: y' ; - e 7 and m, e M} is a submodule of M. Prove that M/JM is an 
R/7-module if we define scalar multiplication: 

(r + J)(m + J M) = rm + J M. 

Conclude that if 7M = (0). then M itself is an R/7-module; in particular, if J is a maximal 
ideal in R and JM = (0), then M is a vector space over R/7. 

7.5 For every R-module M, prove that there is an R-isomorphism 

(fM : Hom^(R, Af) —>■ M, 

given by ip M : / /(l). 

7.6 Let F = ^" = | (Z) ; ) be a direct sum of R-modules, where fj: R —>■ (i, ), given by r i->- rbj, 

is an isomorphism. Prove that if M is a maximal ideal in R, then the cosets {bj + MF: i = 

1,...,«} form a basis of the vector space F/MF over the field R/M. (See Exercise 7.4.) 

7.7 Let R and S be commutative rings, and let <p: R - 4 - 5 be a ring homomorphism. If M is an 
5-module, prove that M is also an R-module if we define 

rm = (p(r)m , 

for all r e R and m e M. 

7.8 Let M = 5i U ■ • • U 5 n be a direct sum of R-modules. If 7) C 5/ for all /, prove that 

(5i U • • • U Sn)l<J x U • • • U T„) = (Si/ro U • • • U (Sn/T„). 

7.9 Let R be a commutative ring and let M be a nonzero R-module. If m e M . define ord(m) = 
{?■ € R : rm = 0}, and define JF = jord(m) : m e M and m 7 ^ 0). Prove that every maximal 
element in T is a prime ideal. 

f g 

7.10 Let A -> B —> C be a sequence of module maps. Prove that gf = 0 if and only if irn / C 
kerg. Give an example of such a sequence that is not exact. 

7.11 If 0 -*■ M —>• 0 is an exact sequence, prove that M = (0). 

7.12 Let 0 —^ A —^ B —^ C —^ 0 be a short exact sequence of modules. If M is any module, prove 
that there are exact sequences 

0 -^A©M^S©M-»C ->0 


and 
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Definition. If /: M —y N is a map, define its cokernel, denoted by coker/, as 

coker/ = N / im /. 

7.13 (i) Prove that a map /: M —> N is surjective if and only if coker/ = {0}. 

(ii) If /: M -*■ N is a map, prove that there is an exact sequence 

0 -*■ ker / -*■ M - 4 - IV -*■ coker/ -*■ 0. 

f h 

7.14 If A —> B —>■ C —> D is an exact sequence, prove that / is surjective if and only if h is 
injective. 

7.15 Let 0 —^ A B -4 C —^ 0 be a short exact sequence. 

(i) Assume that A — {X ) and C = (Y). For each y e Y. choose y' e B with ply') = y. 
Prove that 

B = (i(X)U{y':yeY}). 

(ii) Prove that if both A and C are finitely generated, then B is finitely generated More 
precisely, prove that if A can be generated by m elements and if C can be generated by 
n elements, then B can be generated by m + n elements. 

7.16 Prove that every short exact sequence of vector spaces is split. 

7.17 Prove that a short exact sequence 

o-^a4s4c^o 


splits if and only if there exists q : B -*■ A with qi = 1^. 

7.18 (i) Prove that a map <p: B -¥ C is injective if and only if <p can be canceled from the left; 

that is, for all modules A and all maps /, g: A —> B, we have (pf = <pg implies / = g. 


f 


B 


C 


(ii) Prove that a f^-map <p: B — > C is surjective if and only if <p can be canceled from the 
right; that is, for all /^-modules D and all f?-maps h,k\ C —> D, we have lup = k(p 
implies h = k. 

w h 

B -4- C =1 D 
k 

7.19 ( Eilenberg-Moore) Let G be a (possibly nonabelian) group. 

(i) If H is a proper subgroup of a group G, prove that there exists a group L and distinct 
homomotphisms /, g: G —>■ L with f\H = g\H. 

Hint. Define L = Sx, where X denotes the family of all the left cosets of H in G 
together with an additional element, denoted oo. If a e G. define f(a) = f a 6 Sx by 
fa(oo) = oo and f a (bH) = abH. Define g: G —> Sx by g = y o /, where y e Sx is 
conjugation by the transposition ( H. oo). 

(ii) If A and G are groups, prove that a homomorphism <p: A —> G is surjective if and only 
if i p can be canceled from the right; that is, for all groups L and all maps /, g: G —> L, 
we have fip — gcp implies / = g. 
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7.2 Categories 


Imagine a set theory whose primitive terms, instead of set and element , are set and function. 
How could we define bijection, cartesian product, union, and intersection? Category theory 
will force us to think in this way. Now categories are the context for discussing general 
properties of systems such as groups, rings, vector spaces, modules, sets, and topological 
spaces, in tandem with their respective transformations: homomorphisms, functions, and 
continuous maps. There are two basic reasons for studying categories: The first is that they 
are needed to define functors and natural transformations (which we will do in the next 
sections); the other is that categories will force us to regard a module, for example, not in 
isolation, but in a context serving to relate it to all other modules (for example, we will 
define certain modules as solutions to universal mapping problems). 

There are well-known set-theoretic “paradoxes” that show that contradictions arise if 
we are not careful about how the undefined terms set and element are used. For example, 
Russell’s paradox shows how we can run into trouble by regarding every collection as a 
set. Define a Russell set to be a set S that is not a member of itself; that is, S f S. If R 
is the family of all Russell sets, is R a Russell set? On the one hand, if R e R, then R is 
not a Russell set; as only Russell sets are members of R , we must have R f R. and this 
is a contradiction. On the other hand, if we assume that R R, then R is a Russell set, 
and so it belongs to R (which contains every Russell set); again, we have a contradiction. 
We conclude that we must impose some conditions on what collections are allowed to be 
sets (and also some conditions on the membership relation e). One way to avoid such 
problems is to axiomatize set theory by considering class as a primitive term instead of 
set. The axioms give the existence of finite classes and of N; they also provide rules for 
constructing special classes from given ones, and any class constructed according to these 
rules is called a set. Cardinality can be defined, and there is a theorem that a class is a set 
if and only if it is “small”; that is, it has a cardinal number. A proper class is defined to be 
a class that is not a set. For example, N, Z, Q, R, and C are sets, while the collection of 
all sets is a proper class. Paradoxes are avoided by decreeing that some rules apply only to 
sets but not to proper classes. 

Definition. A category C consists of three ingredients: a class obj(C) of objects , a set 
of morphisms Hom(A, B) for every ordered pair (A, B) of objects, and composition 
Hom(A, B) x Hom(R, C) -> Hom(A, C), denoted by 

(/, g) I-+ gf, 

for every ordered triple A, B. C of objects. [We often write f:A—> fi or A 4 B to 
denote / e Hom(A, B ).\ These ingredients are subject to the following axioms: 

(i) the Horn sets are pairwise disjoint; 5 that is, each morphism has a unique domain and 
a unique target; 

5 One can force pairwise disjointness by labeling morphisms / e Hom(A, B) by a/b- 
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(ii) for each object A, there is an identity morphism l a € Hom(A, A) such that 
fl A = f and 1 B f = f for all /: A B; 


(iii) composition is associative: Given morphisms 


B 


C A D, 


then 

h(gf) = ( hg)f. 

The important notion, in this circle of ideas, is not category but functor, which will 
be introduced in the next section. Categories are necessary because they are an essential 
ingredient in the definition of functor. A similar situation occurs in linear algebra: Linear 
transformation is the important notion, but we must first consider vector spaces in order to 
define it. 

The following examples will explain certain fine points of the definition of category. 

Example 7.25. 

(i) C = Sets. 

The objects in this category are sets (not proper classes), morphisms are functions, and 
composition is the usual composition of functions. 

A standard result of set theory is that if A and B are sets, then Hom(A, B), the class of 
all functions from A to B, is a set. That Horn sets are pairwise disjoint is just the reflection 
of the definition of equality of functions given in Chapter 1: In order that two functions be 
equal, they must, first, have the same domains and the same targets (and, of course, they 
must have the same graphs). 

(ii) C — Groups. 

Here, objects are groups, morphisms are homomorphisms, and composition is the usual 
composition (homomorphisms are functions). 

(iii) C — CommRings. 

Here, objects are commutative rings, morphisms are ring homomorphisms, and compo¬ 
sition is the usual composition. 

(iv) C = *Mod. 6 

The objects in this category are R -modules, where R is a commutative ring, morphisms 
are R -homomorphisms, and composition is the usual composition. We denote the sets 
Hom(A, B) in ^Mod by 

Hom^(A, B). 

If R = Z, then we often write 

zMod — Ab 

to remind ourselves that Z-modules are just abelian groups. 

6 When we introduce noncommutative rings in the Chapter 8, then we will denote the category of left R- 
modules by /?Mod and the category of right /?-modules by Mod/?. 
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(v) C = PO(X). 

If X is a partially ordered set, regard it as a category whose objects are the elements of 
X , whose Horn sets are either empty or have only one element: 


Hom(x, y) 


10 if x £ y 
I Ky if x < y 


(the symbol k* denotes the unique element in the Horn set when x < y) and whose com¬ 
position is given by 

KiK* = K*. 

Note that \ x = k x , by reflexivity, while composition makes sense because < is transitive . 7 

We insisted, in the definition of category, that Hom(A, II) be a set, but we left open the 
possibility that it be empty. The category PO(X) is an example in which this possibility 
occurs. [Not every Horn set in a category C can be empty, for Hom(A, A) ^ 0 for every 
object A e C because it contains the identity morphism 1 a-] 

(vi) C — C(G). 

If G is a group, then the following description defines a category C(G): There is only 
one object, denoted by *, Hom(*, *) = G, and composition 


Hom(*, *) x Hom(*, *) —> Hom(*, *); 


that is, G x G —> G, is the given multiplication in G. We leave verification of the axioms 
to the reader . 8 

The category C(G) has an unusual property. Since * is merely an object, not a set, there 
are no functions * —> * defined on it; thus, morphisms here are not functions. Another 
curious property of this category is another consequence of there being only one object: 
there are no proper subobjects here. 

(vii) There are many interesting nonalgebraic examples of categories. For example, C = 
Top, the category with objects all topological spaces, morphisms all continuous functions, 
and usual composition. ◄ 


Here is how to translate isomorphism into categorical language. 


Definition. A morphism /: A -> B in a category C is an equivalence (or an isomor¬ 
phism) if there exists a morphism g: B —> A in C with 

gf = 1 A and fg = \ B . 

The morphism g is called the inverse of /. 

7 A nonempty set X is called quasi-ordered if it has a relation x < y that is reflexive and transitive (if, in 
addition, this relation is antisymmetric, then X is partially ordered). PO(X) is a category for every quasi-ordered 
set. 

8 That every element in G have an inverse is not needed to prove that C (G) is a category, and C (G) is a category 
for every monoid G. 
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It is easy to see that an inverse of an equivalence is unique. 

Identity morphisms in a category are always equivalences. If C — PO( A), where X is 
a partially ordered set, then the only equivalences are identities; if C = C(G), where G 
is a group (see Example 7.25(vi)), then every morphism is an equivalence. If C — Sets, 
then equivalences are bijections; if C — Groups, C — a*M od, or C — CommRings, then 
equivalences are isomorphisms; if C — Top, then equivalences are homeomorphisms. 

Let us give a name to a feature of the category a Mod (which we saw in Proposition 7.5) 
that is not shared by more general categories: Homomorphisms can be added. 

Definition. A category C is pre-additive if every Hom(A, B) is equipped with a binary 
operation making it an (additive) abelian group for which the distributive laws hold: for all 
/,geHom(A,fi), 

(i) if p : B —>■ B', then 

p(f + g) = pf + pg e Hom(A, B’)\ 

(ii) if q: A' —*■ A, then 

(/ + g)q = fq + gq e Hom(A', B). 

In Exercise 7.22 on page 458, it is shown that Groups does not have the structure of a 
pre-additive category. 

A category is defined in terms of objects and morphisms; its objects need not be sets, 
and its morphisms need not be functions [ C(G ) in Example 7.25(vi) is such a category]. 
We now give ourselves the exercise of trying to describe various constructions in Sets or 
in a Mod so that they make sense in arbitrary categories. 

In Proposition 7.15(iii), we gave the following characterization of direct sum M = 
A © B: there are homomorphisms p: M —> A, q : M —> B. i: A —>• M. and j: B M 
such that 

pi = 1 A, qj = 1 b, pj = 0, qi — 0 and ip + jq — l M - 

Even though this description of direct sum is phrased in terms of arrows, it is not general 
enough to make sense in every category; it makes use of a property of the category a Mod 
that is not enjoyed by the category Sets, for example: Morphisms can be added. 

In Corollary 7.17, we gave another description of direct sum in terms of arrows: 

There is a map p: M —>• S with ps — s\ moreover, kerp = imy, imp = imi, and 
p(s) — s for every s e imp. 

This description makes sense in Sets, but it does not make sense in arbitrary categories 
because the image of a morphism may fail to be defined. For example, the morphisms in 
C(G) [see Example 7.25(vi)] are elements in Hom(*, *) = G, not functions, and so the 
image of a morphism has no obvious meaning. 

However, we can define direct summand categorically: An object S is (equivalent to) a 
retract of an object M if there exist morphisms 

i: S —> M and p : M —> S 
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for which pi = 1 s and (ip) 2 = ip (for modules, define p = ip). 

One of the nice aspects of thinking in a categorical way is that it enables us to see 
analogies that might not have been recognized before. For example, we shall soon see that 
direct sum in p Mod is the same notion as disjoint union in Sets. 

We begin with a very formal definition. 

Definition. A diagram in a category C is a directed multigraph 9 whose vertices are objects 
in C and whose arrows are morphisms in C. 

For example. 


X 



k 


is a diagram in a category, as is 



If we think of an arrow as a “one-way street,” then a path in a diagram is a “walk” from 
one vertex to another taking care never to walk the wrong way. A path in a diagram may 
be regarded as a composite of morphisms. 

Definition. A diagram commutes if, for each pair of vertices A and B , any two paths 
from A to B are equal; that is, the composites are the same morphism. 

For example, the triangular diagram above commutes if gf = h and kf — h, and the 
square diagram above commutes if gf = f'g'■ The term commutes in this context arises 
from this last example. 

If A and B are subsets of a set .S', then their intersection is defined: 

AnB — {seS:seA and s e B} 

(if two sets are not given as subsets, then their intersection may not be what one expects: 
for example, if Q is defined as all equivalence classes of ordered pairs (m, n) of integers 
with n ^ 0, then Z fl Q = 0). 

We can force two overlapping subsets A and B to be disjoint by “disjointifying” them. 
Consider the cartesian product (A U B) x {1,2}, and consider the subsets A' — A x {1} 
and B' = B x {2}. It is plain that A' fl B' — 0, for a point in the intersection would have 
coordinates (a, 1) = (b. 2); this cannot be, for their second coordinates are not equal. We 

9 A directed multigraph consists of a set V, called vertices and. for each ordered pair (u. v) e V x V, a 
(possibly empty) set arr(ii, v). called arrows from u to v. 
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call Aft U B’ the disjoint union of A and B. Let us take note of the functions or. A A' 
and ft: B -* B\ given by a: a i-> (a, 1) and ft : b i—> (b, 2). We denote the disjoint union 
A' U B' by A u B. 

If there are functions /: A —> X and g: B —> X. for some set X , then there is a unique 
function h: A u B —> X given by 

|,g(w) if m € B. 

The function h is well-defined because A and B are disjoint. 

Here is a way to describe this construction categorically (i.e., with diagrams). 

Definition. If A and B are objects in a category C, then their coproduct , denoted by 
,4 U B. is an object C in obj(C) together with injection morphisms or. A -> A U B 
and ft: B — > 4 U B. such that, for every object X in C and every pair of morphisms 
f: A X and g: B —> X, there exists a unique morphism 9: A u B -* X making the 
following diagram commute (i.e., 9a = f and 9ft = g). 


A 



Here is the formal proof that the set A u B — A'VJB' C (A U B) x {1,2} just constructed 
is the coproduct in Sets. If X is any set and if /: A —> X and g: B —>• X are any given 
functions, then there is a function 9 : A u B —> X that extends both / and g. If c e A u B. 
then either c — (a, 1) e 4' or c — (b. 2) e B'. Define 9{(a, 1)) = f (a) and define 
9{{b , 2)) = g(b), so that 9a = f and 9ft — g. Let us show that 9 is the unique function 
on A u B extending both / and g. If xft: A u B —> X satisfies jra — f and xftft = g , then 

f{a{a)) = f({a, 1)) = f{a) = 9((a, 1)) 


and, similarly, 

M(.b,2)) = g(b). 

Therefore, \ft agrees with 9 on Aft U B' = A u B , and so i ft — 9. 

We do not assert that coproducts always exist; in fact, it is easy to construct examples 
of categories in which a pair of objects does not have a coproduct (see Exercise 7.21 on 
page 458). Our argument, however, shows that coproducts do exist in Sets, where they 
are disjoint unions. Coproducts exist in the category of groups, and they are called free 
products ; free groups turn out to be free products of infinite cyclic groups (analogous to 
free abelian groups being direct sums of infinite cyclic groups). A theorem of A. G. Kurosh 
states that every subgroup of a free product is itself a free product. 
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Proposition 7.26. If A and B are R-modules, then their coproduct in a* Mod exists, and 
it is the direct sum C — A U B. 

Proof. The statement of the proposition is not complete, for a coproduct requires injection 
morphisms a and ft. The underlying set of C — A u B is the cartesian product A x B, and 
so we may define a: A -> C by a: a i->- (a, 0) and ft: B -> C by ft: b i->- (0, b). 

Now let X be a module, and let /: A —» X and g: B —> X be homomorphisms. Define 
6: C —> X by 6: (a, b) i —> f(a) + g(b). First, the diagram commutes: If a e A, then 
Oaifl) — 9({a, 0)) = f (a) and, similarly, if b e B, then 9ft(b) — 9((0,b)) = g(b). 
Finally, 6 is unique. If xft: C -> X makes the diagram commute, then jr((a, 0)) = f (a) 
for all a e A and xf ((0, b)) — g(b) for all b e B. Since \[r is a homomorphism, we have 

i K(a,b)) = xft((a,0) + (0,b)) 

= if {(a, 0)) + VK(0, b)) = f (a) + g(b). 


Therefore, i ft — 9. • 

Let us give the explicit formula for the map 9 in the proof of Proposition 7.26. If 
/: A —» X and g: B —> X are the given homomorphisms, then 0: A © B —>• X is given 
by 

9: ( a,b ) f(a) + g(b). 

The outline of the proof of the next proposition will be used frequently; we have already 
seen it in our proof of Lemma 5.74, when we proved that the rank of a nonabelian free 
group is well-defined. 


Proposition 7.27. If C is a category and if A and B are objects in C, then any two 
coproducts of A and B, should they exist, are equivalent. 

Proof. Suppose that C and D are coproducts of A and B. In more detail, assume that 
a: A->C,ft: B->C,y:A-+D, and 8: B —»• D are injection morphisms. If, in the 
defining diagram for C, we take X — D, then there is a morphism 9: C -> I) making the 
diagram commute. 



B 


Similarly, if, in the defining diagram for D , we take X = C, we obtain a morphism 
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\[r: D —»■ C making the diagram commute. 


A 



Consider now the following diagram, which arises from the juxtaposition of these two 
diagrams. 


A 



This diagram commutes because f9a = fy — a and i JrQji — \[rS = ft. But plainly, the 
identity morphism lc : C —> C also makes this diagram commute. By the uniqueness of 
the dotted arrow in the defining diagram for coproduct, x[r9 — 1 c- The same argument, 
mutatis mutandis, shows that 9 \jr — 1We conclude that 9 : C -» D is an equivalence. • 

Informally, an object S in a category C is called a solution to a universal mapping 
problem if it is defined by a diagram such that, whenever we vary an object X and various 
morphisms in the diagram, there exists a unique morphism making the new diagram com¬ 
mute. The “metatheorem” is that solutions, if they exist, are unique to unique equivalence. 
The proof just given is the prototype for proving the metatheorem fif we wax categorical, 
then the statement of the metatheorem can be made precise, and we can then prove it; see 
Exercise 7.29 on page 459 for an illustration, and see Mac Lane, Categories for the Work¬ 
ing Mathematician , Chapter III, for appropriate definitions, statement, and proof). There 
are two steps. First, if C and D are solutions, get morphisms 9: C —> I) and i// : /7 —> C 
by setting X — D in the diagram showing that C is a solution, and by setting X — C in the 
corresponding diagram showing that D is a solution. Second, set X = C in the diagram for 
C and show that both f9 and lc are “dotted” morphisms making the diagram commute; as 
such a dotted morphism is unique, conclude that i j/9 = 1 c- Similarly, the other composite 
0 i/r = 1 D , and so 9 is an equivalence. 

Definition. If A and B are objects in a category C, then their product , denoted by A n B. 
is an object P e C and morphisms p: P —*■ A and q\ P —> B, such that, for every object 
X e C and every pair of morphisms /: X —> A and g: X -* B, there exists a unique 
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morphism 9: X -> P making the following diagram commute: 



B 


The cartesian product P — A x B of two sets A and B is the categorical product in Sets. 
Define p: A x B -> A by p : (a, b) a and define q : A x B -> B by q : (a, b) b. 
If X is a set and /: X —> A and g: X —*■ B are functions, then the reader may show 
that 9: X -> A x B, defined by 9: x i-> (/(x),g(x)) e A x B, satisfies the necessary 
conditions. 


Proposition 7.28. If A and B are objects in a category C, then any two products of A and 
B, should they exist, are equivalent. 

Proof Adapt the proof of the prototype. Proposition 7.27 • 

The reader should note that the defining diagram for product is obtained from the dia¬ 
gram for coproduct by reversing all the arrows. A similar reversal of arrows can be seen in 
Exercise 7.18 on page 441: The diagram characterizing a surjection in ^Mod is obtained 
by reversing all the arrows in the diagram that characterizes an injection. If S is a solution 
to a universal mapping problem posed by a diagram V, let V be the diagram obtained 
from V by reversing all its arrows. If S' is a solution to the universal mapping problem 
posed by V, then we call S and S' duals. There are examples of categories in which an 
object and its dual object both exist, and there are examples in which an object exists but 
its dual does not. 

What is the product of two modules? 

Proposition 7.29. If R is a commutative ring and A and B are R-modules, then their 
(categorical) product An B exists; in fact, 

A n B = A u B. 


Remark. Thus, the product and coproduct of two objects, though distinct in Sets, coin¬ 
cide in flMod. ◄ 


Proof. In Proposition 7.15(iii), we characterized M = ,4 U /i by the existence of projec¬ 
tion and injection morphisms 

i 1 

A ?± M B 
P i 


satisfying the equations 


pi = \ A , qj = 1 b, pj = 0, qi = 0 and ip + jq = \ M . 
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If X is a module and /: X —> A and g\ X —> B are homomorphisms, define 9 : X 
AuBby 9(x) = if (pc) + jg(x). The product diagram 



B 


commutes because, for all x e X. 

p9(x) = pi fix) + pjg(x) = pi fix) = f(x) 

(using the given equations) and, similarly, q9(x) = g(x). To prove uniqueness of 9, note 
that the equation ip + jq — IauB gives 

i //= ipf + jqf = if + jg = 9. • 


Exercise 7.23 on page 458 shows that direct products are products in Groups. 

There are (at least) two ways to extend the notion of direct sum of modules from two 
summands to an indexed family of summands. 

Definition. Let R be a commutative ring and let {A; : i e 1} be an indexed family of R- 
modules. The direct product ]”[,£/ 4, is the cartesian product [i.e., the set of all /-tuples 10 
(i ai) whose ;th coordinate a; lies in A, for every ;'] with coordinatewise addition and scalar 
multiplication: 

(at) + (bi) = (di + bi) 
r(ai) = (rat), 

where r e R and at, bi e A, for all i. 

The direct sum , denoted by A, (and also by © )S/ A,), is the submodule of 
]”[,£/ -A, consisting of all (a,-) having only finitely many nonzero coordinates. 

Each m e A, has a unique expression of the form 

m = Y, o/j (a), 
iel 

where a,- e A/. a,(a) is the /-tuple in P| ( A, whose /th coordinate is a,- and all other 
coordinates are 0, and almost all «; = 0; that is, only finitely many a, can be nonzero. 

Note that if the index set I is finite, then n,e/ A/ = fhri A,. On the other hand, when 
I is infinite and infinitely many A, f 0, then the direct sum is a proper submodule of the 
direct product (moreover, in this case, they are almost never isomorphic). 

We now extend the definitions of coproduct and product to a family of objects. 

10 An /-tuple is a function /: / -> (Ji A, with f(i) € A; for all i 6 I. 
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Definition. Let C be a category, and let {A; : i e 1} be a family of objects in C in¬ 
dexed by a set I. A coproduct is an ordered pair (C, {a,: A, —» C}), consisting of an 
object C = |_|/e/ A,- and a family {a;: A/ —»• I— l/€/ ^ for all i e 1} of injection mor- 
phisms, that satisfies the following property. For every object A equipped with morphisms 
fi: Ai —> X, there exists a unique morphism 9: |_| (g/ A,- -> X making the following 
diagram commute for each i : 



As usual, coproducts are unique to equivalence should they exist. 

We sketch the existence of the disjoint union of sets {A, : i e I}. First form the set 
B — (U/e/ A i ) x I, and then define 

A- = {(a/, 0 e B : w e A,}. 

Then the disjoint union is Ll/s/ A i — U/s/ A\ (of course, the disjoint union of two sets 
is a special case of this construction). The reader may show that |_|, A/ together with the 
functions or,-: A, —> |_| ( A,- given by a,- : a,- i-> (a,-, i) e |_| ( A,-, comprise the coproduct in 
Sets; that is, we have described a solution to the universal mapping problem. 

Proposition 7.30. If {A, : i e 1} is a family of R-modules, then the direct sum ff l(: i A/ 
is their coproduct in /{Mod. 

Proof The statement of the proposition is not complete, for a coproduct requires injection 
morphisms a,-. Denote ff ic j A, by C, and define cy, : A/ —> C by a; a, (a) as follows: 
If ai e A{ , then «; (a ) e C is the /-tuple whose / th coordinate is a/ and whose other 
coordinates are zero. 

Now let A be a module and, for each i e I, let /,: A/ —> X be homomorphisms. Define 
9: C —>• A by 9: (a,) i->- f(aj) (note that this makes sense, for only finitely many 
a, ’s are nonzero). First, the diagram commutes; If/// e A,, then 0a/ (//,) = fiifli). Finally, 
9 is unique. If jr: C -> A makes the diagram commute, then i//((a;)) = fiifli). Since i/r 
is a homomorphism, we have 

f((fli)) = f{^2,ai{ai)) 
i 

= ^ faf Ui) = 22 fi(Pi )• 
i i 

Therefore, i Jr — 9. • 

Let us make the formula for 9 explicit. If //: A, —»• A are given homomorphisms, then 
J2isi A i ^ is given by 

0 : (a/) ^ //(a/) 

iel 




Sec. 7.2 Categories 


453 


[of course, almost all the u, = 0, so that there are only finitely many nonzero terms in the 

sum Eiel /<(«<)]• 

Here is the dual notion. 

Definition. Let C be a category, and let [A, : i e 1} be a family of objects in C indexed 
by a set I. A product is an ordered pair (C, {pt : C A/}), consisting of an object 
|—| ;S /A, and a family { /;,■: C -> A, for all i e 1} of projection morphisms, that satisfies 
the following condition. For every object X equipped with morphisms /;: X -> A/, there 
exists a unique morphism 9: X -> | | ; g / A , making the following diagram commute for 
each i: 



UieiAi < 0 X 

Products are unique to equivalence should they exist. 

We let the reader prove that cartesian product is the product in Sets. 

Proposition 7.31. //{A,- : i e 1} is a family of R-modules, then the direct product 

C — n, e/ A, is their product in r Mod. 

Proof The statement of the proposition is not complete, for a product requires projec¬ 
tions. For each j e /, define pj: C -> A,- by pj \ (a,) m* ay e Ay. 

Now let A be a module and, for each i e /, let f ): X —> A/ be a homomorphism. 
Define 0: X —*■ C by O', x i->- (fix)). First, the diagram commutes: If x e X, then 
Pi0(x) — fi(x). Finally, 9 is unique. If f : X —> C makes the diagram commute, then 
Pijr(x) — fi(ai) for all i ; that is, for each i, the ;th coordinate of f(x) is f (x), which is 
also the ;th coordinate of 9(x). Therefore, fix) = 9(x) for all x e X, and so f — 9. • 

The categorical viewpoint makes the next two proofs straightforward. 

Theorem 7.32. Let R be a commutative ring. For every R-module A and every family 
{Bj : i e 1} of R-modules, 

Horn* (A, I] Bi) = nHom*(A, B;), 
iel iel 


via the R-isomorphism 


<P- f i-> iPif), 


where the p, are the projections of the product Bi. 

Proof. It is easy to see that ip is additive. To see that qj is an /Nmap, note, for each i and 
each r e R, that p,rf — rp, f \ therefore. 


(P-rf i-» iPtrf) = ( rpif) = r(pif) = np(f). 
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Let us see that <p is surjective. If (/,) e ]~[ Horn/A A. Bj), then f ): A —>• Bj for every i. 


Bi 



n Bi < 



a 


By Proposition 7.31, fl B, is the product in « Mod, and so there is a unique B-map 0: 4 —> 
Q B, with = ft for all i. Thus, (/,) = ^(0) and cp is surjective. 

To see that q> is injective, suppose that / e ker <p\ that is, 0 = <p(f ) = ( p,f ). Thus, 
Pif — 0 for every i. Hence, the following diagram containing / commutes: 


Bi 



Y\Bi 



But the zero homomorphism also makes this diagram commute, and so the uniqueness of 
the arrow A n Bi gives / = 0. • 


Theorem 7.33. For every R-module B and every family {A; : i e 1} of R-modules, 

Hom R (j2 At, B^j = f]Horn*(A/, B), 
iel iel 

via the R-isomorphism 

f h * (fat), 

where the a, are the injections of the sum A/. 

Proof This proof is similar to that of Theorem 7.32, and it is left to the reader. • 

There are examples showing that Hom^(A, Bi) Horn/A A. B,) and that 

Hom^dl, Ai, B) ? fl, Hom*(A;, B). 

Corollary 7.34. If A, A', B, and B' are R-modules. then there are isomorphisms 
Hom«(A, B U B') = Hom«(A, B) u Uom R (A, B') 

and 

Hom^(A U A', B) = Hom K (A, 6) u Hom^lA', B). 

Proof When the index set is finite, the direct sum and the direct product of modules are 
equal. • 
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Example 7.35. 

(i) In Example 7.6, we defined the dual space V* of a vector space V over a field k to be 
the vector space of all its linear functionals: 

V* =Hom k (V,k). 

If dim(V) — n < oo, then Example 5.6 shows that V = V\ © • • • © V n , where each 
Vj is one-dimensional. By Corollary 7.34, V* = Hom^( Vj, k) is a direct sum of n 
one-dimensional spaces [for Exercise 7.5 on page 440 gives Hom^f k,k) = k], and so 
Exercise 7.26 on page 458 gives dimfV*) = dim(V) = n. Thus, a finite-dimensional 
vector space and its dual space are isomorphic. It follows that the double dual, V**, defined 
as (V*)*, is isomorphic to V when V is finite-dimensional. 

(ii) There are variations of dual spaces. In functional analysis, one encounters topological 
real vector spaces V, so that it makes sense to speak of continuous linear functionals. The 
topological dual V* consists of all the continuous linear functionals, and it is important to 
know whether a space V is reflexive ; that is, whether the analog of the isomorphism V -> 
V** for finite-dimensional spaces is a homeomorphism for these spaces. For example, that 
Hilbert space is reflexive is one of its important properties. a 

We now present two dual constructions that are often useful. 

Definition. Given two morphisms /: B A and g: C —> A in a category C, a solution 
is an ordered triple (D, a, ft making the following diagram commute: 

D ——>■ C 
I P I s 


Apullback for fiberedproduct) is a solution (Z), a, ft) that is “best” in the following sense: 
For every solution (X,a\ ft), there exists a unique morphism 0: X -» D making the 
following diagram commute: 



Pullbacks, when they exist, are unique to equivalence; the proof is in the same style as 
the proof that coproducts are unique. 



456 


Modules and Categories Ch. 7 


Proposition 7.36. The pullback of two maps f : B —>■ A and g: C —> A in r Mod exists. 
Proof. Define 

D = {(b,c) £ B uC : f(b) = g(c)}, 

define or. D —> C to be the restriction of the projection ib. c) c, and define f>: I) —> B 
to be the restriction of the projection (b, c ) i->- b. It is easy to see that (D , a , f) is a 
solution. 

If (X, of, ft) is another solution, define a map 0 : X — > D by 0 : x i->- (ft (x), of (x )). 
The values of 6 do lie in D. for fft(x) = got' (x) because X is a solution. We let the reader 
prove that the diagram commutes and that 6 is unique. • 


Example 7.37. 

(i) That B and C are subsets of a set A can be restated as saying that there are inclusion 
maps ;: B —> A and /: C —> A. The reader will enjoy proving that the pullback D exists 
in Sets, and that D — B fl C . 

(ii) Pullbacks exist in Groups: They are certain subgroups of a direct product constructed 
as in the proof of Proposition 7.36. 

(iii) If /: B —*■ A is a homomorphism, then ker / is the pullback of the following diagram: 


0 



The pullback is {( b , 0) e B u {0} : fb = 0} = ker /. ◄ 

Here is the dual construction. 


Definition. Given two morphisms /: A -* B and g: A —» C in a category C, a solution 
is an ordered triple (D,a, f) making the following diagram commute: 



C 

P 

D 


A pushout (or fibered sum ) is a solution (D. a, f ) that is “best” in the following sense: 
for every solution (X,a',ft), there exists a unique morphism 0: D -» X making the 
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following diagram commute: 



Again, pushouts are unique to equivalence when they exist. 

Proposition 7.38. The pushout of two maps f : A —> B and g: A —> C in a Mod exists. 
Proof. It is easy to see that 

S — {(f(a), —g(a)) e B u C : a e Aj 

is a submodule of B u C. Define D — (B u C)/S, define a : B —> D by b i-> (b, 0) + S, 
and define /: C —> D by c (0, c) + S. It is easy to see that ( D , a, fi) is a solution. 

Given another solution (X, a\ fi'), define the map 9 : D X by 9: (b, c) + S 
a'(b) + P\c). Again, we let the reader prove commutativity of the diagram and uniqueness 
of (9. • 

Pushouts in Groups are quite interesting; for example, the pushout of two injective 
homomorphisms is called a free product with amalgamation. 

Example 7.39. 

(i) If B and C are subsets of a set A, then there are inclusion maps i: B fl C -> B and 
j: B flC —» B. The reader will enjoy proving that the pushout D exists in Sets, and that 
I) is their union B U C. 

(ii) If /: A —> B is a homomorphism, then coker/ is the pushout of the following diagram: 


Y 

0 

After all, the pushout here is the quotient ({0} U B)/S, where S = {(0, fa)}, and so 
({0} U B)/S = B/imf = coker/. ◄ 
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Exercises 

7.20 (i) Prove, in every category C, that each object A e C has a unique identity motphism. 

(ii) If / is an equivalence in a category, prove that its inverse is unique, 

7.21 (i) Let X be a partially ordered set, and let a,b e X. Show, in PO(X) [defined in Exam¬ 

ple 7.25(v)], that the coproduct aLlb is the least upper bound of a and b, and that the 
product a n b is the greatest lower bound. 

(ii) Let Y be a set, and let V(Y) denote its power set ; that is, V(Y) is the family of all the 
subsets of Y. Now regard V(Y) as a partially ordered set under inclusion. If A and B 
are subsets of Y, show, in PCKPCL)), that the coproduct AuB = A U B and that the 
product An B = An B. 

(iii) Give an example of a category in which there are two objects whose coproduct does not 
exist. 

Hint. See Exercise 6.43 on page 374. 

7.22 Prove that Groups is not a pre-additive category. 

Hint. If G is not abelian and /, g : G —> G are homomorphisms, show that the function 
x f(x)g(x) may not be a homomorphism. 

7.23 If A and B are (not necessarily abelian) groups, prove that An B = A x B (direct product) in 

Groups. 

7.24 If G is a finite abelian group, prove that Hom^fQ, G) = 0. 

7.25 Let [M, : i e 1} be a family of modules and, for each i, let N, be a submodule of Mj. Prove 
that 

(I>i)/(E^) = E( M *-/^)- 

i i i 

7.26 (i) Let tq,..., v n be a basis of a vector space V over a field k, so that every v e V has a 

unique expression 

v = a itq -I-h a n v n , 

where ctj e k for i — 1 , ,n. For each i , prove that the function v* : V -*■ k, defined 

by v* : v a;, lies in the dual space V*. 

(ii) Prove that it*, ..., is a linearly independent list in V*. 

(iii) Use Example 7.35(i) to conclude that v* .t> ; * is a basis of V* (it is called the dual 

basis of tq, ..., v n ). 

(iv) If /: V —> V is a linear transformation, let A be the matrix of / with respect to a basis 
tq, ..., v n of V; that is. the ith column of A consists of the coordinates of /(«,-) in terms 
of the given basis tq, ..., v n . Prove that the matrix of the induced map /* : V* —> V* 
with respect to the dual basis is A 1 , the transpose of A. 

7.27 Given a map o : ]”[ B, —» ]”[ Cj, find a map a making the following diagram commute, 

Hom(A, ]”[ Bj ) —a- Hom(A, ]”[ Cj) 



]”[ Hom(A, Bi) 


o 


>nHom(A,C ; ), 
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where r and x' are the isomorphisms of Theorem 7.32. 

Hint. If / e Hom(A, ]~[ Bj), define o : (/,) (p/erf); that is, the jth coordinate of <?(/,)) 
is the jth coordinate of o(f) e ]~[ Cj- 

7.28 (i) Given a pushout diagram in ^Mod 


A 


g 


C 


¥ 

B 


P 

'■ 



prove that g injective implies o’ injective, and that g surjective implies a surjective. 
Thus, parallel arrows have the same properties. 

(ii) Given a pullback diagram in ^Mod 


D 


a 


c 


n 

B 


g 

>■ 


/ 


A 


prove that / injective implies a injective, and that / surjective implies a surjective. 
Thus, parallel arrows have the same properties. 


7.29 Definition. An object A in a category C is called an initial object if, for every object C in C, 
there exists a unique morphism A —> C. 

An object G in a category C is called a terminal object if, for every object C in C, there 
exists a unique morphism C —»■ £2. 

(i) Prove the uniqueness of initial and terminal objects, if they exist. Give an example of a 
category which contains no initial object. Give an example of a category that contains 
no terminal object. 

(ii) If £2 is a terminal object in a category C, prove, for any G e obj(C), that the projections 
X: G n £2 —>• G and p: £2 n G —> G are equivalences. 

(iii) Let A and B be objects in a category C. Define a new category C' whose objects are 
diagrams 



where C is an object in C and a and /S are morphisms in C. Define a morphism in C' to 
be a morphism 9 in C that makes the following diagram commute: 


A 


C 


B 


1a 


e 



> 


A- C' B 

a ft 


There is an obvious candidate for composition. Prove that C' is a category. 
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(iv) Prove that an initial object in C' is a coproduct in C. 

(v) Give an analogous construction showing that product is a terminal object in a suitable 
category. 

7.30 A zero object in a category C is an object Z that is both an initial object and a terminal object. 

(i) Prove that {0} is a zero object in ^Mod. 

(it) Prove that 0 is an initial object in Sets. 

(iii) Prove that any one-point set is a terminal object in Sets. 

(iv) Prove that a zero object does not exist in Sets. 

7.31 (i) Assuming that coproducts exist, prove associativity: 

A U (B U C) = (A U B) U C. 

(ii) Assuming that products exist, prove associativity: 

A n (B n C) = (A n B) n C. 


7.32 Let Cj, C 2 , D j, £>2 be objects in a category C. 

(i) If there are morphisms /;: C, D ( , for i — 1, 2, and if C\ n C 2 and D\ n £>2 
exist, prove that there exists a unique morphism /j n /2 making the following diagram 
commute: 

f\ n /•> 

Cj n c 2 D\ n D 2 


Pi 

V 


C, 


9i 

Y 


where p, and cp are projections. 

(ii) If there are morphisms g, : X -¥ Cj , where X is an object in C and ( = 1,2, prove that 
there is a unique morphism (gi, g 2 ) making the following diagram commute: 


Ci 




Cl, 


where the p, are projections. 

Hint. First define an analog of the diagonal Ax' X —> K x X in Sets, given by 
x (x, x), and then define (g\, g 2 ) = (gi n g 2 )A x . 


7.33 Let C be a category having finite products and a terminal object £2. A group object in C is 
a quadruple (G, /x, tj, e), where G is an object in C, p.: G PI G -*■ G, ry. G —>■ G, and 
e: £2 —> G are morphisms, so that the following diagrams commute: 

Associativity: 

1 n u 

G n G n G->■ GnG 


U n 1 


Y 

GnG 


p 


r 

G 
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Identity. 

G n Q 1 nf > G n G < n 1 Cl n G 



where A. and p are the equivalences in Exercise 7.29(ii). 
Inverse: 


G 




G n G 




G 


1 

Q. 



co 

v 


Q 


where co: G -*■ £2 is the unique morphism to the terminal object. 

(i) Prove that a group object in Sets is a group. 

(ii) Prove that a group object in Groups is an abelian group. 
Hint. Use Exercise 2.73 on page 95. 


7.3 Functors 

Functors 11 are homomorphisms of categories. 

Definition. Recall that obj(C) denotes the class of all the objects in a category C. If C and 
V are categories, then a functor T: C —»• V is a function such that 

(i) if A e obj(C), then T(A) e obj(X>); 

(ii) if /: A A! in C, then T(f): T(A) T(A') in V ; 

(iii) if A 4 A' A A" in C, then T (A) T H } T(A') ^ T (A") in V and 

T(gf) = T {g)T (/); 

(iv) for every A e obj(C), 

T (1a) = 1 T(A)- 


Example 7.40. 

(i) If C is a category, then the identity functor 1^: C -> C is defined by 

l c (A) = A for all objects A, 


and 

lc (/) = f for all morphisms /. 

1 *The term functor was coined by the philosopher R. Carnap, and S. Mac Lane thought it was the appropriate 
term in this context. 
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(ii) If C is a category and A e obj(C), then the Horn functor T,\ : C —»■ Sets is defined by 
T A {B) — Hom(A, B) for all B e obj(C), 
and if /: B —* B' in C, then Ta(/) : Hom(A, B) -* Hom(A, B’) is given by 

7 a (/): h h * fh. 

We call 7/i ( f ) the induced map , and we denote it by 

Ta(/) = fh. 

Because of the importance of this example, we will verify the parts of the definition in 
detail. First, the very definition of category says that Hom(A, B) is a set. Note that the 
composite fh makes sense: 


fh 



Suppose now that g : B —» B" . Let us compare the functions 

(£/)*,£*/*: Hom(A, B) Hom(A, B"). 
If h e Hom(A, B ), i.e., if h: A -+ B, then 

(gf )*■ h h * (gf)h\ 


on the other hand. 


g*f*- h h* fh h* g(fh). 


as desired. Finally, if / is the identity map l a : A —> A, then 


(1a)* : h i-> 1 A h — h 

for all h € Hom(A, B), so that (1a)* = lHom(A.B)- 

If we denote Hom(A, ) by 7 a, then Theorem 7.32 says that 7 a preserves products: 
T A (UiBi) = UiTA(Bi). 

(iii) If R is a commutative ring and A is an TNmodule, then the Horn functor Ta : ^Mod —> 
Sets has more structure. We have seen, in Proposition 7.5, that Hom«(A, B) is an R- 
module; we now show that if /: B — » B', then the induced map /*: Hom^(A, B) -> 
Hom^(A, B given by/z i-»- fh, is an TNmap. First, /* is additive: If/z, h’ e Hom(A, B), 
then for all a e A, 

/*(/z + h') = f(h + h'): a i-> f(ha + h!a) 

= fha + fh'a = (/*(/z) + Mh'))(a), 
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so that f*(h + h') = f*(h) + Second, /* preserves scalars. Recall that if r e R 

and h e Hom(A, B), then rh : a i—>■ h{ra). Thus, 

f*(rh): a i-> f(rh)(a) = fh(ra), 


while 

rf*(h) = rfh: a i-> fh(ra). 

Therefore, f*(rh) — (rf)*(h). 

In particular, if R is a field, then the Home’s are vector spaces and the induced maps 
are linear transformations. 

(iv) Let C be a category, and let A e obj(C). Define T : C -» C by T (C) = A for every 
C e obj(C), and T (/) = 1 a for every morphism / in C. Then T is a functor, called the 
constant functor at A. 

(v) If C — Groups, define the forgetful functor U: Groups -» Sets by U(G) is the 
“underlying” set of a group G and U (/) regards a homomorphism / as a mere function. 
Strictly speaking, a group is an ordered pair (G, /u), where G is its (underlying) set and 
fjt: G x G —> G is its operation, and U ((G, // )) = G; the functor U “forgets” the operation 
and remembers only the set. 

There are many variants. For example, an R-module is an ordered triple (M, a, cr), 
where M is a set, a: M x M -> M is addition, and a : R x M -* M is scalar multipli¬ 
cation. There are forgetful functors U': /.-Mod —» Ab with U'((M, a , cr)) = (M, a), and 
U" : flMod -> Sets with U"(M, a, cr)) = M, for example. ◄ 

The following result is useful, even though it is very easy to prove. 

Proposition 7.41. IfT:C —> T> is a functor, and if f : A -> B is an equivalence in C, 
then T (/) is an equivalence in T>. 

Proof. If g is the inverse of /, apply T to the equations 


gf = 1 A and fg = l B . • 


This proposition illustrates, admittedly at a low level, the reason why it is useful to 
give categorical definitions: Functors can recognize definitions phrased solely in terms of 
objects, morphisms, and diagrams. How could we prove this result in Ab if we regard an 
isomorphism as a homomorphism that is an injection and a surjection? 

There is a second type of functor that reverses the direction of arrows. 

Definition. If C and V are categories, then a contravariant functor T : C —> V is a 
function such that 


(i) if C e obj(C), then T(C) e obj(D); 

(ii) if /: C -> C’ in C, then T(f): T(C') T(C) in V\ 
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(iii) if C 4 C' \ C" in C, then T(C") ^ T(C') T ^V T(C ) in V and 

T(gf) = T(f)T(g)- 

(iv) for every A e obj(C), 

T(1a) — 1 T (A) ■ 

To distinguish them from contravariant functors, the functors defined earlier are called 
covariant functors. 

Example 7.42. 

(i) If C is a category and B e obj(C), then the contravariant Horn functor T B : C Sets 
is defined, for all C e obj(C), by 

T b (C) = Hom(C, B) 

and if f\C^C in C, then T B (/): Hom(C', B) -> Hom(C, B) is given by 

T B {f ): h hf. 

We call T b (/) the induced map , and we denote it by 

T B (f ) = /*: h ^ hf. 

Because of the importance of this example, we verify the axioms, showing that T B is a 
(contravariant) functor. Note that the composite hf makes sense: 
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for all h € Hom(C, B), so that (1 c)* = lHom(C.B)- 

If Hom( , B) is denoted by T B , then Theorem 7.33 says that the contravariant functor 
T b converts sums to products: A,-) = ]”[/ T b (Ai). 

(ii) If R is a commutative ring and C is an R-module, then the contravariant Horn functor 
A*Mod —> Sets has more structure. We show that if f:C —>• C' is an /?-map, then 
the induced map /*: Horn ft (C', B) -> Horn ft (C, B), given by h i-> hf, is an /?-map 
between /^-modules. First, /* is additive: If g. h e HomfC 7 , B), then for all c e C, 

f*(g + h ) = (g + h)f: c'^(g + h)f(c') 

= gfc’ + hfc’ = ( f*(g ) + f*(hMc'), 

so that f*(g + h ) = f*(g ) + f*(h). Second, f* preserves scalars. Recall that if r e R 
and h e Hom(A, B ), then rli: a hft* h(ra). Thus, 

f*(rh ): c (rh)f(c') = h(rf(c)), 


while 

rf*(h) — r (hf): c' i->- hf(rc'). 

These are the same, because rf(c') = f(rc f ), and so f*(rh) — rf*(h). 

In particular, if R is a field, then the Home’s are vector spaces and the induced maps 
are linear transformations. A special case of this is the dual space functor Honiftf , k ), 
where k is a field. ◄ 

It is easy to see, as in Proposition 7.41, that every contravariant functor preserves equiva¬ 
lences; that is, if T : C —> V is a contravariant functor, and if /’: C —» C is an equivalence 
in C, then T (/) is an equivalence in V. 

Definition. If C and V are pre-additive categories, then a functor T : C —> T), of either 
variance, is called an additive functor if, for every pair of morphisms f, g: A -> B. we 
have 

T(f + g) = T(f) + T(g). 

It is easy to see that Horn functors r Mod -> Ab of either variance are additive functors. 
Every covariant functor T : C -> V gives rise to functions 

T ab : Hom(A, B) -> Horn (TA, TB), 

for every A and B , defined by /; T(h). If T is an additive functor between pre-additive 
categories, then each Tab is a homomorphism of abelian groups; the analogous statement 
for contravariant functors is also true. 

Here is a modest generalization of Corollary 7.34. 
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Proposition 7.43. IfT : r Mod -> Ab is an additive functor of either variance, then T 
preserves finite direct sums: 


T(Ai © • • • © A„) = TiAfi © • • • © T(A n ). 

Proof. By induction, it sufffices to prove that 7X4 © B) = 7(A) © T{B). Proposi¬ 
tion 7.15(iii) characterizes M = A © B by maps p: M —» A, q : M -» B, i: A —>• M, 
and j : B —> M such that 

pi = 1 a, qj = 1b. pj = 0, qi = 0 and ip + jq = 1 M - 

Since T is an additive functor. Exercise 7.34 on page 470 gives T (0) = 0, and so T 
preserves these equations. • 

We have just seen that additive functors T : /. Mod -> Ab preserve the direct sum of 
two modules: 

r(A©C) = T (A) © T (C). 

If we regard such a direct sum as a split short exact sequence, then we may rephrase this 
by saying that if 

o-*a4b4c-*o 

is a split short exact sequence, then so is 

0 T (A) T(B) T -^i T(C ) 0. 

This leads us to the more general question: If 

o->a-4b4c-^o 

is any short exact sequence, not necessarily split, is 

0 T (A) —'l T(B) T -^l T(C) 0 

also an exact sequence? Here is the answer for Horn functors (there is no misprint in the 
statement of the theorem: > 0” should not appear at the end of the sequences, and we 
shall discuss this point after the proof). 

Theorem 7.44. If 

0-> A B 4- C 

is an exact sequence of R-modules, and if X is an R-module, then there is an exact se¬ 
quence 

0 Homfi(X, A) 4 Horace (A, B) ^ Horn^ (A, C). 


Proof (i) ker/* = {0}: 
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If / e ker/*, then f: X —*■ A and /*(/) = 0; that is, 

ifix) = 0 for all x e X. 

Since i is injective, fix) — 0 for all x e X, and so / = 0. 

(ii) im/* c ker /?*: 

If g e im /*, then g: X B and g = /*(/) = if for some f: X —> A. But 
p*(g) — pg — pif — 0 because exactness of the original sequence, namely, im i = ker p, 
implies pi = 0. 

(iii) ker/?* c im/*: 

If g e ker p*, then g: X -> B and p* (g) = pg — 0. Hence, pgix) — 0 for all x e X, 
so that g(x) e ker p — im i. Thus, g(x) = i(a) for some a e A; since / is injective, this 
element a is unique. Hence, the function /: X —»■ A, given by fix) — a if g (x ) = / (a), is 
well-defined. It is easy to check that / e Horn/HIf, A); that is, / is an /^-homomorphism. 
Since 

g(x +x') = g{x) + gix') — i(a) + i(a f ) = i(a + a f ), 


we have 

fix + x') — a + a' — fix) + fix'). 

A similar argument shows that firx) — rfix) for all r e R. But, /*(/) = if and 
ifix) — iia) — gix) for all x e X; that is, /*(/) = g, and so g e im/*. • 

Example 7.45. 

Even if the map p: B —> C in the original exact sequence is assumed to be surjective, the 
functored sequence need not end with “—»■ 0;” that is, p*: Horn/,'(A - , B ) —> Horn r(X. C) 
may fail to be surjective. 

The abelian group Q/Z consists of cosets q + Z for q e Q, and it easy to see that 
its element j + Z has order 2. It follows that HomzOb, Q/Z) f {0}, for it contains the 
nonzero homomorphism [1] i—>• ^ + Z. 

Apply the functor Hom^CIb. ) to 

0 -> Z -4 Q 4 - Q/Z 0, 

where / is the inclusion and p is the natural map. We have just seen that 

Hom z (I 2 ,Q/Z) ± {0}; 

on the other hand, Homz(l 2 , Q) = {0} because Q has no (nonzero) elements of finite 
order. Therefore, the induced map /?*: Homz(l 2 , Q) -> Homz(l 2 . Q/Z) cannot be sur¬ 
jective. < 
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Definition. A covariant functor T: r Mod —>• Ab is called left exact if exactness of 

0 -> A 4 C 

implies exactness of 

0 -> T(A) T{B) T -^l T(C). 

Thus, Theorem 7.44 shows that covariant Horn functors Horn/A A. ) are left exact func¬ 
tors. Investigation of the cokernel of Horn a (A. ) is done in homological algebra; it is 
involved with a functor called Ext^(A, ). 

There is an analogous result for contravariant Horn functors. 

Theorem 7.46. If 

A \ B \ C 0 

is an exact sequence of R-modules, and ifY is an R-module, then there is an exact sequence 

0 -> Horace, Y) 4 Hom*(B, Y) 4 Horn* (A, Y). 

Proof, (i) ker p* = {0}. 

If h e ker p*, then h : C -* Y and 0 = p*(h) = hp. Thus, h(p(b)) = 0 for all b e B, 
so that h(c) = 0 for all c e im p. Since p is surjective, imp — C. and h — 0. 

(ii) im p* C ker/*. 

If g e Hom^(C, T), then 


i*P*(8) = ( P i)*(g) = 0, 


because exactness of the original sequence, namely, im i — ker p, implies pi = 0. 

(iii) ker/* C imp*. 

If g e ker/*, then g: B —> Y and i*(g) = gi = 0. If c e C, then c — p(b) for some 
b e B, because p is surjective. Define /: C —> Y by /(c) = g(b) if c = pib). Note that 
/ is well-defined: If p(b) — p(b'), then b — b' e ker p = im/, so that b — b' = i(a) for 
some a e A. Hence, 


g(b) - g{b’) = g(b - b’) = gi(a ) = 0, 


because gi = 0. The reader may check that / is an /Amap. Finally, 


P*(f) = fP = 8, 


because if c = p{b), then gib) — /(c) = fipib)). Therefore, g e im p*. • 
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Example 7.47. 

Even if the map i: A —* B in the original exact sequence is assumed to be injective, the 
functored sequence need not end with 0;” that is, i*: Horn/A B. Y ) -* Horn r( A, Y) 
may fail be surjective. 

We claim that Homy(Q, Z) = 0. Suppose that / : Q -» Z and / (a/b) ^ 0 for some 
a/b e Q. If f (a/b) = m, then, for all n > 0, 

nf(a/nb) — f(na/nb) = f(a/b) — m. 

Thus, m is divisible by every positive integer n, and this contradicts the fundamental theo¬ 
rem of arithmetic. 

If we apply the functor Homy/ , Z) to the short exact sequence 

o->z-Vq4 q/z -* o, 

where i is the inclusion and p is the natural map, then the induced map 

i*: Homy(Q, Z) —> Homy(Z, Z) 

cannot be surjective, for Homy(Q, Z) = {0} while Homy(Z, Z) ^ {0}, because it con¬ 
tains lz. ◄ 


Definition. A contravariant functor T: r Mod -> Ab is called left exact if exactness of 


B 


C -» 0 


implies exactness of 

0 -> T(C ) ^ T(B) T(A). 

Thus, Theorem 7.46 shows that contravariant Horn functors Horn/A , Y) are left exact 
functors. 12 

There is a converse of Theorem 7.46; a dual statement holds for covariant Horn functors. 


Proposition 7.48. Let i : B' —> B and p : B —» B" be R-maps, where R is a commutative 
ring. If, for every R-module M, 


0 -> Hornes", M) Horn R (B, M) Homes', M) 

is an exact sequence, then so is 

B' -4 B -4 B" -> 0. 


12 These functors are called left exact because the functored sequence has 0 —> on the left. 
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Proof, (i) p is surjective. 

Let M — B"/ im p and let /: B" —> B"/ im p be the natural map, so that / e 
Hom(/J". M). Then p*(f ) = fp — 0, so that f — 0, because p* is injective. There¬ 
fore, B"/ im p = 0, and p is surjective. 

(ii) im i C ker p. 

Since i*p* = 0, we have 0 = (pi)*. Hence, if M — B" and g = 1 b", so that 
g e Horn (B", M), then 0 = ( pi)*g = gpi = pi, and so im i C ker p. 

(iii) ker p C i m i. 

Now choose M — B / im ; and let h: B -» M be the natural map, so that /; e 
Horn (B.M). Clearly, i*h = hi = 0, so that exactness of the Horn sequence gives an 
element h' e Homs(B", M) with p*(h') = h'p = h. We have im; C kerp, by part (ii); 
hence, if im; f ker p, there is an element b e B with b f im; and b e ker p. Thus, 
hb ^ 0 and pb = 0, which gives the contradiction hb — if pb = 0. • 


Definition. A covariant functor T: r Mod —> Ab is an exact functor if exactness of 

o-»a-Vb4c->o 

implies exactness of 

0 -» T(A) —> T(B) -U T (C) -> 0. 

An exact contravariant functor is defined similarly. 

In the next section, we will see that Horn functors are exact functors for certain choices 
of modules. 

Exercises 

7.34 If T : ^Mod -» Ab is an additive functor, of either variance, prove that T (0) = 0, where 0 
denotes either a zero module or a zero morphism. 

7.35 Give an example of a covariant functor that does not preserve coproducts. 

Hint. Use Exercise 7.21(iii) on page 458. 

S T TS 

7.36 Let A — > B — *■ C be functors. Prove that the composite A —»■ C is a functor that is 

covariant if the variances of S and T are the same, and contravariant if the variances of S and 
T are different. 

7.37 (i) Prove that there is a functor on CommRings defined on objects by /? i—and on 

morphisms /: R —»■ S by ;- f(r) (that is, in the formal notation for elements of 
R[x), (r, 0, 0, ■ • •) h* (/(;'), 0, 0, - - -). 

(ii) Prove that there is a functor on Dom, the category of all domains, defined on objects by 
R m- Frac(f?), and on morphisms f: R —*■ S by r/1 /(;•)/1. 

7.38 Prove that there is a functor Groups —> Ab taking each group G to G/G 1 , where G' is its 
commutator subgroup. 
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7.39 (i) If X is a set and k is a field, define the vector space k x to be the set of all functions 

X -¥ k under pointwise operations. Prove that there is a functor F: Sets ^Mod 
with F(X) = k x . 

(ii) If X is a set, define F(X) to be the free group with basis X. Prove that there is a functor 
F: Sets —>• Groups with F: X F(X). 


7.4 Free Modules, Projectives, and Injectives 

The simplest modules are free modules and, as for groups, every module is a quotient 
of a free module; that is, every module has a presentation by generators and relations. 
Projective modules are generalizations of free modules, and they, too, turn out to be useful. 
We define injective modules, as duals of projectives, but their value cannot be appreciated 
until Chapter 10, when we discuss homological algebra. In the meantime, we will see here 
that injective Z-modules are quite familiar. 

Definition. An .R-module F is called a free R-module if F is isomorphic to a direct sum 
of copies of R : that is, there is a (possibly infinite) index set I with 

F = E R i> 

iel 

where R, = (bj) = R for all i. We call B = {bj : i € /} a basis of F. 

A free Z-module is a free abelian group, and every commutative ring R, when consid¬ 
ered as a module over itself, is itself a free R -module. 

From our discussion of direct sums, we know that each m e F has a unique expression 
of the form __ 

m — Yj.b,, 
iel 

where r, e R and almost all r, = 0. A basis of a free module has a strong resemblence to 
a basis of a vector space. Indeed, it is easy to see that a vector space V over a field k is a 
free /.'-module, and that the two notions of basis coincide in this case. 

There is a straightforward generalization of Theorem 3.92 from finite-dimensional vec¬ 
tor spaces to arbitrary free modules (in particular, to infinite-dimensional vector spaces). 

Proposition 7.49. Let F be a free R-module, and let B — {bj : i e 1} be a basis of F. If 
M is any R-module and if y : B -> M is any function, then there exists a unique R-map 
g: F -> M with g(bj) = y(bj) for all i e I. 

F 

'' & 

B - > M 
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Proof. Every element v e F has a unique expression of the form 

v — 

iel 

where r,* e R and almost all r; = 0. Define g : F —> M by 

*(v) = £^oo- • 

iel 

Here is a fancy proof of this result. By Proposition 7.30, a free module F is the coprod¬ 
uct of { {bj) : i e /}, with injections a,- mapping to the vector having r, bj in the ;th 
coordinate and 0’s elsewhere. As for any coproduct, there is a unique map 0: F —> M 
with 9(bi) = y(bj). The maps 0 and g agree on each element of the basis B, so that 0 — g. 

Definition. The number of elements in a basis is called the rank of F. 

Of course, rank is the analog of dimension. The next proposition shows that rank is 
well-defined. 

Proposition 7.50. 

(i) If R is a nonzero commutative ring, then any two bases of a free R-module F have 
the same cardinality ; that is, the same number of elements. 

(ii) If R is a nonzero commutative ring, then free R-modules F and F' are isomorphic 
if and only (/‘rank(F) = rank(F'). 

Proof, (i) Choose a maximal ideal I in R (which exists, by Theorem 6.46). If A is a 
basis of the free /(-module F, then Exercise 7.6 on page 440 shows that the set of cosets 
{t> + IF : v e X] is a basis of the vector space F/IF over the field R/I. If Y is another 
basis of F, then the same argument gives {u + IF : u e Y] a basis of F/IF. But any two 
bases of a vector space have the same size (which is the dimension of the space), and so 
|A| = \Y\, by Theorem 6.51. 

(ii) Let A be a basis of F, let X' be a basis of F' , and let y : A —*■ A' be a bijection. 
Composing y with the inclusion X' —» F' , we may assume that y : X —> F' . By Propo¬ 
sition 7.49, there is a unique F-map <p\ F —> F' extending y. Similarly, we may regard 
y~ l : X' -> A as a function X' —> F, and there is a unique rjr: F' —> F extending y~ l . 
Finally, both i/np and If extend I x, so that f<p — If- Similarly, the other composite 
is 1 pt, and so <p: F -> F' is an isomorphism. (The astute reader will notice a strong 
resemblance of this proof to the uniqueness of a solution to a universal mapping problem.) 

Conversely, suppose that cp: F -> F' is an isomorphism. If {i> ( - : i el} is a basis of 
F, then it is easy to see that {<p(t>;) : i e /} is a basis of F'. But any two bases of the free 
module F' have the same size, namely, rank(F'), by part (i). Hence, rank(F') = rank(F). 


The next proposition will enable us to use free modules to describe arbitrary modules. 
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Proposition 7.51. Every R-module M is a quotient of a free R-module F. Moreover, M 
is finitely generated if and only if F can be chosen to be finitely generated. 

Proof. Let F be the direct sum of \M\ copies of R (so F is a free module), and let 
{x m : m e M } be a basis of F. By Proposition 7.49, there is an A 1 -map g: F -> M with 
g(x m ) = m for all m e M. Obviously, g is a surjection, and so F/ kerg = M. 

If M is finitely generated, then M = (mi, .... m n ). If we choose F to be the free R- 
module with basis {xi,..., x„}, then the map g: F —> M with g(x, ) — m , is a surjection, 
for 

img = (g(xi),- g(x„)) = (mi,..., m„) = M. 

The converse is obvious, for any image of a finitely generated module is itself finitely 
generated • 

The last proposition can be used to construct modules with prescribed properties. For 
example, let us consider Z-modules (i.e., abelian groups). The group Q/Z contains an 
element a of order 2 satisfying the equations a = 2 n a„ for all n > 1; take a — ^ + Z and 
a n = l/2" +1 + Z. Of course, Hom^CQ, Q/Z) £ {0} because it contains the natural map. 
Is there an abelian group G with Homz(Q, G) = {0} that contains an element a of order 2 
satisfying the equations a — 2" a„ for all n >17 Let F be the free abelian group with basis 

{a, b l ,b 2 , ...,b n ,...} 


and relations 


{2a, a — 2 n b n , n > 1); 


that is, let K be the subgroup of F generated by {2a, a — 2 n b n , n > 1}. Exercise 7.48 
on page 487 asks the reader to verify that G — F/K satisfies the desired properties. This 
construction is a special case of defining an /^-module by generators and relations (as we 
have already done for groups). 

Definition. LetX — {x, : i e /} be a basis of a free A’-module F, and let '12 = !5~, r ji x i ■ 
j e /} be a subset of F. If K is the submodule of F generated by '12, then we say that the 
module M — F/ K has generators X and relations 77. 11 We also say that the ordered pair 
(Aj77) is a presentation of M. 

We will return to presentations at this end of the section, but let us now focus on the key 
property of bases. Lemma 7.49 (which holds for free modules as well as for vector spaces), 
in order to get a theorem about free modules that does not mention bases. 

Theorem 7.52. If R is a commutative ring and F is a free R-module, then for every 
surjection p: A —> A" and each h: F —> A", there exists a homomorphism g making the 

1 ' A module is called free because it has no entangling relations. 
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following diagram commute: 


F 

S h 

A A" -^ 0 

Proof. Let {/?,- : i e 1} be a basis of F. Since p is surjective, there is a, e A with 
p(aj) — h{bj) for all i. By Proposition 7.49, there is an P-map g: F —»• A with 

g(b[) — ai for all i. 

Now pg(bi) = p(aj) = h(bj), so that pg agrees with h on the basis {bj : i e /}; it follows 
that pg — h on {{bj : i e /}> = F\ that is, pg — h. • 

Definition. We call a map g: F —>■ A with pg — h (in the diagram in Theorem 7.52) a 
lifting of h. 

If C is any, not necessarily free, module, then a lifting g of h, should one exist, need not 
be unique. Since pi = 0, where i : ker p —> A is the inclusion, other liftings are g + if 
for any / e Hom^(C, ker p). Indeed, this is obvious from the exact sequence 

0 -> Hom(C, ker p) Hom(C, A) -A Hom(C, A"). 

Any two liftings of h differ by a map in ker p. t . — im i* C Hom(C, A). 

We now promote this (basis-free) property of free modules to a definition. 

Definition. A module P is projective if, whenever p is surjective and h is any map, there 
exists a lifting g; that is, there exists a map g making the following diagram commute: 

P 

S , h 

A A" ->■ 0 

We know that every free module is projective; is every projective /^-module free? We 
shall see that the answer to this question depends on the ring R. Note that if projective 
/^-modules happen to be free, then free modules are characterized without having to refer 
to a basis. 

Let us now see that projective modules arise in a natural way. We know that the Horn 
functors are left exact; that is, for any module P, applying Horn/A/L ) to an exact sequence 

o 

gives an exact sequence 

0 -* Horn*(P, A') Horn r(P, A) Horn R (P, A”). 
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Proposition 7.53. A module P is projective if and only z/Hom/A P, ) is an exact functor. 

Remark. Since Hom/A P, ) is a left exact functor, the thrust of the proposition is that p* 
is suijective whenever p is surjective. ■* 

Proof If P is projective, then given h: P —> A", there exists a lifting g: P -> A with 
pg — h. Thus, if h e Hom^(P, A"), then h — pg — p*(g) e imp*, and so p* is 
suijective. Hence, Hom( P, ) is an exact functor. 

For the converse, assume that Hom(P, ) is an exact functor, so that p* is surjective: 
If h e Hom^(P, A"), there exists g e Homs(P, A) with h = p*(g) = pg. This says 
that given p and h, there exists a lifting g making the diagram commute; that is, P is 
projective. • 


Proposition 7.54. A module P is projective if and only if every short exact sequence 


is split. 

Proof. If P is projective, then there exists /: P B making the following diagram 
commute; that is, pj = 1 />. 

P 

i / i, 

A p \ 

B -> P -^0 

Corollary 7.17 now gives the result. 

Conversely, assume that every short exact sequence ending with P splits. Consider the 
diagram 

P 

f 

p ' 

B --^0 

with p surjective. Now form the pullback 


D 


> P 




J 


f 


B 



0 


By Exercise 7.28 on page 459, surjectivity of p in the pullback diagram gives surjectivity 
of a. By hypothesis, there is a map j: P -> D with aj = 1 />. Define g: P -> B by 
g = f j. We check: 

P8 = Pfij = f<*j = flp = f- 
Therefore, P is projective. • 

We restate one half of this proposition so that the word exact is not mentioned. 
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Corollary 7.55. Let Abe a submodule of a module B. If B / A is projective, then there is 
a submodule C of B with C = B / A and B = A © C. 


Theorem 7.56. An R-module P is projective if and only if P is a direct summand of a 
free R-module. 

Proof Assume that P is projective. By Proposition 7.51, every module is a quotient of a 
free module. Thus, there is a free module F and a surjection g: F —> P, and so there is 
an exact sequence 

0 -> kerg -> F —>■ P -> 0. 

Proposition 7.54 now shows that P is a direct summand of F. 

Suppose that P is a direct summand of a free module F, so there are maps q : F —»■ P 
and j: P —»• F with qj = 1 p. Now consider the diagram 


q 



h \ f 


Y p I 

B ->■ C -^0, 

where p is surjective. The composite f q is a map F -> C; since F is free, it is projective, 
and so there is a map h : F —> B with ph — f q. Define g: P -> B by g = hj. It remains 
to prove that pg — f. But 


pg = phj = fqj = fl P = f. • 

Actually, the second half of the proof shows that any direct summand of a projective 
module is itself projective. 

We can now give an example of a commutative ring R and a projective .R-module that 
is not free. 

Example 7.57. 

The ring R = Ig is the direct sum of two ideals: 

16 = 7®/, 


where 

J = {[0], [2], [4]} = I 3 and / = {[0], [3]} = I 2 . 

Now If, is a free module over itself, and so J and /, being direct summands of a free 
module, are projective ][(,-modules. Neither J nor I can be free, however. After all, a 
(finitely generated) free Ig-module F is a direct sum of, say, n copies of Ig, and so F has 
6" elements. Therefore, J is too small to be free, for it has only three elements. ■* 
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Describing projective R-modules is a problem very much dependent on the ring R. In 
Chapter 9, for example, we will prove that if R is a PID, then every submodule of a free 
module is itself free. It will then follow from Theorem 7.56 that every projective R -module 
is free in this case. A much harder result is that if R — k[x \,..., x n ] is the polynomial 
ring in n variables over a field k, then every projective R-module is also free; this theorem, 
implicitly conjectured 14 by J.-P. Serre, was proved, independently, by D. Quillen and by A. 
Suslin (see Rotman, An Introduction to Homological Algebra, pages 138-145, for a proof). 
There is a proof of the Quillen-Suslin theorem using Grobner bases, due to N. Fitchas, A. 
Galligo, and B. Sturmfels. 

There are domains having projective modules that are not free. For example, if R is the 
ring of all the algebraic integers in an algebraic number field (that is, an extension of Q of 
finite degree), then every ideal in R is a projective R-module. There are such rings R that 
are not PIDs, and any ideal in R that is not principal is a projective module that is not free 
(we will see this in Chapter 11 when we discuss Dedekind rings). 

Here is another characterization of projective modules. Note that if A is a free R-module 
with basis {a,- : i e 1} C A, then each x e A has a unique expression * = T,iei r i at, and 
so there are R-maps ipt: A -> R given by q>i : x i-> r,-. 

Proposition 7.58. An R-module A is projective if and only if there exist elements 
{a, : i e 1} C A and R-maps {tpi: A —» R : i e 1} such that 

(i) for each x e A, almost all <pi(x) — 0; 

(ii) for each x e A, we have x — Y2iei( ( Pi x ) a i- 
Moreover, A is generated by {a,- : i e /} C A in this case. 

Proof. If A is projective, there is a free R-module F and a surjective R-map \j/ : F —> A. 
Since A is projective, there is an R-map tp: A —*■ F with ij/cp = 1 a, by Proposition 7.54. 
Let {e; ; i e 1} be a basis of F, and define at = i jr(et). Now if x e A, then there is a unique 
expression q>(x) — r, c/, where r, e R and almost all r, = 0. Define tpp. A —»■ R by 
<Pi(x) = rj. Of course, given x, we have (fii(x) — 0 for almost all i. Since i/r is surjective, 
A is generated by {a,- = \[r (e,-) : i e /}. Finally, 

x = ftp(x) = T/r(y~Vfe;) 

= = ?>*)*(«*) = 5 ~2(<Pix)ai. 

Conversely, given {a, : i e 1} c A and a family of R-maps { (fj: ,4 R : i e 1} 
as in the statement, define F to be the free R-module with basis {e,- : i e /}, and define 
an R-map ■jr. F —*■ A by fr. e; a,-. It suffices to find an R-map tp: A —>■ F with 
frcp — 1a, for then A is (isomorphic to) a retract (i.e., A is a direct summand of F), and 
hence A is projective. Define q> by cp{x) — J 2i(<Pi x )ei, for x e A. The sum is finite, by 

11 On paye 243 of “Faisceaux Algebriques Coherents,” Annals of Mathematics 61 (1955), 197-278, Serre writes 
"... on ignore s'il existe des A-modules projectifs de type fini qui ne soient pas libres.” Here, A = k[x \,..., x n ]. 
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condition (i), and so (p is well-defined. By condition (ii), 

f<P(x) = i' y^S<Pix)ej = y^(^x)i//(e, ) = 'Y^{(p i x)a i = x; 

that is, jrip — 1^. • 

Definition. If A is an R -module, then a subset {a,- : i e /} c A and a family of /?-maps 
{(Pi: A -* R : i e 1} satisfying the condition in Proposition 7.58 is called a projective 
basis. 

An interesting application of projective bases is due to R. Bkouche. Let A be a locally 
compact Hausdorff space, let C(X) be the ring of all continuous real-valued functions on 
X , and let J be the ideal in C(A) consisting of all such functions having compact support. 
Then A is a paracompact space if and only if / is a projective C(A)-module. 

Remark. The definition of projective module can be used to define a projective object in 
any category (we do not assert that such objects always exist), if we can translate surjection 
into the language of categories. One candidate arises from Exercise 7.18 on page 441, but 
we shall see now that defining surjections in arbitrary categories is not so straightforward. 

Definition. A morphism cp\ B —> C in a category C is an epimorphism if <p can be 
canceled from the right; that is, for all objects D and all morphisms h: C -> D and 
k: C -> Z>, we have tup = kcp implies h — k. 

w h 

B C =t D 

k 

Now Exercise 7.18 on page 441 shows that epimorphisms in r Mod are precisely the 
suijections, and Exercises 7.45 on page 487 and 7.19 on page 441 show that epimorphisms 
in Sets and in Groups, respectively, are also surjections. However, in CommRings, it is 
easy to see that if R is a domain, then the ring homomorphism <p: R —> Frac(R), given 
by r i-> r/1, is an epimorphism; if A is a commutative ring and h. k: Frac( R ) -* A are 
ring homomorphisms that agree on R. then h — k. But <p is not a surjective function if R 
is not a field. A similar phenomenon occurs in Top. If /; A -> Y is a continuous map 
with im / a dense subspace of Y, then / is an epimorphism, because any two continuous 
functions agreeing on a dense subspace must be equal. 

There is a similar problem with monomorphisms , a generalization of injections to arbi¬ 
trary categories: A category whose objects have underlying sets may have monomorphisms 
whose underlying function is not an injection. < 

Let us return to presentations of modules. 

Definition. An R -module M is finitely presented if it has a presentation (A’\'IZ) in which 
both X and 1Z are finite. 
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If M is finitely presented, there is a short exact sequence 

where F is free and both K and F are finitely generated. Equivalently, M is finitely 
presented if there is an exact sequence 

F' F M 0, 

where both F' and F are finitely generated free modules (just map a finitely generated free 
module F' onto K). Note that the second exact sequence does not begin with "0 — 

Proposition 7.59. If R is a commutative noetherian ring, then every finitely generated 
R-module is finitely presented. 

Proof. If M is a finitely generated R -module, then there is a finitely generated free R- 
module F and a surjection qj : F —» M. Since R is noetherian. Proposition 7.23 says that 
every submodule of F is finitely generated. In particular, ker qj is finitely generated, and so 
M is finitely presented. • 

Every finitely presented module is finitely generated, but we will soon see that the con¬ 
verse may be false. We begin by comparing two presentations of a module (we generalize 
a bit by replacing free modules by projectives). 

Proposition 7.60 (Schanuel’s Lemma). Given exact sequences 

0->kAp^M-*0 

and 

where P and P' are projective, then there is an isomorphism 

K®P' = K'®P. 

Proof. Consider the diagram with exact rows 


0-^ K — P ——>■ M - ^0 



Since P is projective, there is a map ft: P — > P' with n' ft = tt. that is, the right square 
in the diagram commutes. We now show that there is a map a: K -> K' making the 
other square commute. If x e K, then tt' ftix = nix = 0, because ni = 0. Hence, 
ftix e kei'Tr' = im i'\ thus, there is x' e K' with i'x' — ftix\ moreover, x is unique 
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because i' is injective. Therefore, a: x i —> x' is a well-defined function a: K -> K' that 
makes the first square commute. The reader can show that a is an A 1 -map. 

This commutative diagram with exact rows gives an exact sequence 

0 -* K 4 P © K' X P' -> 0, 

where 0 : x (ix, ax) and i/r: (u, x') i-> fit — i'x', for x e K, u e P, and x' e K'. 
Exactness of this sequence is a straightforward calculation that is left to the reader; this 
sequence splits because P' is projective. • 


Corollary 7.61. If M is finitely presented and 

0-+K-+F-+M-+0 

is an exact sequence, where F is a finitely generated free module, then K is finitely gener¬ 
ated. 

Proof Since M is finitely presented, there is an exact sequence 

0->K'-+F'->M->0 

with F' free and with both F' and K' finitely generated. By Schanuel’s lemma, K © F' = 
K’ © F. Now K' © F is finitely generated because both summands are, so that the left 
side is also finitely generated. But K, being a summand, is also a homomorphic image of 
K © F' , and hence it is finitely generated. • 

We can now give an example of a finitely generated module that is not finitely presented. 

Example 7.62. 

Let R be a commutative ring that is not noetherian; that is, R contains an ideal I that is not 
finitely generated (see Example 6.39). We claim that the /{-module M = /(//is finitely 
generated but not finitely presented. Of course, M is finitely generated; it is even cyclic. 
If M were finitely presented, then there would be an exact sequence 0 —> K —> F -» 
M —> 0 with F free and both K and F finitely generated. Comparing this with the exact 
sequence 0 —>•/—>•/?—>• M —> 0, as in Corollary 7.61, gives I finitely generated, a 
contradiction. Therefore, M is not finitely presented. -4 

There is another type of module that also turns out to be interesting. 

Definition. If E is a module for which the contravariant Horn functor Horn a ( , E) is 
an exact functor—that is, if Hom«( , E ) preserves all short exact sequences—then E is 
called an injective module. 

The next proposition is the dual of Proposition 7.53. 
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Proposition 7.63. A module E is injective if and only if a dotted arrow always exists 
making the following diagram commute whenever i is an injection: 

E 


0->■ A - B 

l 

In words, every homomorphism from a submodule into E can always be extended to a 
homomorphism from the big module into E. 

Remark. Since Hom^( , E) is a left exact contravariant functor, the thrust of the propo¬ 
sition is that i* is surjective whenever i is injective. 

Injective modules are duals of projective modules in that both of these terms are charac¬ 
terized by diagrams, and the diagram for injectivity is the diagram for projectivity having 
all arrows reversed. ◄ 

Proof. If E is injective, then Hom(, E) is an exact functor, so that i* is surjective. There¬ 
fore, if / e Horn(4, E), there exists g e Horns (B, E) with / = i*(g) = gi: that is, the 
diagram commutes. 

For the converse, if E satisfies the diagram condition, then given f: A -> E, there 
exists g: B — »■ E with gi = f. Thus, if / e Homs(A, E), then / = gi — i*(g) e imi*, 
and so i* is surjective. Hence, Hom(, E) is an exact functor, and so E is injective. • 

The next result is the dual of Proposition 7.54. 

Proposition 7.64. A module E is injective if and only if every short exact sequence 


is split. 

Proof. If E is injective, then there exists q: B —*■ E making the following diagram 
commute; that is, qi = 1 e- 

E 

0- >- E -> B 

i 

Exercise 7.17 on page 441 now gives the result. 

Conversely, assume every exact sequence beginning with E splits. The pushout of 

E 

f 

0-^ A — B 

l 
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is the diagram 

E —^ D 

f P 

0 -=- A — r^~B 

l 

By Exercise 7.28 on page 459, the map a is an injection, so that 

0 —> E -> D -> coker a —> 0 

splits; that is, there is q : D —> E with qa = 1 e- If we define g : B —* E by g — qf, then 
the original diagram commutes: 


gi = qfii = quf = 1 E f = /■ 


Therefore, E is injective. • 

This proposition can be restated without mentioning the word exact. 

Corollary 7.65. If an injective module E is a submodule of a module M, then E is a 
direct summand of M: There is a submodule S of M with S = M/E and M = E © S. 


Proposition 7.66. If { If : i e 1} is a family of injective modules, then Y\ i( -j E, is also 
an injective module. 

Proof. Consider the diagram 


E 

A 

/ 

0- A B, 

where E — P| E, . Let pi: E -> E, be the /th projection. Since E, is injective, there is 
gi \ B -» Ej with gjK — pif. Now define g : B —> E by g : b i-> ( gj(b )). The map g 
does extend /, for if b = ku, then 

g(Ka) = (gj(Ka)) = (pi fa) = fa, 

because x = ( p t x ) is true for every x in the product. • 


Corollary 7.67. A finite direct sum of injective modules is injective. 

Proof. The direct sum of finitely many modules coincides with the direct product. • 
A useful result is the following theorem due to R. Baer. 
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Theorem 7.68 (Baer Criterion). An R-module E is injective if and only if every R-map 
f: I —> E, where I is an ideal in R, can be extended to R. 

E 


0 ->■ /- 

l 

Proof. Since any ideal I is a submodule of R, the existence of an extension g of / is just 
a special case of the definition of injectivity of E. 

Suppose we have the diagram 

E 

A 

/ 

0- A — t+B, 

l 

where A is a submodule of a module B. For notational convenience, let us assume that i is 
the inclusion [this assumption amounts to permitting us to write a instead of i (a) whenever 
a € A], We are going to use Zorn’s lemma on approximations to an extension of /. More 
precisely, let X be the set of all ordered pairs (A', g'), where A c A' C B and g': A' —> E 
extends /; that is, g’\A — f. Note that 1/0 because (A, /) e X. Partially order X by 
defining 

(A', g') < (A", g") 

to mean A' C A" and g" extends g'. The reader may supply the argument that Zorn’s 
lemma applies, and so there exists a maximal element (Ao, go) in X. If Ao = B, we are 
done, and so we may assume that there is some b e B with b / Ao- 
Define 

I — {r e R: rb e Ao}. 

It is easy to see that I is an ideal in R. Define /z :/—>£’ by 

h(r) = go(rb). 

By hypothesis, there is a map h* : R -> E extending h. Finally, define Ai = Ao + (b) and 
g\ : Ai -> E by 

gl («0 + rb) = go(«o) + rh*{\), 

where no^Ao and r e R. 

Let us show that gi is well-defined. If ao+rb — a' 0 +r'b, then ( r — r')b — a' 0 —ao e Ao; 
it follows that r — r' e /. Therefore, go((r — r')b) and h(r — r') are defined, and we have 

go( a o ~ a o) = go((r ~ r’)b) = h(r - r') = h*(r - r') = (r - r’)h*{ 1). 

Thus, go(a f Q ) - go (ao) = rh*{ 1) - r'h*( 1) and go(a' 0 ) + r’h*{ 1) = go(ao) + rh*( 1), as 
desired. Clearly, gi(ao) — go(«o) for all ao e Ao, so that the map gi extends go. We 
conclude that (Ao, go) -< (Ai, gi), contradicting the maximality of (Ao, go)- Therefore, 
Aq = B, the map go is a lifting of /, and E is injective. • 
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Are arbitrary direct sums of injective modules injective? 

Proposition 7.69. If R is noetherian and {Ej : i e 1} is a family of injective R-modules, 
then Jf l£l Ei is an injective module. 

Proof By the Baer criterion. Theorem 7.68, it suffices to complete the diagram 


E/e/ Ei 
f 

0 --./-— R, 

where J is an ideal in R. Since R is noetherian, J is finitely generated, say, J = 
(ai,..., a n ). For k = 1 f{af) e E/s/ Ei has only finitely many nonzero co¬ 
ordinates, occurring, say, at indices in S(af) C I. Thus, S — Ua-=i S( a k) is a finite set, 
and so im / C ff j€S Eg, by Corollary 7.67, this finite sum is injective. Hence, there is an 
/Cmap g f : R E/eS Ei extending /. Composing g' with the inclusion of £) into 
Ei completes the given diagram. • 

It is a theorem of H. Bass that the converse of Proposition 7.69 is true: If every direct 
sum of injective R -modules is injective, then R is noetherian (see Theorem 8.105). 

We can now give some examples of injective modules. 

Proposition 7.70. If R is a domain, then Q — Frac( R ) is an injective R-module. 

Proof By Baer’s criterion, it suffices to extend an /Cmap /: I —> Q, where I is an ideal 
in R, to all of R. Note first that if a, he! are nonzero, then af(b ) = f{ab) — bf(a), so 
that 

f(a)/a — f{b)/b in Q for all nonzero a, b e /; 

let c e Q denote their common value (note how I being an ideal is needed to define c: the 
product ab must be defined, and either factor can be taken outside the parentheses). Define 
g : R-> Q by 

g(r) = rc 

for all r e R. It is obvious that g is an /Cmap. To see that g extends /, suppose that a e /; 
then 

g(a) = ac = af(a)/a = f(a). 

It now follows from Baer’s criterion that Q is an injective /^-module. • 


Definition. If R is a domain, then an /^-module D is divisible if, for each d e D and 
every nonzero r e R, there exists d' e D with d — rd'. 
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Example 7.71. 

Let R be a domain. 

(i) Tract R ) is a divisible .R-module. 

(ii) Every direct sum of divisible R-modules is divisible. Hence, every vector space over 
Tract R) is a divisible R-module. 

(iii) Every quotient of a divisible R -module is divisible. 4 


Lemma 7.72. If R is a domain, then every injective R-module E is divisible. 

Proof. Assume that E is injective. Let e e E and let ro e R be nonzero; we must 
find x e E with e = r^x. Define /; (ro) -> E by f(rro) = re (note that / is well- 
defined: Since R is a domain, rro = r'ro implies r = r'). Since E is injective, there exists 
h: R —> E extending /. In particular, 

e = /(r 0 ) = /z fr 0 ) = r 0 /z(l), 

so that x = /z(l) is the element in E required by the definition of divisible. • 

We now prove that the converse of Lemma 7.72 is true for PIDs. Proposition 11.111 
shows that a domain R is a Dedekind ring (defined in the last chapter) if and only if every 
divisible R -module is injective. 

Corollary 7.73. If R is a PID, then an R-module E is injective if and only if it is divisible. 

Proof. Assume that E is divisible. By the Baer criterion. Theorem 7.68, it suffices to 
extend maps /: / —> E to all of R. Since R is a PID, I is principal; say, I — (ro) for 
some ro e I . Since E is divisible, there exists e e E with r$e — /(ro). Define h: R -* E 
by h(r) = re. It is easy to see that h is an A 1 -map extending /, and so E is injective. • 

Remark. There are domains for which divisible modules are not injective; indeed, there 
are domains for which a quotient of an injective module need not be injective. 4 

Example 7.74. 

In light of Example 7.71, the following abelian groups are injective Z-modules: 

Q, R, C, Q/Z, R/Z, S\ 

where .S’ 1 is the circle group; that is, the multiplicative group of all complex numbers z 
with \z | = 1. ◄ 

Proposition 7.51 says that, over any ring, every module is a quotient of a projective 
module (actually, it is a stronger result: Every module is a quotient of a free module). The 
next result is the dual result for Z-modules: Every abelian group can be imbedded as a 
subgroup of an injective abelian group. We will prove this result for modules over any ring 
in Chapter 8 (see Theorem 8.104). 
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Corollary 7.75. Every abelian group M can be imbedded as a subgroup of some injective 
abelian group. 

Proof. By Proposition 7.51, there is a free abelian group F = f/, Z; with M — F/K for 
some K C F. Now 


M — F/K — Zi)/K c (£(},•)/*, 

i i 

where we have merely imbedded each copy Z; of Z into a copy Q ,• of Q. But Example 7.71 
gives each Qdivisible, hence gives Qdivisible, and hence gives divisibility of the 
quotient (JT Q i)/K. By the Proposition, (JT Q/j/W is injective. • 

Writing a module as a quotient of a free module is the essence of describing it by 
generators and relations. We may think of the corollary as dualizing this idea. 

The next result gives a curious example of an injective module; we shall actually use 
it to prove an interesting result (see the remark on page 654 after the proof of the basis 
theorem). 

Proposition 7.76. Let R be a PID, let a e R be neither zero nor a unit, and let J — (a). 
Then R/J is an injective R/ J -module. 

Proof. By the correspondence theorem, every ideal in R/J has the form I/J for some 
ideal I in R containing J. Now I = (b) for some h e /, so that I/J is cyclic with 
generator x — b + J. Since (a) C (/?), we have a = rb for some r e R. We are going to 
use the Baer criterion. Theorem 7.68, to prove that R/J is injective. 

Assume that /: I/J —> R/J is an R/J- map, and write f(b + J) — s + J for some 
s e R. Since r(b+J ) = rb+J — a + J = 0, we have rf(b+J) = r(s + J) = rs + J — 0, 
and so rs e J — (a). Hence, there is some r' e R with rs — r'a — r'br\ canceling r gives 
5 = r'b. Thus, 

f(b + J) = s + J = r'b + J. 

Define h : R/J —> R/J to be multiplication by r'\ that is, h: u + J i->- r'u + J. The 
displayed equation gives h(b + J) = f(b + J), so that h does extend /. Therefore, R/J 
is injective. • 


Exercises 


7.40 Let M be a free R -module, where R is a domain. Prove that if rm = 0, where r e R and 
m e M, then either r = 0 or m = 0. (This is false if R is not a domain.) 

7.41 Use left exactness of Horn to prove that if G is an abelian group, then Hom^(I„, G) = G[n], 
where G[«] = (g e G : ng = 0). 

7.42 Prove that a group G e obj (Groups) is a projective object if and only if G is a free group. (It 
is proved, in Exercise 10.3 on page 793, that the only injective object in Groups is {1}.) 
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7.43 If R is a domain but not a field, and if Q = Frac (R), prove that 

Hom^Ig, R) = {0}. 

7.44 Prove that every left exact covariant functor T: ^Mod —»■ Ab preserves pullbacks. Conclude 
that if B and C are submodules of a module A, then for every module M, we have 

Hom^(M, B fl C) = Hom^(M, B) FI Hom/j(M, C). 

Hint. Use pullback. 

7.45 (i) Prove that a function is an epimorphism in Sets if and only if it is a surjection. 

(ii) Prove that every object in Sets is projective, where an object P in a category is projective 
if a dotted arrow always exists for the diagram 

P 



where p is an epimorphism. 

Hint. Use the axiom of choice. 

7.46 Given a set A, prove that there exists a free R -module F with a basis B for which there is a 
bijection <p: B —> X. 

7.47 (i) Prove that every vector space V over a field k is a free fc-module. 

(ii) Prove that a subset B of V is a basis of V considered as a vector space if and only if B 
is a basis of V considered as a free fc-module. 

7.48 Define G to be the abelian group having the presentation (X\1Z), where 

X = [a, b\, b 2 , • • •, b n ,...} and 1Z = {2a, a — 2 n b n , n > 1). 

Thus, G = F/K, where F is the free abelian group with basis X and K is the subgroup {'R). 

(i) Prove that a + K e G is nonzero. 

(ii) Prove that z = a + K satifies equations z = 2 n y n , where y n e G and n > 1, and that z 
is the unique such element of G. 

(iii) Prove that there is an exact sequence 0 —> {a) —> G —*■ Y-.ii> I ^2" -► 0. 

(iv) Prove that Honi2(Q, G) = {0} by applying Hom^fQ, ) to the exact sequence in part (iii). 

7.49 (i) If [Pi : i e I] is a family of projective fGmodules, prove that their direct sum Y2iel A 

is also projective. 

(ii) Prove that every direct summand of a projective module is projective. 

7.50 Prove that every direct summand of an injective module is injective. 

7.51 Give an example of two injective submodules of a module whose intersection is not injective. 
Hint. Define abelian groups A = Z(p°°) = A': 

A = (a n ,n > 0\pao = 0, pa n+ \ = a„) and A' = (a„, n > 0|pflQ = 0, pa n+l = a n ). 

In A © A'. define E = A © {0} and E' = ({(a„_|_i, a' n ) : n > 0)). 
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7.52 (i) Prove that if a domain R is an injective f?-module, then R is a field. 

(ii) Let R be a domain that is not a field, and let M be an R -module that is both injective 
and projective. Prove that M — {0}. 

(iii) Prove that Ig is simultaneously an injective and a projective module over itself. 

7.53 (i) If R is a domain and I and J are nonzero ideals in R, prove that in I / {0}. 

(ii) Let R be a domain and let I be an ideal in R that is a free R -module; prove that I is a 

principal ideal. 

7.54 Prove that an ^-module E is injective if and only if. for every ideal I in R, every short exact 
sequence 0 —> E —> B —> I 0 splits. 

7.55 Prove the dual of Schanuel’s lemma. Given exact sequences 

0 ^ M \ E S Q 0 and 0 -* M 4 E' 4 Q' 0. 

where E and E' are injective, then there is an isomorphism 

Q © E' = Q! © E. 

7.56 (i) Prove that every vector space over a field k is an injective fc-module. 

(ii) Prove that if 0 —> U V —> W —0 is an exact sequence of vector spaces, then the 

corresponding sequence of dual spaces 0 —>• IV* —»■ V* —y U* —»■ 0 is also exact. 

7.57 (. Pontrjagin Duality ) If G is an abelian group, its Pontrjagin dual is the group 

G* = Hom z (G, Q/Z). 

(Pontrjagin duality extends to locally compact abelian topological groups, and the dual con¬ 
sists of all continuous homomorphisms into the circle group.) 

(i) Prove that if G is an abelian group and a € G is nonzero, then there is a homomorphism 
/: G -» Q/Z with f(a) / 0. 

(ii) Prove that Q/Z is an injective abelian group. 

(iii) Prove that if 0 —> A —> G —>■ B —>■ 0 is an exact sequence of abelian groups, then so is 
0 —► B* -» G* —► A* —► 0. 

(iv) If G = I„, prove that G* = G. 

(v) If G is a finite abelian group, prove that G* = G. 

(vi) Prove that if G is a finite abelian group, and if G/H is a quotient group of G, then G/H 
is isomorphic to a subgroup of G. [The analogous statement for nonabelian groups is 
false: If Q is the group of quaternions, then Q/Z(Q) = V, where V is the four-group; 
but Q has only one element of order 2 while V has three elements of order 2. This 
exercise is also false for infinite abelian groups: Since Z has no element of order 2, it 
has no subgroup isomorphic to Z/2Z = 1L.] 


7.5 Grothendieck Groups 

A. Grothendieck introduced abelian groups to help study projective modules. The reader 
may regard this section as a gentle introduction to algebraic K -theory. 
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Definition. A category C is a ^-category if there is a commutative and associative binary 
operation +: obj(C) x obj(C) -* obj(C); that is, 

(i) If A = A' and B = B’, where A, A ', B, B' e obj(C), then A * B = A' + B'. 

(ii) there is an equivalence A * B = B * ,4 for all A. B e obj(C); 

(iii) there is an equivalence A * (B * C) = (A ★ B) * C for all A, B, C e obj(C). 

Any category having finite products or finite coproducts is a *-category. 

Definition. If C is a ^-category, define |obj(C)| to be the class of all isomorphism classes 
| A| of objects in C, where |A| = {B e obj(C) : B = A}. If 7F(C) is the free abelian group 
with basis 15 |obj(C)| and 1Z is the subgroup of T(C) generated by all elements of the form 

| A ★ B | — | A\ — | B\ where A, B e obj(C), 

then the Grothendieck group Kq(C) is the abelian group 

K 0 (C) = no/n. 

(A characterization of Kq(C ) as a solution to a universal mapping problem is given in 
Exercise 7.58 on page 498.) For any object A in C, we denote the coset | A\ + 1Z by [A], 

We remark that the Grothendieck group Kq(C ) can be defined more precisely: C should 
be a symmetric monoidal category (see Mac Lane, Categories for the Working Mathemati¬ 
cian,pages 157-161). 

Proposition 7.77. Let C be a ★- category. 

(i) If x e Kq(C), then x — [A] — [B]for A, B e obj(C). 

(ii) If A, B e obj(C), then [A] = [B] in Kq(C) if and only if there exists C e obj(C) 
with A ★ C = B * C. 

Proof, (i) Since Kq(C) is generated by |obj(C)|, we may write 

r s 

* = £ [A { ]- £[*;]. 

i=1 7=1 

(we allow objects A, and If to be repeated). If we now define A = Aj ★ • • • ★ A r , then 

[A] = [A 1 *...*A r ] = ^[A { ]. 

i 

Similarly, define B = B\ ★ • • • ★ B s . It is now clear that x = [A] — [5]. 

15 There is a minor set-theoretic problem here, for a basis of a free abelian group must be a set and not a proper 
class. This problem is usually avoided by assuming that C is a small category, that is, the class obj(C) is a set. 
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(ii) If A*C = B*C, then [A * C] = [B ★ C] in K 0 (C). Hence, [A] + [C] = [B] + [C ], 
and the cancellation law in the abelian group Kq(C) gives [A] = [B], 

Conversely, if [A] = [B], then |B| — |A| e TZ and there is an equation in 'F(C): 


\B\ - |A| = £>i(|Xi *Yi\-\Xi \- |F,|) -J2 n j(\ u j * Vj | - \Uj\ -\Vj\), 
i j 

where the coefficients m, and n / are positive integers, and the X , F, U, and V are objects 
in C. Transposing to eliminate negative coefficients. 


\A\ + J2 m i\Xi*Yi\ + J2»j(\Uj\ + Wj\) = \B\ + J2 (I X,1 + | Y t |) + £ nj \ Uj *Vj\. 

i j i j 

This is an equation in a free abelian group, where expressions in terms of a basis are unique. 
Therefore, {A, X, * Y ,, 17 /, Vj }, the set of objects, with multiplicities, on the left-hand side, 
coincides with { B, U j* Vj , X ,•, T,}, the set of objects, with multiplicities, on the right-hand 
side. Since products are commutative and associative, there is an equivalence in C: 

A * (★,- nuiXi * Yt)) * (★; nj(Uj * Vj)) = B * (★, m^X, * T, j) * (★, njiUj * Vj)). 

An inspection of terms shows that 

(★; mi (Xi * Yi )) * njiUj * Vj)) = (★/ nijlX, * Yf)) * (★, njiUj * Vj)). 

If we denote this last object by C, then A*C = B*C. • 


Definition. Let R be a commutative ring, and let C be a subcategory of ^Mod. Two 
B-modules A and B are called stably isomorphic in C if there exists a module C e obj(C) 
with A © C = B © C. 

With this terminology. Proposition 7.77 says that two modules determine the same el¬ 
ement of a Grothendieck group if and only if they are stably isomorphic. It is clear that 
isomorphic modules are stably isomorphic; the next example shows that the converse need 
not hold. 

Example 7.78. 

(i) If Ab is the category of all finite abelian groups, then Exercise 5.10 on page 268 shows 
that two finite abelian groups are stably isomorphic in Ab if and only if they are isomorphic. 

(ii) If R is a commutative ring and F is a free B-module of infinite rank, then 
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Thus, R and R ® R are nonisomorphic modules that are stably isomorphic in /{Mod. 
Because of examples of this type, we usually restrict ourselves to subcategories C of r Mod 
consisting of finitely generated modules. 

(iii) Here is an example, due to R. G. Swan, in which stable isomorphism of finitely gener¬ 
ated projective modules does not imply isomorphism. 

Let R — R[xi, ..., x„]/(l — £© xf) [the coordinate ring of the real (n — l)-sphere]. 
Regard R" as n x 1 column vectors, and let X = (3c i,..., x,,)' e R". where bar denotes 
coset mod (1 — £© xf) in R. Define X: R —> R" by X: r i->- rX, and define (p: R n -> R 
by (p(Y ) = X'Y. Note that the composite <pX: R —> R is the identity, for 

<pX(r) = <p(rX ) = X’rX = r, 

because X' X — J© 3c j = 1. It follows that the exact sequence 

0-» R R n R n /imX -> 0 
splits. Thus, if P = R"/ im /,, then 

R © R"~ l = R" = R © P, 

and P is stably isomorphic to the free /©module R"~ l (of course, P is a projective R- 
module). Using topology. Swan proved that P is a free /©module if and only if n — 1,2,4 
or 8. If n — 3, for example, then P is not isomorphic to /©~ 1 . -4 


Proposition 7.79. If C is the category of all finite abelian groups, then Kq(C) is a free 
abelian group with a basis B consisting of all the cyclic primary groups. 

Proof. By the basis theorem, each finite abelian group A = J© C,, where each C, is a 
cyclic primary group. Thus, [A] = ^ ( [C,] in Kq(C). Since every element in Kq(C) is 
equal to [A] — [Z?], for finite abelian groups A and B , it follows that B generates KifC). 
To see that B is a basis, suppose that t n j[Cj] — 0, where m, and 

nj are positive integers. Then £©[m,C ( ] = 12ji n jCj], where m,C ( - is the direct sum of 
nti copies of C/, and so = [12j n jC'j]- Therefore, 'f2 i niiCi and 12j n jC 

are stably isomorphic in C. By Example 7.78(i), nijCj = 12)=] n j^j- Finally, the 
fundamental theorem of finite abelian groups applies to give r = s, a permutation o e S, 
with C' a( ^ = Cj and nij — n a ^) for all i. Therefore, B is a basis of Zfo(C)- • 


Definition. If R is a commutative ring, then the subcategory Pr( R) of all finitely gener¬ 
ated projective /©modules is a ★-category (for the direct sum of two such modules is again 
finitely generated projective). We usually denote Kq(Pt(R)) by Kq(R) in this case. 



492 


Modules and Categories Ch. 7 


Example 7.80. 

We now show that Kq(R ) = Z if R is a commutative ring for which every finitely generated 
projective R -module is free. It is clear that Ko(R) is generated by | R], so that Kq(R) is 
cyclic. Define r : obj(Pr( R)) -> Z by r(F) — ranl<( F), where F is a finitely generated 
free R -module. Since r(F ® F') — r(F ) + r(F'), Exercise 7.58 on page 498 shows 
that there is a homomorphism 7: Ko(R) -> Z with 7([,F]) = rank(F) for every finitely 
generated free F . Since Kq(R) is cyclic, 7 is an isomorphism. < 

If C is a category of modules, there is another Grothendieck group K'(C) we can define. 

Definition. If C is a category of modules, define F(C)\ to be the free abelian group with 
basis |obj(C), and 1Z' to be the subgroup of fF(C) generated by all elements of the form 

|B| — | A | — \C\ if there is an exact sequence 0->A->B->C->0. 

The Grothendieck group K'(C) is the abelian group 

K'(C) = T(C)/n'\ 

that is, K'{C) is the abelian group with generators |obj(C)| and relations TV. For any 
module A e obj(C), we denote the coset |A| + 1Z' by (A). 

Example 7.81. 

If R is a domain and a e R is neither 0 nor a unit, there is an exact sequence 

0 -> R R -> R/Ra 0, 
where p, a : r i->- ar. Thus, there is an equation in K f (C): 

(R) = (R) + (R/Ra). 


Hence, (R/Ra) —0. ◄ 

The next proposition uses the observation that the two notions of Grothendieck group— 
Kq(R) = k'o(Pr( R)) and K'(Vr(R)) —coincide. The reason is that there is an exact se¬ 
quence 0-*A-*A®C-*C->0, so that (A ® C) — (A) + (C) in K'(C). 

Proposition 7.82. If R is a commutative ring and C is the category of all finitely generated 
R-modules, then there is a homomorphism 

£■ Kq(R) —> K'(C) 

with s: [P] I —>■ (P) for every projective R-module P. 

Proof Since every short exact sequence of projective modules splits, the relations defin¬ 
ing Kq(R) — k'o(Pr( R)) are the same as those defining K'(Vr(R)). Hence, the inclusion 
map ^(PrfP)) —> T(C) induces a well-defined homomorphism. • 
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Proposition 7.83. Let R be a commutative ring and let C be the category of all finitely 
generated R-modules. If M e obj(C) and 


M = M 0 3Mi2M 2 2"OM„ = {0} 


has factor modules Q , = then 

(M) = (gi) H-b (Q n ) in K\C). 

Proof. Since Qi — Mi-i/Mj, there is a short exact sequence 

0 Mi M\-\ -» Qi -> 0, 

so that (QQ — (M,- 1 ) — (Mi) in K'(C). We now have a telescoping sum: 

n n 

£(Gi) = - (Mi)] = (Mo) - (M n ) = (M). . 

i= 1 1=1 

The next obvious question is how to detect when an element in K'(C) is zero. 

Proposition 7.84. Let R be a commutative ring and let C be the category of all finitely 
generated R-modules. If A, B e obj(C), then (A) — ( B) in K'(C) if and only if there are 
C, U, V € obj(C) and exact sequences 

0-»t/^A©C-»V->0 and 0->t/->B©C-*V->0. 

Proof. If there exist modules C, II. and V as in the statement, then 

(A © C) = (U) + ( V ) = (B © C). 

But exactness ofO->A—>A©C->C-»0 gives (A © C) = (A) + (C). Similarly, 
(B © C) = (B) + (C), so that (A) + (C) = (B) + (C) and (A) = (B). 

Conversely, if (A) = (B), then |A| — |B| e TV. As in the proof of Proposition 7.77, 
there is an equation in . F(C): 

IA | + J2\*i\ + ^(|Pj| + \YJ\) = \B\ + ^(|X(| + \X'!\) + \Yj \, 

where 0 —» Xj -> X ,■ —>■ X" -> 0 and 0 —> Fj —>■ F ; - -> Y'! -¥ 0 are exact sequences. 
Define 

c = A © J2 Xi © J2 (Y j ® up¬ 
setting X' to be the direct sum of the X' r X to be the direct sum of the Xi, and so forth, 
the argument in the proof of Proposition 7.77 gives 

A © X © Y' © Y" = B © X' © X" © Y. 
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By Exercise 7.12 on page 440, this isomorphism gives rise to exact sequences 
0 -> X' © Y" -* X © Y" X" -* 0, 

0 -» X' © Y" -> (X © Y") © Y' -> X" © Y' -* 0, 
and 

0 Y") ->A® (X" © Y') -+ 0. 

The middle module is C. Applying Exercise 7.12 once again, there is an exact sequence 

0 -> X' © Y' -> B © C B © (A © X" © Y") 0. 

Define U = X' © Y' and V — B © A © X" © Y"\ with this notation, the last exact sequence 
is 

Similar manipulation yields an exact sequence 0->f/—>A©C->V->0. • 

In Chapter 8, we will prove a module version of Theorem 5.52, the Jordan-Holder 
theorem. For now, we merely give a definition. 

Definition. A sequence in a category C of modules, 

M=M 0 3Mi3M 2 2"OM„ = {0}, 

is called a composition series of M if each of its factor modules Q, = isasimple 

module in obj(C). We say that a category C of modules is a Jordan-Holder category if: 

(i) Each object M has a composition series; 

(ii) For every two composition series 

M = M 0 3Mi3M 2 3"OM„ = {0} 


and 

m = = {oj, 

we have m — n and a permutation o e S n such that Q'j = Q a j for all /, where 
Q, = /W«-i /Mi and Q) = M' j _ l /M). 

Define the length l(M) of a module M in a Jordan-Holder category to be the number n 
of terms in a composition series. If the simple factor modules of a composition series are 
0i,, Qn , we define 

jh(Af) = 0i ©•••© Q„. 

A composition series may have several isomorphic factor modules, and jh(M) records their 
multiplicity. 
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Lemma 7.85. Let C be a Jordan-Holder category, and let Qi,..., Q n , Q\...., Q' m be 
simple modules in obj (C). 

(i) If 

Qx®---®Qn = Q\®---®Q' m , 

then m = n and there is a permutation a e S n such that Q'- = Q a j for all j, where 
Qi = Mi-i/Mi and Q'. = 

(ii) If M and M' are modules in obj(C), and if there is a simple module S e obj(C) with 

S © jh(M) = 5©jh(M'), 


then )h(M) = jh (M'). 

Proof, (i) Now 

Ql © • • • © Qn 2 Ql © • • • © Qn 2 Ql © • • • © Qn 2 ■ ■ ■ 

is a composition series with factor modules Q\,..., Q„\ similarly, the isomorphic module 
Q\ © • • • © Q' m has a composition series with factor modules Qf v .... Q' m . The result 
follows from C being a Jordan-Holder category. 

(ii) This result follows from part (i) because S is simple. • 


Lemma 7.86. If 0 —> A —> B —>■ C —> 0 is an exact sequence in a Jordan-Holder 
category, then 

jh(B)=jh(A)©jh(C). 

Proof. The proof is by induction on the length 1(C). Let A = Ao 5 A\ 3 • • • 3 A n = 
{0} be a composition series for A with factor modules Q i,..., Q n . If 1(C) = 1, then C is 
simple, and so 

£3A3Ai 3 ■ ■ O A„ = (0) 

is a composition series for B with factor modules C, Qi,.... Q„. Therefore, 

jh(fi) = C © Qi © • • • © Q n = jh(C) © jh(A). 

For the inductive step, let £(C ) > 1. Choose a maximal submodule C\ of C (which 
exists because C has a composition series). If v: B -> C is the given surjection, define 
B i = v _1 (Ci). There is a commutative diagram (with vertical arrows inclusions) 

0 - A ->■ B — V -^C - ^0 

AAA 

0-> A-^ Bi -^ Cl -^ 0 
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Since C\ is a maximal submodule of C, the quotient module 


C" = C/Ci 


is simple. Note that B/B\ = (B / A)/(B\/A) = C/Cj = C". By the base step, we have 
jh(C) = C" ® jh(Ci) and jh(B) = C" © jh(fii). 

By the inductive hypothesis. 


jh(Bi) =jh(A)®jh(Ci). 


Therefore, 


jh(B) = C" ® jh(Z?r) 

= C" © jh(A) © jh(Cj) 

= jh(A) © C" © jh(Cj) 

= jh(A) © jh(C). . 

Theorem 7.87. Let C be a category of modules in which every module M e obj(C) has 
a composition series. Then C is a Jordan-Holder category if and only if K'(C) is a free 
abelian group with basis the set B’ of all (5) as S varies over all nonisomorphic simple 
modules in obj(C). 

Proof Assume that K'(C) is free abelian with basis B 1 . Since 0 is not a member of a 
basis, we have ( S ) f 0 for every simple module .S'; moreover, if S ^ S', then ( S) f (S'), 
for a basis repeats no elements. Let M e obj(C), and let Q\,, Q„ and Q' v ..Q' m be 
simple modules arising, respectively, as factor modules of two composition series of M. 
By Proposition 7.83, we have 


(Gt) + • • • + (Qn) = (M) = (Gi) + • • • + (Q'J- 


Uniqueness of expression in terms of the basis B' says, for each Q' ., that there exists Q t 
with (Qj) = ( Q'. ); in fact, the number of any (Qi) on the left-hand side is equal to the 
number of copies of (Q'j) on the right-hand side. Therefore, C is a Jordan-Holder category. 

Conversely, assume that the Jordan-Holder theorem holds for C. Since every M e 
obj(C) has a composition series. Proposition 7.83 shows that B' generates K'(C). Let S be 
a simple module in obj(C). If ( S ) = (T), then Proposition 7.84 says there are C,U,V e 
obj(C) and exact sequences 0 —> U -> 5©C -> V -> 0 and 0 -> U -> T ©C —> V -> 0. 
Lemma 7.86 gives 

jh(S) ©jh(C) = jh(£7) © jh(V) = jh(T) © jh(C). 

By Lemma 7.85, we may cancel the simple summands one by one until we are left with 
S = T, a contradiction. A similar argument shows that if S is a simple module, then 
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( S ) f 0. Finally, let us show that every element in K'(C ) has a unique expression as a 
linear combination of elements in £>'. Suppose there are positive integers m, and n / so that 

Y J >n l (S l )-Y j n j (T j ) = {), ( 1 ) 

' j 

where the .S', and Tj are simple modules in obj(C) and .S', ^ Tj for all i, j. If we denote 
the direct sum of mi copies of S ; by then Eq. (1) gives 

(Y. m ’ S ‘) - (J2 n J T j)- 

i j 

By Proposition 7.84, there are modules C,U,V and exact sequences 

0 -* U -* C V 0 and 0^ U C© ^njTj V 0, 

i j 

and Lemma 7.86 gives 

jh(y]m,S') =jhQ>;7)). 

i j 

By Lemma 7.85, some Sj occurs on the right-hand side, contradicting S, ^ Tj for all i. j. 
Therefore, Eq. (1) cannot occur. • 

Remark. A module M is called indecomposable if there do not exist nonzero modules A 
and B with M = A © B. We say that a category C of modules is a Krull-Schmidt category 
if: 

(i) Each module in obj(C) is isomorphic to a finite direct sum of indecomposable mod¬ 
ules in obj(C); 

(ii) If 

Dt © • • • © = D[ ©•••©£>;, 

where all the summands are indecomposable, then m — n and there is a permutation 
a e S n with D'j = D a j for all j. 

There is a theorem analogous to Theorem 7.87 saying that a category C of modules is a 
Krull-Schmidt category if and only if Kq(C) is a free abelian group with basis consisting 
of all 1 1 )] as D varies over all nonisomorphic indecomposable modules in obj(C). ◄ 

Compare the next corollary with Proposition 7.79. 

Corollary 7.88. IfC is the category of all finite abelian groups, then K'(C) is the free 
abelian group with generators all (5), where S is a cyclic group of prime order p. 

Proof. By Theorem 5.52, the category of all finite abelian groups is a Jordan-Holder 
category, and the simple Z-modules are the abelian groups Ip for primes p. • 
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H. Bass defined higher groups K\ and Ki, proved that there is an exact sequence relating 
these to Kq, and showed how these groups can be used to study projective modules (see 
Milnor, Introduction to Algebraic K-Theory). D. Quillen constructed an infinite sequence 
K n (C ) of abelian groups by associating a topological space X(C) to certain categories C. 
He then defined K n (C) = n„+i (X(C)) for all n > 0, the homotopy groups of this space, 
and he proved that his K n coincide with those of Bass for n — 0, 1, 2 (see Rosenberg, 
Algebraic K-Theory and Its Applications ). 

Exercises 


7.58 Let C be a *-category. Prove that Kq(C) solves the following universal mapping problem. 

obj(C) * > K 0 (C), 



where G is any abelian group. If h: obj(C) -» Kq(C) is the function A [A], and if 
/: obj(C) —>• G satisfies /(A) = f(B ) whenever A = B and /(A * B) = /(A) + /(B), 
then there exists a unique homomorphism /: Kq(C) —> G making the diagram commute. 

7.59 Regard C = PO(N) as a *-category, where m *n = m + n, and prove that Kq(C ) = Z. (Thus, 
we have constructed the integers from the natural numbers. In a similar way, we can construct 
an abelian group G from a semigroup 5, although we cannot expect that S is always imbedded 
in G.) 

7.60 (i) If C is a category of modules having infinite direct sums of its objects, prove that 

*o(C) = {0}. 

(ii) ( Eilenberg ) Prove that if P is a projective R-module (over some commutative ring R), 
then there exists a free R-module Q with P © Q a free /{-module. Conclude that 
K 0 {C) — {0} for C the category of countably generated projective R-modules. 

Hint. Q need not be finitely generated. 

7.61 Prove that //)(!(,) = Z © Z. 

7.62 Let C and C' be *-categories, and let F : C —> C be a *-preserving functor; that is, F(A * B) = 

F(A) * F(B). Prove that F induces a homomorphism Kq(C ) —>• Kq(C') by [A] [FA]. 

7.63 If C is a category of modules, prove that every element in K'(C ) has the form (A) — (B) for 
modules A and B in obj(C). 

7.64 Let C be a category having short exact sequences. Prove that there is a surjection Kq(C ) —> 
K'(C). 


7.6 Limits 


There are two more general constructions, one generalizing pullbacks and intersections, 
the other generalizing pushouts and unions; both involve a family of modules {M, : i el] 
whose index set / is a partially ordered set. 
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Definition. Let I be a partially ordered set. An inverse system of R-modules over I is 
an ordered pair {Mj, i// 7 } consisting of an indexed family of modules {M, : i e 7} together 
with a family of morphisms [f- : M j -» M,} for i < j, such that i/rf = 1^, for all i and 
such that the following diagram commutes whenever i < j < k: 

M k -'-*- Mi 



Mj 


In Example 7.25(v), we saw that a partially ordered set I defines a category PO(7): The 
objects of PO(7) are the elements of 7 and Homli, j ) is empty when i f j while it contains 
exactly one element, k 1 -, whenever i < j. If we define F(i) = Mj and F(k‘.) — x[r! , then 

it is easy to see that {Mj, ij/j ) is an inverse system if and only if F : PO(7) -> ^Mod is a 
contravariant functor. We now see that inverse systems involving objects and morphisms 
in any category C can be defined: Every contravariant functor F: PO(7) —> C yields one. 
For example, we can speak of inverse systems of commutative rings. 

Example 7.89. 

(i) If 7 = {1,2, 3} is the partially ordered set in which 1^2 and 1^3, then an inverse 
system over 7 is a diagram of the form 


B 


A 

I 8 


J 


^ c 


(ii) If X is a family of submodules of a module A , then it can be partially ordered under 
reverse inclusion ; that is, M < M' in case M 3 M'. For M < M', the inclusion map 
M' —> M is defined, and it is easy to see that the family of all M e X with inclusion maps 
is an inverse system. 

(iii) If 7 is equipped with the discrete partial order, that is, i < j if and only if i — j , then 
an inverse system over 7 is just an indexed family of modules. 

(iv) If N is the natural numbers with the usual partial order, then an inverse system over N 
is a diagram 

M 0 •<— M\ <— Mi *— • • • . 


(v) If J is an ideal in a commutative ring R, then its nth power is defined by 

J n — ai ■ ■ ■ a n : cij e /}. 
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Each J n is an ideal and there is a decreasing sequence 

R D / D J 2 D J 3 D • • • . 

If A is an R -module, there is a sequence of submodules 

A3/A3 J 2 A D / 3 A 2 ••• . 

If m > n, define i/C : A/J m A A/J n A by 

1/C: + J m A h* a + J n A 

(these maps are well-defined, for m > n implies J'" A C J n A). It is easy to see that 

{A/J n A, VC) 

is an inverse system over N. 

(vi) Let G be a group and let A f be the family of all the normal subgroups N of G having 
finite index partially ordered by reverse inclusion. If N < N' in J\f, then /V' < /V; define 
VC : G/N' —> G/N by gN' h->• gN. It is easy to see that the family of all such quotients 
together with the maps i/C form an inverse system over Af. ◄ 


Definition. Let I be a partially ordered set, and let {M, , xjrj) be an inverse system of 
/^-modules over I . The inverse limit (also called projective limit or limit ) is an /(-module 
lim Mj and a family of /(-maps {a,- : lhn M\ -» M\ : i e /}, such that 

(i) V '/otj = a, whenever i < j; 

(ii) for every module X having maps j]: X M, satisfying i lr{ fj — fi for all i < j, 
there exists a unique map 9: X —> fim Mj making the following diagram commute: 

lim Mj < 6 x 



The notation lim M, for an inverse limit is deficient in that it does not display the maps 
of the corresponding inverse system (and limM, does depend on them). However, this is 
standard practice. 

As with any object defined as a solution to a universal mapping problem, the inverse 
limit of an inverse system is unique (to isomorphism) if it exists. 
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Proposition 7.90. The inverse limit of any inverse system { M ;, fj } of R-modules over a 
partially ordered index set I exists. 

Proof. Define 

L = {(mi) e ]~[m, : nij = fj (m /) whenever / < j}\ 

it is easy to check that L is a submodule of ]~[ ( M,. If p t is the projection of the product to 
Mi. define a,: L —> Mj to be the restriction pi\L. It is clear that i fj a j — ai. 

Assume that X is a module having maps f : X —>■ M,- satisfying fj fj = f for all 
/ < j. Define 0: A -> ]”[ M, by 

6(x) - (fi(x)). 

That im 6 C L follows from the given equation fj fj — f for all i < j. Also, 9 makes 
the diagram commute: oij9: x i-> (fj (x)) i-> fi(x). Finally, 9 is the unique map X —> L 
making the diagram commute for all / ■< j. If (p: X -* L. then (p(x) — (nij) and 
<Xj(p(x) — «!/. Thus, if (p satisfies aj(p(x ) = f)(x) for all / and all x, then m, = fi(x), and 
so q> — 9. We conclude that L = lim Mj. • 

Inverse limits in categories other than module categories may exist; for example, inverse 
limits of commutative rings exist, as do inverse limits of groups or of topological spaces. 

The reader should supply verifications of the following assertions in which we describe 
the inverse limit of each of the inverse systems in Example 7.89. 

Example 7.91. 

(i) If I is the partially ordered set {1, 2, 3} with 1^3 and 2^3, then an inverse system is 
a diagram 


A 



and the inverse limit is the pullback. 

(ii) We have seen that the intersection of two submodules of a module is a special case 
of pullback. Suppose now that X is a family of submodules of a module A. so that X 
and inclusion maps is an inverse system, as in Example 7.89(h). The inverse limit of this 
inverse system is Ha/sX M. 

(iii) If I is a discrete index set, then the inverse system {Mj : i e 1} has the product |~[ M, 
as its inverse limit. Indeed, this is just the diagrammatic definition of a product. 

(iv) If J is an ideal in a commutative ring R and M is an /^-module, then the inverse limit 
of {M/ J" M, f’f | [in Example 7.89(v)] is usually called the J-adic completion of M; let 
us denote it by M. In order to understand the terminology, we give a rapid account of a 
corner of point-set topology. 
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Definition. A metric space is a set A equipped with a function d : X x X —> 1, called a 
metric , that satisfies the following axioms. For all x, y, z € X, 

(i) d(x,y) > 0 and d(x, y) — 0 if and only if x — y; 

(ii) d(x, y) — d(y , x); 

(iii) (Triangle inequality) d(x, y ) < c/(x, z) + d(z■ y). 

For example, d(x,y) = \x — y| is a metric on R. Given a metric space X, the usual 
definition of convergence of a sequence makes sense: A sequence (x n ) of points x n in X 
converges to a limit y e X if, for every e > 0, there is N so that d (x n . y) < e whenever 
n > N\ we denote (x„) converging to y by 

x n -> y. 

A difficulty with this definition is that we cannot tell if a sequence is convergent without 
knowing what its limit is. A sequence (x„) is a Cauchy sequence if, for every e > 0, there 
is N so thatd(x m , x„) < f whenever in, n > N. The virtue of this condition on a sequence 
is that it involves only the terms of the sequence and not its limit. If X = R, then a sequence 
is convergent if and only if it is a Cauchy sequence. In general metric spaces, however, we 
can prove that convergent sequences are Cauchy sequences, but the converse may be false. 
For example, if X consists of the positive real numbers, with the usual metric \x — y\, then 
the sequence (1 /n) is a Cauchy sequence, but it does not converge in X because 0 ^ X. 

Definition. A completion A of a metric space A is a metric space with the following two 
properties: 

(i) A is a dense subspace of A; that is, for every x e A, there is a sequence (x„) in A with 
x„ x; 

(ii) every Cauchy sequence in A converges to a limit in A. 

It can be proved that any two completions of a metric space A are isometric (there is a 
bijection between them that preserves the metrics), and one way to prove existence of A 
is to define its elements as equivalence classes of Cauchy sequences (x„) in A, where we 
define (x„) = (}•„) if d(x„, y„) -> 0. 

Let us return to the inverse system {M/ J"M, ijr™ }. A sequence 

(c?i + JM, a 2 + J 2 M , a 3 + 7 3 M, ...) e lim(M//"M) 

satisfies the condition ^r"'(a m + J m M) = a m + J"M for all m > n, so that 

a m — a n e J n M whenever m > n. 

This suggests the following metric on M in the (most important) special case when 
nr=i J n M — {0}. If x e M and x ^ 0, then there is i with x e J‘M and x ^ J ,+l M\ 
define ||x|| = 2~‘ ; define ||0|| = 0. It is a routine calculation to see that d (x, y) — ||x — y|| 
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is a metric on M (without the intersection condition, ||x|| would not be defined for a non¬ 
zero a - e n~i J"M). Moreover, if a sequence (a„) in M is a Cauchy sequence, then 
(a i + J M, 02 + J 2 M, a?, + J 3 M, ...) e lim M/ J n M, and conversely. 

In particular, when M — Z and J — ( p ), where p is a prime, then the completion Z p is 
called the ring of p-adic integers. It turns out that Z p is a domain, and Q p = Frac(Z p ) is 
called the field of p-adic numbers. 

(v) We have seen, in Example 7.89(vi), that the family A F of all normal subgroups of finite 
index in a group G forms an inverse system; the inverse limit of this system, lim G/N, 
denoted by G, is called the profinite completion of G. There is a map G —> G, namely, 
g m* (gN), and it is an injection if and only if G is residually finite ; that is, f~j.vc.V' ^ = 
{1}. It is known, for example, that every free group is residually finite. 

There are some lovely results obtained making use of profinite completions. If r is a 
positive integer, a group G is said to have rank r if every subgroup of G can be generated 
by r or fewer elements. If G is a residually finite p-group (every element in G has order 
a power of p) of rank r, then G is isomorphic to a subgroup of GL(n, Z p ) for some n 
(not every residually finite group admits such a linear imbedding). See Dixon-du Sautoy- 
Mann-Segal, Analytic Pro-p Groups, page 98. ◄ 

The next result generalizes Theorem 7.32. 

Proposition 7.92. If {Mi, } is an inverse system, then 

Hom(A, limM,) = limHom(A, M,) 

for every module A. 

Proof. This statement follows from inverse limit being the solution of a universal map¬ 
ping problem. In more detail, consider the diagram 



Hom(A, Mj) 


where the f, are the maps given in the definition of inverse limit. 

To see that 9: Hom(A, lim M,) —> limHom(A, M, ) is injective, suppose that /: A —» 
lim Mi and 9(f) — 0. Then 0 = fit Of = a,/ for all/, and so the following diagram 
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commutes: 



But the zero map in place of / also makes the diagram commute, and so the uniqueness of 
such a map gives f — 0; that is, 0 is injective. 

To see that 0 is surjective, take g e limHom(A, M, ). For each i, there is a map 

Pig: A Mi with ty{Pig = fjg. 



The definition of limM; provides a map g': A —> limM; with a,g' — p,g for all i. It 
follows that g — 0(g'y, that is, 6 is surjective. • 

We now consider the dual construction. 


Definition. Let I be a partially ordered set. A direct system of R-modules over I is an 
ordered pair {M;, <//•} consisting of an indexed family of modules { M, : i e 1} together 
with a family of morphisms : M; -* Mj} for i < j, such that <p\ = 1m, for all i and 
such that the following diagram commutes whenever i < j < k: 



If we regard I as the category PO(7) whose only morphisms are k‘- when i < j, and if 
we define F(i) — Mj and F(k'j) — (p’j, then it is easy to see that {M;, <//.} is a direct system 
if and only if F: PO(/) -> r Mod is a (covariant) functor. Thus, we can consider direct 
systems involving objects and morphisms in any category C as being a (covariant) functor 
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F : PO(/) -* C. For example, it makes sense to consider direct systems of commutative 
rings. 

Example 7.93. 

(i) If I — {1, 2, 3} is the partially ordered set in which 1^2 and 1^3, then a direct 
system over / is a diagram of the form 


A 


g 


B 


f l 

C 


(ii) If X is a family of submodules of a module A , then it can be partially ordered under 
inclusion; that is, M < M' in case M c M'. For M < M' , the inclusion map M —»• M' 
is defined, and it is easy to see that the family of all Mel with inclusion maps is a direct 
system. 

(iii) If I is equipped with the discrete partial order, then a direct system over I is just a 

family of modules indexed by I. ■* 

Definition. Let I be a partially ordered set, and let {Mj, (p l -} be a direct system of 
/^-modules over I. The direct limit (also called inductive limit or colimit ) is an R -module 
limM, and a family of /?-maps {a, : Mj —»■ lim M, : i e /}, such that 

(i) oijcp'j — aj whenever i < j\ 

(ii) for every module X having maps f, : Mj —> X satisfying fjcp'j = fj for all i < j, 
there exists a unique map 0 : lim Mj —»■ X making the following diagram commute; 

lim Mj 6 > \ 



The notation lim M, for a direct limit is deficient in that it does not display the maps 
of the corresponding direct system (and lim Mj does depend on them). However, this is 
standard practice. 

As with any object defined as a solution to a universal mapping problem, the direct limit 
of a direct system is unique (to isomorphism) if it exists. 
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Proposition 7.94. The direct limit of any direct system {Mi, (pi } of R-modules over a 
partially ordered index set I exists. 

Proof For each i e I, let /., be the injection of M,- into the sum Jfj M ( . Define 

D=(j2 M i)/S, 

i 

where S is the submodule of Mi generated by all elements Xjcp'jnij —A.,m; with w, e M; 
and i < j. Now define a,: M, —> I) by 

a ,: m, i —> Ai(mj) + S. 

It is routine to check that D = lim M,. • 

Thus, each element of lim M, has a representative of the form ^ /.,/«, + S. 

The argument in Proposition 7.94 can be modified to prove that direct limits in other cat¬ 
egories exist; for example, direct limits of commutative rings, of groups, or of topological 
spaces exist. 

The reader should supply verifications of the following assertions, in which we describe 
the direct limit of some of the direct systems in Example 7.93. 

Example 7.95. 

(i) If I is the partially ordered set {1,2, 3} with 1^2 and 1^3, then a direct system is a 
diagram 



C 

and the direct limit is the pushout. 

(ii) If I is a discrete index set, then the direct system is just the indexed family {M, : ; e /}, 
and the direct limit is the sum: limM; = ; Mf for the submodule S in the construction 

of hrn Mi is {0}. Alternatively, this is just the diagrammatic definition of a coproduct. ◄ 

The next result generalizes Theorem 7.33. 

Proposition 7.96. If {Mi, (pi) is a direct system, then 

Hom( lim M,, B) = lim Hom(M,, B) 

for every module B. 

Proof This statement follows from direct limit being the solution of a universal mapping 
problem. The proof is dual to that of Proposition 7.92 and it is left to the reader. • 

There is a special kind of partially ordered index set that is useful for direct limits. 
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Definition. A directed set is a partially ordered set I such that, for every i, j e I. there 
is k e I with i < k and j < k. 

Example 7.97. 

(i) Let T be a simply ordered family of submodules of a module A; that is, if M, M' el, 
then either MCM'or M' C M. As in Example 7.93(h), 1 is a partially ordered set; here, 
X is a directed set. 

(ii) If I is the partially ordered set {1,2, 3} with 1^2 and 1^3, then I is not a directed 
set. 

(iii) If {Mi : i el} is some family of modules, and if I is a discrete partially ordered index 
set, then I is not directed. However, if we consider the family T of all finite partial sums 

M h © • • • © Mj n , 

then T is a directed set under inclusion. 

(iv) If A is a module, then the family Fin(A) of all the finitely generated submodules of A 
is partially ordered by inclusion, as in Example 7.93(h), and it is a directed set. 

(v) If R is a domain and Q — Frac( R). then the family of all cyclic //-submodules of Q of 
the form (1/r), where r e R and r f 0, is a partially ordered set, as in Example 7.93(h); 
here, it is a directed set under inclusion, for given (1/r) and (1 js ), then each is contained 
in (1 /rs). 

(vi) Let U be the family of all the open intervals in R containing 0. Partially order U by 
reverse inclusion: 

U < V if VCf/. 

Notice that U is directed: Given U, V e U. then U fl V e U and U < U fi V and 

v <u nv. 

For each U eU, define 

T(U) — {/: U —*■ R : / is continuous), 

and, if U < V, that is, V C U, define Py : T{U) —> T{V) to be the restriction map 
/ i-> f\V. Then [T(U), Py) is a direct system. ◄ 

There are two reasons to consider direct systems with directed index sets. The first is 
that a simpler description of the elements in the direct limit can be given; the second is that 
lim preserves short exact sequences. 

Proposition 7.98. Let {M ,■, (p ‘.} be a direct system of left R-modules over a directed index 
set /, and let A,-: M; -> ^ M\ be the ith injection, so that limM,- = (^ Mj)/S, where 

S — [Xjcplmi — \jtn( : m; e M, and i < j). 

(i) Each element of lim M\ has a representative of the form Xprii + S ( instead of 
Jfi kjirii + S). 
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(ii) 7/777/ + 5 = 0 if mid only if <p\ (m/) — 0 for some t > i. 

Proof (i) As in the proof Proposition 7.94, the existence of direct limits, Jim A// = 
(ff M;)/S, and so a typical element x e lim +/, has the form x = 7,m/ + 5. Since I 

is directed, there is an index j with j > i for all i occurring in the sum for x. For each 
such i, define b 1 — (plnij e Mj, so that the element h, defined by b = b' lies in M/. It 
follows that 


^2 7 / 777 / - Xjb — 7 / 777 / - Xjb ‘) 

= 7 / 777/ - Xjip'jini) e S. 

Therefore, x = ^ A/m, +5 = 7,77 + 5, as desired. 

(ii) If (p' t nii — 0 for some t > i, then 


7/777/ + 5 = A/m, + {X t (p l t mi — Xiirij ) + 5 = 5. 


Conversely, if A,//;, +5 = 0, then 7,m/ e 5, and there is an expression 

a/7W/ = aj(Xk(p } k mj — Xjtiij) e 5, 

j 

where a ,• e R. We are going to normalize this expression; first, we introduce the following 
notation for relators: If j < k, define 

r(j, k , 777 j) = X k (p J k mj - X/mj. 

Since ajr(j, k , mj) = r(j, k , ajmj), we may assume that the notation has been adjusted 
so that 

j 

As I is directed, we may choose an index tel larger than any of the indices i, j,k 
occurring in the last equation. Now 

X,(p' t nij = (X t (p' t mi - Xiiiii) + 7/777/ 

= r(i, f, 777/) + 7/777/ 

= r(i, f, 777/) + r(j, k , mj ). 

j 


r(j , k , mj) = X k (p J k mj - 7 jmj 

= (X t (pj m j - X jin j ) + ^7,(^7^ ( (p'l^tii j) - X k (-^m ; -)] 
= r(j,t,mj) +r(k,t , ~(p ] k mj), 


Next, 
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because (p((pl = (p' t , by definition of direct system. Hence, 

X t (p\nii = ^ r(l, t, xt), 

l 

where xt e Mg. But it is easily checked, for l < t, that 


r{t, t, mi) + r(t , t, m'f) = r(£, t, mi + m f t ). 

Therefore, we may amalgamate all relators with the same smaller index t and write 

X,(p' r mj — ^ r (Z, L xe) 
t 

— A , t <pfxt - Xixi 

t 

= h(^<Pt x i) 

i t 

where xt e Mi and all the indices l are distinct. The unique expression of an element in 
a direct sum allows us to conclude, if l ^ t, that X(Xt = 0; it follows that xt — 0, for Xt 
is an injection. The right side simplifies to X t (p’ t m r — X t m t = 0, because ip\ is the identity. 
Thus, the right side is 0 and X t <p\mi — 0. Since X t is an injection, we have (p\mj = 0, as 
desired. • 

Our original construction of lim M, involved a quotient of ^ M,; that is, lim M, is a 
quotient of a coproduct. In the category Sets, coproduct is disjoint union |_] ( M, . We may 
regard a “quotient” of a set X as the family of equivalence classes of some equivalence 
relation on X. This categorical analogy suggests that we might be able to give a second 
construction of limM, using an equivalence relation on |_| ( M\. When the index set is 
directed, this can actually be done (see Exercise 7.65 on page 517). 

Example 7.99. 

(i) Let X be a simply ordered family of submodules of a module A; that is, if M, M' el, 
then either M C M' or M' C M. Then I is a directed set, and limM; = [J ( - M\. 

(ii) If {Mj : i e 1} is some family of modules, then T. all finite partial sums, is a directed 
set under inclusion, and lim M,- = M,. 

(iii) If A is a module, then the family Fin(A) of all the finitely generated submodules of A 
is a directed set and limM,- = A. 

(iv) If R is a domain and Q — Frac( R ), then the family of all cyclic /^-submodules of 
Q of the form (1 /r), where r e R and r ^ 0, forms a directed set under inclusion, and 
limM, = Q\ that is, Q is a direct limit of cyclic modules. ◄ 
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Definition. Let {A,-, a'-} and {11 j. /S'.} be direct systems over the same index set I. A 
transformation r : {,4,, } -> {B ,. />'.} is an indexed family of homomorphisms 

r = {r t : A; -> B,} 

that makes the following diagram commute for all i < j: 

Ai B, 

a J Pj 

S' 

A j -s- B j 

j r . j 


A transformation r : {A,, a'-} —>■ {B;, /S'} determines a homomorphism 

F: lim A; —>■ lim B, 
by 

r: A/a,- + 5 i-> /X/r/a,- + 7\ 

where S C ^ 4/ and 7’ C ^ B, are the relation submodules in the construction of lim A, 
and lim B ( , respectively, and 7., and //,■ are the injections of A, and B, into the direct sums. 
The reader should check that r being a transformation of direct systems implies that r is 
independent of the choice of coset representative, and hence it is a well-defined function. 


Proposition 7.100. Let I be a directed set, and let {A/,a4}, {B/, /S'.}, and {Cj-, y'j } 
foe direct systems over I. Ifr: {A ( -,al} —> {B;,/3'} and s: {B,■,/!'.} —>■ {C/.yj} are 
transformations, and if 

o-> A/ A B, 4c,^o 

A exact for each i e /, ffoeu ffoere A a« exact sequence 

0 —> lirnA/ \ limB; lirnC/ —>■ 0. 

Remark. The hypothesis that I be directed enters the proof only in showing that r is an 
injection. ■* 

Proof. We prove only that r is an injection, for the proof of exactness of the rest is routine. 
Suppose that r(x) = 0, where x e lim A, . Since I is directed. Proposition 7.98(i) allows 
us to write x = 7.,-a/ + S (where S C ^ A/ is the relation submodule and X, is the 
injection of A; into the direct sum). By definition, r(x + S) = /x/r, a, + T (where T C 
B, is the relation submodule and /x,- is the injection of B, into the direct sum). Now 
Proposition 7.98(h) shows that /u,r,a/ + T — 0 in limB, implies that there is an index 
k > i with (fjiUj = 0. Since r is a transformation of direct systems, we have 

0 = filriUj — r k a' k aj. 

Finally, since r k is an injection, we have a' k Ui — 0 and, hence, that x — X,a / + 5 = 0. 
Therefore, r is an injection. • 
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Example 7.101. 

Let U be the family of all the open intervals in R containing 0, partially ordered by reverse 
inclusion, and let {B(U), /3y} be the direct system of Example 7.97(vi), where 

B(U) = {/:{/-> K :/ is continuous} 


and ft: f \V. 

We now present two more direct systems over U. Define 

A(U) — {constant functions /: U —>• Z} 
and 

C(U) — {continuous /: U -» R — {0}}, 

the abelian group under pointwise multiplication. Then {A(L/), o/y } and {C(t/), Yv 1 are 
direct systems, where the a and y are restriction maps. 

Define transformations s: {B(U), fy} —► {C{U),Yy} by setting s(U): B{JJ ) -» 
C(U ) to be the map / i-> e 2ni f, and define r: {A{U),oty} —>■ {B(U), /3y} by setting 
r(U ): A(f/) B(U) to be the inclusion map. It is easy to see that 

0 -> A(U) ^ B(U ) C(U) -+ 0 
is exact for all U , and so Proposition 7.100 gives exactness of 

0 —> limA(f/) —>■ lwaB(U) —> limC(f/) —> 0. 

It is easy to check that lim A(U) = Z, and so lim B{U) ^0. ◄ 

There is a way to compare two functors. 


Definition. Let F : C -> V and G.C -> V be covariant functors. A natural transfor¬ 
mation is a family of morphisms r = {ty : FC —> GC}, one for each object C in C, so 
that the following diagram commutes for all /: C -> C' in C: 


Ff 

FC — » FC' 


TC 


r c 


Gf 

GC — GC' 


If each xy is an equivalence, then r is called a natural equivalence and F and G are called 
naturally equivalent. 

There is a similar definition of natural transformation between contravariant functors. 

The next proposition shows that the isomorphisms <y>m ■ Horn/A/L M) -> M in Exer¬ 
cise 7.5 on page 440 constitute a natural transformation. 
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Proposition 7.102. If R is a commutative ring, then Horn a ( R. M ) is an R-module, and 
the R-isomorphisms 

(pm '■ Horn/?(/?, M) —> M, 

given by (PM(f) — /(1), comprise a natural equivalence ip: Horn r(R, ) —» 1 r, the 
identity functor on a Mod. 


Remark. Proposition 8.85 generalizes this proposition to modules over noncommutative 
rings. ■* 

Proof It is easy to check that q>M is an additive function. To see that <pm is an R- 
homomorphism, note that 

= (r/)( 1) = f(lr) = f(r) = r[f( 1)] = r<p M {f), 

because / is an //-map. Consider the function M —> Hom/H/C M) defined as follows: 
If m e M , then f m : R —* M is given by f m ir) — rm\ it is easy to see that /,„ is an 
//-homomorphism, and that m f m is the inverse of <pm- 

To see that the isomorphisms <pm constitute a natural equivalence, it suffices to show, 
for any module homomorphism h: M —> N, that the following diagram commutes: 

Hom/((k, M ) —Homfl(/?, N) 

<PN 


where /z*: / hf. Let f:R->-M. Going clockwise, / i-^ hf i->- hf( 1), while going 
counterclockwise, / /(1) h(f( 1)). • 

An analysis of the proof of Proposition 7.92 shows that it can be generalized by re¬ 
placing Hom(A, ) by any (covariant) left exact functor F: a* Mod —> Ab that preserves 
products. However, this added generality is only illusory, for it is a theorem of C. E. 
Watts, given such a functor F, that there exists a module A with F naturally equivalent 
to HoiuaIA. ); that is, these representable functors are characterized. Another theorem 

of Watts characterizes contravariant functors: If G : a Mod —> Ab is a contravariant left 
exact functor that converts sums to products, then there exists a module B with G natu¬ 
rally equivalent to Homg( , 11). Proofs of Watts’s theorems can be found in Rotman, An 
Introduction to Homological Algebra, pages 77-79. 

Example 7.103. 

(i) In Proposition 7.100, we introduced transformations from one direct system over a 
partially ordered index set I to another. If we recall that a direct system of //-modules 
over I can be regarded as a functor PO(/) —> a Mod, then the reader can see that these 
transformations are natural transformations. 
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If we regard inverse systems over a partially ordered index set as contravariant functors, 
then we can also define transformations between them (as natural transformations). 

(ii) Choose a point p once for all, and let P = {/?}; we claim that Hom( P, ): Sets -* Sets 
is naturally equivalent to the identity functor on Sets. If X is a set, define 

r x : Hom(L, X) -> X by / h* f(p). 

Each T X is a bijection, as is easily seen, and we now show that r is a natural transforma¬ 
tion. Let X and Y be sets, and let /;: X —> F; we must show that the following diagram 
commutes: 

Horn (P, X) —^ Hom(P, Y) 



where /z*: / m hf. Going clockwise, / i-> hf i-> hf(p), while going counterclockwise, 
/ i-> f(p) h(f(p)). 

(iii) If A: is a field and V is a vector space over k. then its dual space V* is the vector space 
HomGV, k) of all linear functionals on V. The evaluation map e v : f m /( v ) is a linear 
functional on V*\ that is, e v e (V*)* = V**. Define ty : V —>• V** by 

xy: me,. 

The reader may check that r is a natural transformation from the identity functor on / Mod 
to the double dual functor. The restriction of r to the subcategory of all finite-dimensional 
vector spaces is a natural equivalence. -4 

There is a lovely part of ring theory developing these ideas. The first question is when a 
category C is “isomorphic” to a category r Mod of modules; we have to be a bit fussy about 
what isomorphism means here; it is a bit weaker than having functors F: C —> //Mod and 
G : //Mod —»• C with both composites equal to identity functors. 

Definition. A functor F: C —> V is an equivalence if there is a functor G: V —> C such 
that the composites GF and FG are naturally equivalent to the identity functors 1q and 
lj), respectively. 

Morita theory proves that if R and 5 are commutative rings, then equivalence of their 
module categories implies R = S. We will say a few words about Morita theory in Chap¬ 
ter 9, once we introduce modules over noncommutative rings, but the reader should really 
read accounts of Morita theory in Jacobson, Basic Algebra II or in Lam, Lectures on Mod¬ 
ules and Rings. 

Definition. Given functors F: C -» V and G : V —> C, then the ordered pair (F, G) is 
called an adjoint pair if, for each pair of objects C e C and I) e 'D, there are bijections 

r C.D'- Homx>(LC, D) —> Homc(C, GD) 
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that are natural transformations in C and in D. 

In more detail, the following two diagrams commute: For every /: C' -» C in C and 
g: D -> D' in V, 


Horn d {FC, D) 

r C.D 

Homc(C, GO) - 
Horn v(FC, D) 

*C,D 

\ 

Homc(C, GD) 


( Ff)* 


8 * 


(GgU 


Horn D) 

t c',d 

•Horn c (C',GD); 

-Horn v(FC, O') 

t c,d' 

¥ 

-HomcCC, GD'). 


Here is the etymology of “adjoint.” Let F — ®rB: Mod/; -> Mod/,, and let 
G = Homs(B, ): Mod/ —>■ Mod/;. The isomorphism in Theorem 8.99 is 


r: Hom s (F(A),C) Horn/;(A, G(C)). 


If we pretend that Hom( , ) is an inner product, then this reminds us of the definition of 
adjoint pairs in linear algebra: If T: V —> W is a linear transformation of vector spaces 
equipped with inner products, then its adjoint is the linear transformation T*: W —> V 
such that 

(Tv, w) — ( v, T*w) 

for all v e V and w e W. 


Example 7.104. 

(i) Let U : Groups — > Sets be the underlying functor , which assigns to each group G 
its underlying set and views each homomorphism as a mere function, and let F : Sets -» 
Groups be the free functor , which assigns to each set X the free group FX having basis 
X. That FX is free with basis X says, for every group H. that every function tp: X -> FI 
corresponds to a unique homomorphism {p: FX -* H. It follows that if <p: X —> Y is 
any function, then ip: FX -> FY ; this is how F is defined on morphisms: Fq> — f. The 
reader should realize that the function / f\X is a bijection (whose inverse is <p i-> <p) 

T X .H ■ Hom (; nmps (FX, H ) Hom Se t S (X,UH). 

Indeed, rx,H is a natural bijection, showing that (F, U) is an adjoint pair of functors. 

This example can be generalized by replacing Groups by other categories having free 
objects; for example, /;Mod for any ring R. 

(ii) Adjointness is a property of an ordered pair of functors. In (i), we saw that (F, U) is an 
adjoint pair, where F is a free functor and U is the underlying functor. Were (LI. F) an ad¬ 
joint pair, then there would be a natural bijection Homs e ts(L//, Y) = HomGroups(H, FY), 
where H is a group and Y is a set. This is false in general; if H is a finite group with more 
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than one element and Y is a set with more than one element, then Hom.setslC II, Y ) has 
more than one element, but HomcroupsC#, FY) has only one element. Therefore, (U, F) 
is not an adjoint pair. 

(iii) In the next chapter, we shall see (Theorem 8.99) that for every covariant Horn functor 
G — Horn«(.4, ), there exists a functor F such that (F, G ) is an adjoint pair (F = A®ft 
is called tensor product ). ◄ 

For many more examples of adjoint pairs of functors, see Mac Lane, Categories for the 
Working Mathematician , Chapter 4, especially pages 85-86. 

Let (F . G ) be an adjoint pair of functors, where F: C -* V and G: V —»■ C. If C e 
obj(C), then setting D = FC gives a bijection r: Horn x>(FC, FC ) —> Homc(C, GFC), 
so that rjc, defined by 

ilc — r(l fc), 

is a morphism C —> GFC. Exercise 7.75 on page 518 shows that i]: 1 q -> GF is a 
natural transformation; it is called the unit of the adjoint pair. 

Theorem 7.105. Let (F, G ) be an adjoint pair of functors, where F: C —> T> and 
G: T> —> C. Then F presents all direct limits and G presents all inverse limits. 


Remark. 

(i) There is no restriction on the index sets of the limits; in particular, they need not be 
directed. 

(ii) A more precise statement is that if lim C, exists in C, then lim FC, exists in V, and 
lim FCi = EllimC/). ◄ 


Proof. Let I be a partially ordered set, and let {C;, cpl} be a direct system in C over I. 
It is easy to see that {FCi, Ftp’j] is a direct system in V over I . Consider the following 
diagram in V: 

/• (Inn Cj) y > p 



where a,- : C,- —»■ lim C,- are the maps in the definition of direct limit. We must show that 
there exists a unique morphism y: F(limC,) —> D making the diagram commute. The 
idea is to apply G to this diagram, and to use the unit r]: lg GF to replace G /- (lim C,) 
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and GFCj by limC; and C,, respectively. In more detail, there are morphisms i] and ly, 
by Exercise 7.75 on page 518, making the following diagram commute: 

lim C, _ 'L GFQimCt) 

A 

GFoii 

Ci -— GFCi 

Combining this with G applied to the original diagram gives commutativity of 

lim Ci /f C,D 



By definition of direct limit, there exists a unique /3 : lim C, —> GD making the diagram 
commute; that is, ji e Home( lim C;, GD). Since ( F,G ) is an adjoint pair, there exists a 
natural bijection 


r: Hom©(.F( lim C/), D) -> Home ( lim C,. GD). 

Define 

Y = r _1 (/6) e Homx,(C(lhnC,), D). 

We claim that y: /- (lim C,) —»• D makes the first diagram commute. The first commuta¬ 
tive square in the definition of adjointness gives commutativity of 


Hom c (lim C/, GD j J**—Home(C,, GD) 

r~> r - 1 

V 

(Fry}* 

Horn©( F (hm Cj). D) ' Horn©( FCi.D ). 

Hence, r _1 o:* = (Fo!,-)*r _1 . Evaluating both functions on (i, we have 


(Fch)*t- 1 (P) = (Fctify = yFon. 


On the other hand, since fia, = ( Gfi)rn , we have 

= T-'tfai) = r“ 1 ((G//)i ?I ). 
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Therefore, 

yFo/t = t ~ 1 ((Gfi)rii). 

The second commutative square in the definition of adjointness gives commutativity of 


Ho m v (FCi, FQ ) Homp (FCj , D) 

Z Z 

'' (Gf) • 

Hom c (C ; -, GFCi) Hom c (C/, CD): 


that is. 


= (G/i)* r. 


Evaluating at 1 /tq, the definition of rj, gives r(/})*(l) = (G//)*r(l), and so rf) = 
( Gfi)*rii■ Therefore, 


yLa, = r 1 ((Gfj)rii) = r l rf = /;, 

so that y makes the original diagram commute. 

We leave the proof of the uniqueness of y as an exercise for the reader, with the hint to 
use the uniqueness of fi. 

The dual proof shows that G preserves inverse limits. • 

There is a necessary and sufficient condition, called the adjoint functor theorem , that a 
functor be part of an adjoint pair; see Mac Lane, Categories for the Working Mathemati¬ 
cian, page 117. 

Exercises 

7.65 Let [Mj , (pi ) be a direct system of R -modules with index set 1, and let |_J,- M, be the disjoint 
union. Define m,- ~ mj on M,-, where m / e Mj and mj e M/, if there exists an index k 
with k > i and k > j such that (pi mi = tpl m j. 

(i) Prove that ~ is an equivalence relation on |_| ( - Mj. 

(ii) Denote the equivalence class of nij by [w, ], and let L denote the family of all such 
equivalence classes. Prove that the following definitions give L the structure of an R- 
module: 

r[nij] = [rttij] if r e R\ 

[mj] + [m'j] = [(p^nij + <pj.m'j ], where k > i and k > j. 

(iii) Prove that L = lim Mj. 

Hint. Use Proposition 7.98. 

7.66 Let {Mj, (pi) be a direct system of f?-modules, and let F: ^Mod —> C be a functor to some 
category C. Prove that {FMj, F(pi) is a direct system in C if F is covariant, while it is an 
inverse system if F is contravariant. 
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7.67 Give an example of a direct system of modules, (A/, oA}, over some directed index set I, for 
which A, 7 ^ {0) for all i and lim A,- = {0}. 

7.68 (i) Let K be a cofinal subset of a directed index set I (that is, for each i 6 /, there is k e K 

with i < k ), let (M,\ i pi) be a direct system over I, and let {M;, tp 1 .} be the subdirect 
system whose indices lie in K. Prove that the direct limit over I is isomorphic to the 
direct limit over K. 

(ii) A partially ordered set I has a top element if there exists oo e / with i < oo for all 
i e I. If ( Mj. (pi } is a direct system over /, prove that 

limMj = Moo. 

(iii) Show that part (i) may not be true if the index set is not directed. 

Hint. Pushout. 

7.69 Let C and V be categories, and let J-(C, V) denote the class of all (covariant) functors C —y V. 
Prove that T(C,V) is a category if we define 

Hom(F, G) = {all natural transformations F —> Gj. 

Remark. There is a technical, set-theoretic, problem; why is Hom(F, G) a set (and not a 
proper class)? The answer is that it may not be a set; the easiest (but not the only) way to 
resolve this problem is to assume that the objects in C and T> form a set; that is, C and V are 
small categories. We allow the reader to do this here. 

7.70 A functor T: /jMod -*■ Ab is called representable if there exists an F-module A and a natural 
equivalence r : T —> Hom^(A, ). Prove that if Hom^ (A, ) and Horn r(B, ) are naturally 
equivalent, then A = B. Conclude that if a representable functor T is naturally equivalent to 
Hom^fA, ), then A is determined, up to isomorphism, by T. 

7.71 If pY is the category of all finite-dimensional vector spaces over a field k, prove that the double 
dual, V V**, is naturally equivalent to the identity functor. 

7.72 Let { Ej , <p '.} be a direct system of injective F-modules over a directed index set I. Prove that 
if R is noetherian, then lim E ; is an injective module. 

Hint. Use Proposition 7.69. 

7.73 Consider the ideal I = (x) in k[ jc], where k is a commutative ring. Prove that the completion 
of the polynomial ring k[x} is L[[x]], the ring of formal power series. 

7.74 Let r : {A/, a '.} —>■ {6 ,, /S', j and j : [Bj, fl l .} —> [C ,, yj) be transformations of inverse sys¬ 
tems over an index set I. If 

0 -»• A; X Bi X C, 

is exact for each i G l, prove that there is an exact sequence 

0 -*■ lim Aj hm Bj hm C,. 

7.75 Let (F, G) be an adjoint pair of functors, where F: C -*■ T> and G: V —>■ C, and let 
T C.D '■ HomfFC, D) —> Hom(C, GC) be the natural bijection. 

(i) If D = FC, there is a natural bijection 

r C FC : HornfFC, FC) -+ Hom(C, GFC) 

with t(1 pc) — SC e Hom(C, GFC). Prove that t): lp -*■ GF is a natural transfor¬ 
mation. 
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(ii) If C = GD. there is a natural bijection 

r GD D : Hom (G», GD) -+ Horn {FGD, D) 

with r 1 (1 z?i) — e D e Hom(FGZ), D). Prove that g: FG —*■ lj) is a natural transfor¬ 
mation. (We call s the counit of the adjoint pair.) 

7.76 If I is a partially ordered set and C is a category, then a presheaf over / to C is a contravariant 
functor T : PO(/)-*-C. 

(i) If / is the family of all open intervals U in R containing 0, show that T in Exam¬ 
ple 7.97(vi) is a presheaf of abelian groups. 

(ii) Let X be a topological space, and let I be the partially ordered set whose elements are 
the open sets in X. Define a sequence of presheaves T' -* T -> T" over I to Ab to 
be exact if 

T'(U) -» T(/U) -» T'\U) 

is an exact sequence for every JJ e /. If T is a presheaf on /, define T x , the stalk at 
.v e X , by 

T x = lim T(U). 

U3X 

If T' —> T —> T" is an exact sequence of presheaves, prove, for every x e X, that 
there is an exact sequence of stalks 

K -+ -+ ?"■ 

7.77 (i) Let F : Groups — >■ Ab be the functor with F(G) = G/G where G' is the commutator 

subgroup of a group G, and let U : Ab —>■ Groups be the functor taking every abelian 
group A into itself (that is, U A regards A as a not necessarily abelian group). Prove that 
( F, U) is an adjoint pair of functors. 

(ii) Prove that the unit of the adjoint pair (F, U) is the natural map G —»■ G/G 1 . 

7.78 Prove that if T: ^Mod —»■ Ab is an additive left exact functor preserving products, then T 
preserves inverse limits. 

7.79 Generalize Proposition 5.4 to allow infinitely many summands. Let {S, : i e /} be a family of 
submodules of an F-module M, where R is a commutative ring. If M = (U;'e/ 5'/), then the 
following conditions are equivalent. 

(i) M = Ziel Si. 

(ii) Every a e M has a unique expression of the form a = sq + • • • + Sj n , where Sjj e Sij. 

(iii) For each i e /, 






Algebras 


This chapter introduces noncommutative rings, along with modules over them. We begin 
by showing that modules are just another way of viewing representations of rings; that is, 
ring elements can be viewed as operators on an abelian group. Afterward, we prove the 
Wedderburn-Artin theorems, which classify semisimple rings, and Maschke’s theorem, 
which says that group algebras are usually semisimple. After a formal interlude investi¬ 
gating tensor products, a construction intimately related to Horn functors (thanks to the 
adjoint isomorphism ), we introduce representations and characters of finite groups. This 
discussion is then applied to prove group-theoretic theorems of Burnside and of Frobenius. 


8.1 Noncommutative Rings 

All the rings we have considered so far are commutative, but there are interesting examples 
of noncommutative rings as well. 

Definition. A ring R is an additive abelian group equipped with a multiplication 
R x R —> R, denoted by (a, b) ab, such that, for all a, b, c e R , 

(i) a(bc) — ( ab)c ; 

(ii) a(b + c) — ab + ac and ( b + c)a = ba + ca\ 

(iii) there is 1 e R such that, for all a e R, 


la = a — al. 


Here are some examples of rings that are not commutative. 

Example 8.1. 

(i) If k is any commutative ring, then Mat,, ik), all n x n matrices with entries in k, is a ring 
under matrix multiplication and matrix addition; it is commutative if and only if n — 1. 
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If k is not commutative. Mat,, ( k ) is a ring, for the usual definition of matrix multiplica¬ 
tion still makes sense: If A — [aij] and B — [bjj], then the ij entry of AB is ^ a i p b p j ; 
just make sure that entries in A always appear on the left and that entries of B always 
appear on the right. 

(ii) If k is any commutative ring and G is a group (whose operation is written multiplica- 
tively), then we define the group algebra kG as follows. Its additive abelian group is the 
free /.'-module having a basis labeled by the elements of G; thus, each element has a unique 
expression of the form Yl-.cO a g8' where a g e k for all g e G and almost all a„ — 0; that 
is, only finitely many a g can be nonzero. If g and h are basis elements (i.e., if g. h e G), 
define their product in kG to be their product gh in G, while ag — ga whenever a e k and 
g e G. The product of any two elements of kG is defined by extending by linearity: 

(J2 a s8)(J2 b h h ) = J2{J2 a s bh ) z - 

geG heG zzG gh=z 

A group algebra kG is commutative if and only if the group G is abelian. 

In Exercise 8.17 on page 533, we give another description of kG , when G is a finite 
group, as all functions G -> k under pointwise addition and convolution. 

(iii) An endomorphism of an abelian group A is a homomorphism /: A -* A. The endo¬ 
morphism ring of A, denoted by End(A), is the set of all endomorphisms under pointwise 
addition 

f + g: a i-» f(a)+g(a), 

and composition as multiplication. It is easy to check that End(A) is always a ring, and 
simple examples show that it may not be commutative. For example, if p is a prime, then 
End(I p ® Ip) = Mat 2 (Fp). 

(iv) Let k be a ring, and let a : k -+ k be a ring endomorphism. Define a new multiplication 
on k[x ] = {JT ajX 1 : a,- e k} by 

xa — a{a)x. 

Thus, multiplication of two polynomials is now given by 

( aix ')( b i xJ ) = J2 r c ’ xr - 

where c r — • a,cr' (bj). It is a routine exercise to show that k[x \, equipped with this 

new multiplication, is a not necessarily commutative ring. We denote this ring by k[x\ cr], 
and we call it a ring of skew polynomials. 

(v) If R\,... ,R t are rings, then their direct product, 

R = R l x • • • X R t , 

is the cartesian product with coordinatewise addition and multiplication: 

(n) + (rf) = in + r' t ) and (r,)(r/) = (r.-r/); 
we have abbreviated (r\,... ,r t ) to (r,). 
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It is easy to see that R — R\ x • • • x R t is a ring. Let us identify r, e Rj with the “vector” 
whose ;th coordinate is r,- and whose other coordinates are 0. If i ^ /', then r,r ; = 0. 

(vi) A division ring D (or a skew field) is a “noncommutative field”; that is, D is a ring in 
which 1^0 and every nonzero element a e I) has a multiplicative inverse: there exists 
a' e D with a a' — 1 = a'a. Equivalently, a ring D is a division ring if the set Z) x of its 
nonzero elements forms a group under multiplication. Of course, fields are division rings; 
here is a noncommutative example. 

Let H be a four-dimensional vector space over R, and label a basis 1, i, j, k. Thus, a 
typical element h in H is 

h = a + bi + cj + dk , 

where a, b, c, d el. We define a multiplication of basis elements as follows: 



ij — k — —ji\ jk — i = —kj; ki = j = —ik, 

and we insist that every ael commutes with 1. i, j. k. If we now define a multiplication 
on arbitrary elements by extending by linearity, then H is a ring, called the (real) quater¬ 
nions 1 (associativity of multiplication follows from associativity of multiplication in the 
group Q = {±1, ±L ± j, ±A'| of quaternions). To see that H is a division ring, it suffices 
to find inverses of nonzero elements. Define the conjugate of u — a + bi + cj + dk e H 
by 

u — a — bi — cj — dk\ 


we see easily that 


uu — a~ + b~ T -(- d ~. 


Hence, uu ^ 0 when u 0, and so 

11 — u juu — u/{a~ -t- b~ -f- c“ d~). 


It is not difficult to prove that conjugation is an additive isomorphism satisfying 


uw = w LI. 


Just as the Gaussian integers were used to prove Fermat’s two-squares theorem (Theo¬ 
rem 3.66)—An odd prime p is a sum of two squares if and only if p = 1 mod 4—so, too, 
can the quaternions be used to prove Lagrange’s theorem that every positive integer is the 
sum of four squares (see Samuel, Algebraic Theory of Numbers, pages 82-85). 

The only property of the field R we have used in constructing H is that a sum of nonzero 
squares be nonzero; any subfield of R has this property, but C does not. For example, there 
is a division ring of rational quaternions. 

We shall construct other examples of division rings in Chapter 10 when we discuss 
crossed product algebras. ◄ 


1 The quaternions were discovered in 1843 by W. R. Hamilton when he was seeking a generalization of the 
complex numbers to model some physical phenomena. He had hoped to construct a three-dimensional algebra 
for this purpose, but he succeeded only when he saw that dimension 3 should be replaced by dimension 4. This 
is why Hamilton called H the quaternions , and this division ring is denoted by H to honor Hamilton. 



Sec. 8.1 Noncommutative Rings 


523 


Remark. Some mathematicians do not assume, as part of the definition, that rings must 
contain a unit element 1. They point to natural examples, as the even integers or the 
integrable functions, where a function /: [0, oo) —» R is integrable if 

\f(x)\dx= lim / \f(x)\dx<oo. 
t^ocJ Q 

It is not difficult to see that if / and g are integrable, then so are their pointwise sum f + g 
and pointwise product fg. The only candidate for a unit is the constant function e with 
e(x) = 1 for all x e [0, oo) but, obviously, e is not integrable. 

The absence of a unit, however, makes many constructions more complicated. For 
example, if R is a “ring without unit” and a e R. then defining ( a ), the principal ideal 
generated by a, as (a) = {ra : r e R}. leads to the possibility that a £ (a); thus, we must 
redefine (a) to force a inside. Polynomial rings become strange: If R has no unit, then 
x $. There are other (more important) reasons for wanting a unit, but these examples 
should suffice to show that not assuming a unit can lead to some awkwardness; therefore, 
we have decided to insist that rings do have units. 

Exercise 8.1 on page 531 shows that every “ring without unit” can be imbedded as an 
ideal in a ring (with unit). ■* 

A subring S of a ring R is a ring contained in R so that 1 e S and if ,v, s' e .S’, then 
their sum s + s' and product ss' have the same meaning in S as in R. Here is the formal 
definition. 

Definition. A subring S of a ring R is a subset of R such that 

(i) 1 e 5; 

(ii) if a, b e S , then a — b e S\ 

(iii) if a, b e S , then ab e S. 



Example 8.2. 

(i) The center of a ring R. denoted by Z(R), is the set of all those elements z e R com¬ 
muting with everything: 

Z(R) — {z e R : zr — rz for all r e R}. 

It is easy to see that Z(R) is a subring of R. If k is a commutative ring, then k C Z(kG). 
Exercise 8.10 on page 532 asks you to prove that the center of a matrix ring, Z (Mat,, (/'')), is 
the set of all scalar matrices a I , where a e Z(R ) and I is the identity matrix; Exercise 8.11 
on page 532 says that Z(H) = {al : a e R}. 

(ii) If D is a division ring, then its center, Z(D), is a field. Moreover, if D x is the multi¬ 

plicative group of the nonzero elements of D , then Z(D X ) = Z(D) x \ that is, the center of 
the multiplicative group Z) x consists of the nonzero elements of Z(D). < 

Here are two “nonexamples” of subring. 
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Example 8.3. 

(i) Define S — [a + ib : a, b e Z} C C. Define addition in 5 to coincide with addition in 
C, but define multiplication in S by 

(a + bi)(c + di ) — ac + (ad + bc)i 

(thus, i 2 = 0 in S, whereas i 2 f 0 in C). It is easy to check that S is a ring, but it is not a 
subring of C. 

(ii) If R = Z x Z, then its unit is (1, 1). Let 

S — {(«, 0) e Z x Z : n e Z}. 

It is easily checked that S is closed under addition and multiplication; indeed, S is a ring, 
for (1,0) is the unit in S. However, S is not a subring of R because S does not contain the 
unit of R. a 

An immediate complication arising from noncommutativity is that the notion of ideal 
splinters into three notions. There are now left ideals, right ideals, and two-sided ideals. 

Definition. Let R be a ring, and let I be an additive subgroup of R. Then I is a left ideal 
if a € I and r e R implies ra e /, while I is a right ideal if ar e I. We say that I is a 
two-sided ideal if it is both a left ideal and a right ideal. 

Example 8.4. 

In Mat 2 (R), the equation 


\a bl 

r u 

o' 


r* 

o' 

o 

_1 

i — 

c; 

0 


* 

0 


shows that the “first columns” (that is, the matrices that are 0 off the first column), form a 
left ideal (the “second columns” also form a left ideal.) The equation 


U 

V 

a 

b 


* 

* 

_0 

0 _ 

c 

d _ 


_0 

0 _ 


shows that the “first rows” (that is, the matrices that are 0 off the first row), form a right 
ideal (the “second rows” also form a right ideal). The reader may show that neither of these 
one-sided ideals is two-sided; indeed, the only two-sided ideals are {0} and Mat 2 (R) itself. 
This example generalizes, in the obvious way, to give examples of left ideals and of right 
ideals in Mat,, (k ) for all n > 2 and every ring k. a 

Example 8.5. 

In a direct product of rings, R = R\ x • • • x /(,, each If is identified with 
Rj = {(0. 0, rj, 00) : rj e Rj }, 

where rj occurs in the /'th coordinate. It is easy to see that each such Rj is a two-sided 
ideal in R (for if j f i, then r/r, = 0 and r,r ; = 0). Moreover, any left or right ideal in 
R j is also a left or right ideal in R. 
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Homomorphisms <p : R —> S of rings are defined exactly as in the commutative case; 
we shall see that their kernels are two-sided ideals. Annihilator ideals, defined in the next 
section, are another source of two-sided ideals. 

Definition. If R and S are rings, then a ring homomorphism (or ring map) is a function 
<p\ R -* S such that, for all r. r' e R. 

(i) <p(r + r') = <p(r) + <p(r f ); 

(ii) (p(rr') = <p(r)<p(r')\ 

(iii) <P( 1) = 1. 

If : R —> S is a ring homomorphism, then the kernel is defined as usual: 

ker tp — {r e R : (p(r) = 0}. 

The image is also defined as usual: 

im^={seX:} = (p(r) for some r e R}. 

The kernel is always a two-sided ideal, for if (p(a) — 0 and re R. then 

cp(ra) — (p(r)(p(a) — 0 = (p(a)ip(r) — (p{ar), 

so that a e ker <p implies both ra and ar lie in ker <p. On the other hand, mup is only a 
subring of S. 

We can form the quotient ring R/I when I is a two-sided ideal, because the multiplica¬ 
tion on the quotient abelian group R //, given by (r + I)(s + I) — rs + I , is well-defined: 
If r + I — r' + I and s + I = s' + I, then rs + I — r's' + I. That is, if r — r' e / and 
s — s' € /, then rs — r's' e I. To see this, note that 

rs — r's' — rs — rs' + rs' — r's' — r(s — s') + (r — r')s e /, 

for both s — s' and r — r' lie in I and, since I is a two-sided ideal, each term on the right 
side also lies in I. It is easy to see that the natural map n : R —> R/I , defined (as usual) 
by r r + /, is a ring map. It is routine to check that the isomorphism theorems and the 
correspondence theorem hold for (noncommutative) rings. 

We now define /^-modules when R is any, not necessarily commutative, ring. In contrast 
to the commutative case, there are now two different kinds of /^-modules: left R-modules 
and right R-modules. We have already defined left R-modules (although we have been 
calling them R-modules until now). 

Definition. Let R he a ring. A left R-module is an (additive) abelian group M equipped 
with a scalar multiplication R x M —> M, denoted by 

(r, m) i—>• rm, 

such that the following axioms hold for all m, m' e M and all r. r', 1 e R: 

(i) r(m + in') = rm + rm!\ 



526 


Algebras Ch. 8 


(ii) (r + r')m — rm + r'm; 

(iii) (rr')m = r(r'm)\ 

(iv) 1 m = m. 


Definition. A right R-module is an (additive) abelian group M equipped with a scalar 
multiplication M x R -> M , denoted by 

(m, r) i —> mr, 

such that the following axioms hold for all m, in' e M and all r. r', 1 e R: 

(i) ( m + m')r = mr + m’r\ 

(ii) m(r + r') — mr + mr'; 

(iii) m(rr') — (mr)r’\ 

(iv) ml — m. 

Notation. We denote a left /?-module M by rM. and we denote a right R -module M by 
Mr. 

Of course, there is nothing to prevent us from denoting the scalar multiplication in a 
right /^-module by (in. r) i-^ rm. If we do so, then we see that only axiom (iii) differs 
from the axioms for a left /?-module; the right version now reads 

(rr )m — r (rm). 

That there is an honest difference between these two definitions is apparent from ideals. A 
left ideal in a ring R is a left /^-module, a right ideal is a right /^-module, and we have seen 
in Example 8.4 that these are different things. 

We define submodule in the obvious way; it is a subgroup that is closed under scalar 
multiplication. Note that a ring R can be regarded as a left /^-module (denoted by rR) 
or as a right /^-module (denoted by Rr). The submodules of r R are the left ideals; the 
submodules of Rr are the right ideals. If A is a submodule of a left /^-module M. then the 
quotient module M/N is the quotient group made into a left /^-module by defining scalar 
multiplication to be r(m + N) — rm + N. 

Definition. An additive function /: Mr — > Nr between right R -modules M and N is 
an R-homomorphism (or /Amap) if f (mr) — f (m)r for all in e M and r e R. All the 
right R -modules and /Amaps form a category, denoted by Mod r. The notation r Mod has 
already been introduced to denote the category of all left /^-modules. In either category, 
we denote the set of all /Amaps between /^modules M and N , where both are R -modules 
on the same side, by 


Hom^(M, N). 
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Example 8.6. 

Let G be a group, let k be a commutative ring, and let A be a left AG-module. Define a 
new action of G on A, denoted by g * a, by 


g*a= g l a, 

where a e A and g e G. For an arbitrary element of AG, define 

(E w ^)* a = E m ^“ la - 

geG geG 

It is easy to see that A is a right AG-module under this new action; that is, if u e kG 
and a e A, the function A x kG —> A, given by (a, u) i-> u * a, satisfies the axioms in 
the definition of right module. Of course, we usually write au instead of u * a. Thus, a 
AG-module can be viewed as either a left or a right AG-module. ■* 

Example 8.7. 

We now generalize Example 8.1(iii). If M is a left R-module, then an A 1 -map /; M —> M 
is called an R-endomorphism of M. The endomorphism ring , denoted by End«(M), is the 
set of all R-endomorphisms of M. As a set. End r(M) — Homs(M, M), which we have 
already seen is an additive abelian group. Now define multiplication to be composition: If 
/, g : M AT, then fg : m f(g(m)). 

If M is regarded as an abelian group, then we write End^(M) for the endomorphism 
ring End(M) (with no subscript) defined in Example 8.1(iii), and End^( M) is a subring of 
End^(M). ◄ 

We are now going to show that ring elements can be regarded as operators (that is, as 
endomorphisms) on an abelian group. 

Definition. A representation of a ring R is a ring homomorphism 

(7 : R —> End^(M), 


where M is an abelian group. 

Representations of rings can be translated into the language of modules. 

Proposition 8.8. Every representation a : R —> End z(M), where M is an abelian group, 
equips M with the structure of a left R-module. Conversely, every left R-module M deter¬ 
mines a representation a : R -> End^(M). 

Proof. Given a homomorphism a : R -* End^(M), denote <r(r): M -> M by o r , and 
define scalar multiplication R x M -> M by 


rm = cr r (m), 
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where m e M. A routine calculation shows that M, equipped with this scalar multiplica¬ 
tion, is a left /?-module. 

Conversely, assume that M is a left R -module. If r e R, then m i-> rm defines an 
endomorphism T r : M —»■ M. It is easily checked that the function o : R —> Endz(Af), 
given by a : r i-»- T r , is a representation. • 

Definition. A left /^-module is called faithful if, for all r e R, whenever rm — 0 for all 
m e M, then r = 0. 

Of course, M being faithful merely says that the representation a : R —> End^i/V/ ) 
(given in Proposition 8.8) is an injection. 

An R -module M is finitely generated if there are finitely many elements m i, ..., ni „ e 
M with every x e M an /^-linear combination of m i,..., m n . In particular, an R -module 
is cyclic if it generated by one element. 

Example 8.9. 

Let E/k be a Galois extension with Galois group G = Gal( E/k). Then E is a /cG-module: 
If e e E, then 

a a o)(e) — ^ a c ,a(e). 
creG creG 

We say that E/k has a normal basis if £ is a cyclic kG-module. Every Galois extension 
E/k has a normal basis (see Jacobson, Basic Algebra I, p. 283). ◄ 

We can now augment Proposition 7.24, an earlier result about algebraic integers. 

Proposition 8.10. 

(i) If M is a finitely generated abelian group that is a faithful left R-module for some 
ring R, then the additive group of R is finitely generated. 

(ii) If a is a complex number, let Z[a\ be the subring of C it generates. If there is a 
faithful Z [a]-module M that is finitely generated as an abelian group, then a is an 
algebraic integer. 


Proof, (i) By Proposition 8.8, the ring R is isomorphic to a subring of End^(M). Since 
M is finitely generated. Exercise 8.6 on page 531 shows that End^(M) = Horn z(M, M ) 
is finitely generated. By Proposition 7.24, the additive group of R is finitely generated. 

(ii) By Proposition 7.24, it suffices to prove that the ring Z[or] is finitely generated as an 
abelian group, and this follows from part (i). • 

We could define right-sided versions of all the previous definitions in Chapter 7— 
submodule, quotient module, /\’-homomorphisms, isomorphism theorems, correspondence 
theorem, direct sums, and so on—but there is a more elegant way to do this. 
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Definition. Let R he a ring with multiplication //: R x R —> R. Define the opposite ring 
to be the ring R op whose additive group is the same as the additive group of R, but whose 
multiplication pi op : R x R -> R is defined by pi op (r, s) = pi(s, r) = sr. 

Thus, we have merely reversed the order of multiplication. It is straightforward to check 
that R op is a ring; it is obvious that (7? op ) op = R; moreover, R = R op if and only if R is 
commutative. 


Proposition 8.11. Every right R-module M is a left R op -module, and every left R-module 
is a right R op -module. 

Proof. We will be ultra-fussy in this proof. To say that M is a right /^-module is to say 
that there is a function a : M x R -* M. denoted by a {in, r) — mr. If pt : R x R —> R is 
the given multiplication in R , then axiom (iii) in the definition of right R -module says 

er(m, p.{r, r )) = o(o(m , r), r). 


To obtain a left /^-module, define o' : R x M —> M by o'(r, m ) = aim. r). To see that M 
is a left R op -module, it is only a question of checking axiom (iii), which reads, in the fussy 
notation, 

a , (/r op (r, r'), m) — o'(r, o'(r', m)). 


But 


o' (|U° p (r, r'), m) = o(m, /x op (r, r')) — o(m , /l(r', r)) = m(r'r ), 


while the right side is 


o'(r, o'(r', m)) — o(o'(r ', m), r) — o(o(m, r'), r) = (mr')r. 


Thus, the two sides are equal because M is a right /^-module. 

The second half of the proposition now follows because a right R op -module is a left 
(R°P)°P -module; that is, a left /^-module. • 

It follows from Proposition 8.11 that any theorem about left modules is, in particular, a 
theorem about left /? op -modules, and hence it is also a theorem about right R -modules. 

Let us now see that opposite rings are more than an expository device; they do occur in 
nature. 


Proposition 8.12. If a ring R is regarded as a left module over itself then there is an 
isomorphism of rings 

End*)/?) = R op . 

Proof. Define tp\ End«(/?) —> R op by tp(f) = /(1); it is routine to check that tp is 
an isomorphism of additive abelian groups. Now tp{f)(p{g) = /(l)g(l). On the other 
hand, tp{fg) = (/ o g)(l) = f(g( 1)). But if we write r = g(l), then f(g{ 1 )) = f{r) = 
f(r- 1) = rf{ 1), because / is an /Nmap, and so f{g{ 1)) = r/(l) = g(l)/(l). Therefore, 

<P(fg) = <P(g)<P(f)- 

We have shown that tp: End k ( R ) —> R is an additive bijection that reverses multiplica¬ 
tion. • 
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An anti-isomorphism <p: R —»■ A, where R and A are rings, is an additive bijection 
such that 

<p(rs) - <p(s)<p(r). 

It is easy to see that R and A are anti-isomorphic if and only if R = A op . For example, 
conjugation in H is an anti-isomorphism. If A: is a commutative ring, then transposition, 
A A', is an anti-isomorphism Mat,,(A:) -> Mat,,(A:), because (AB)’ — B' A'\ therefore. 
Mat,, (A:) = [Mat„(A:)] op . However, when A' is not commutative, the formula (AB)’ — B‘ A‘ 
no longer holds. For example, 

( a b P q V ap + br aq + bs ’ 

c d r s ) cp + dr cq + ds ’ 

while 

p q a b _ p r a c 

r sj |_c d\ ~ [q sj d_ 

has pa + rb ^ ap + br as its 1,1 entry. 


Proposition 8.13. If R is any ring, then 

[Mat„(/?)] op = Mat„(7? op ). 

Proof. We claim that transposition A i-> A' is an isomorphism of rings 

[Mat„(/0] op -* Mat n (R op ). 

First, it follows from ( A’)’ = A that A i-> A’ is a bijection. Let us set notation. If M = 
[nijj ] is a matrix, its ij entry nijj may also be denoted by (M) ir Denote the multiplication 
in R op by a*b, where a*b — ba, and denote the multiplication in [Mat,, (A )] op by A* B, 
where (A * B)jj — (BA)jj — Jf^bikOkj € R. We must show that (A * B)’ — A’B‘ in 
Mat„(7? op ). In [Mat„(/?)] op , we have 


In Mat,, ( R op ), we have 


(A * B)\j = (BA)\j 

= (BA)ji 
— ^ b jkOki ■ 


(A’B’fj = Y,( A, )ik* (B’)kj 

k 

— ^2(A)ki * (B)jk 

k 

— '22 aki * b J k 

k 

= ' bjkUki • 

k 


Therefore, (A * B)‘ = A 1 B 1 in Mat„(f? op ), as desired. • 
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Direct sums and direct products of //-modules, where R is any (not necessarily com¬ 
mutative) ring, exist. An R -module is, after all, an additive abelian group equipped with 
a scalar multiplication. If {M, : i el} is a family of left R -modules, construct the di¬ 
rect product n ie/ Mj as the direct product of the underlying abelian groups, and then 
define scalar multiplication by r(m ,) = (rm,) if all the M, are left //-modules, or by 
(m,)r = (nijr) if all the M,- are right //-modules. As with modules over commutative 
rings, define the direct sum M, as the submodule of P| , M, consisting of all /-tuples 
almost all of whose coordinates are 0. There is no difficulty in adapting the definition and 
first properties of external and internal direct sums, such as Proposition 7.15 and Corol¬ 
lary 7.16. 

Since direct sums exist, we can also construct free left //-modules (as direct sums of 
copies of rR) and free right //-modules (as direct sums of Rr). 

Exact sequences of left or of right modules also make sense (again, because modules 
are additive abelian groups with extra structure), and the reader should have no difficulty 
using them. 


Exercises 


8.1 Let R be an additive abelian group equipped with an associative multiplication that satisfies 
both distributive laws. Define a multiplication on the abelian group R* = Z © R by 

(m, r)(n, s) = (mn, ms + nr + rs ), 


where ms is the sum of s with itself m times if m > 0, and ms is the sum of —s with itself \m\ 
times if m < 0. 

Prove that R* is a ring with unit (1, 0), and that R is a two-sided ideal in R*. (We say that 
R* is obtained from R by adjoining a unit.) 


8.2 Let R be the set of all matrices of the form 


, where a and b are complex numbers 


a b\ 

|_—b aj 

and a denotes the complex conjugate of a. Prove that R is a subring of Mat 2 (C) and that 
R = H, where H is the division ring of quaternions. 


8.3 Prove that the following conditions on a ring R are equivalent: 


(i) For every sequence of left ideals L| 2 ^2 3 /3 2 there exists N so that L; = 
L;_|_i for all i > N\ 

(ii) Every nonempty family T of left ideals has a minimal element in T. 


8.4 (Change of Rings) Let (p: R —> S be a ring homomorphism, and let M be a left S-module. 
Show that the function SxM -> M , given by (r, m) i-x ip(r)m, defines a scalar multiplication 
that makes M a left (?-module. 

8.5 Let I be a two-sided ideal in a ring R. Prove that an abelian group M is a left (_K//)-module 
if and only if it is a left /^-module that is annihilated by I. 

8.6 If M is a finitely generated abelian group, prove that the additive group of the ring End(M) is 
a finitely generated abelian group. 
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Hint. There is a finitely generated free abelian group F mapping onto M; apply Hom( , M) 
to F —>■ M —y 0 to obtain an injection 0 —>■ Hom(M, M) —>■ Hom(F, M). But Hom(F, M) 
is a finite direct sum of copies of M. 

8.7 (i) If k is a commutative ring and G is a cyclic group of finite order n, prove that kG = 

k[x]/(x n - 1). 

(ii) If k is a domain, define the ring of Laurent polynomials as the subring of k(x) consisting 
of all rational functions of the form f(x)/x n for n e Z. If G is infinite cyclic, prove 
that kG is isomorphic to Laurent polynomials. 

8.8 Let R be a four-dimensional vector space over C with basis 1, j, k. Define a multiplication 
on R so that these basis elements satisfy the same identities satisfied in the quaternions H [see 
Example 8.1(vi)]. Prove that R is not a division ring. 

8.9 If k is a ring, possibly noncommutative, prove that Mat„ (k) is a ring. 

8.10 Prove that the center of a matrix ring Mat„(F) is the set of all scalar matrices al, where 
a e Z(R) and 1 is the identity matrix. 

8.11 Prove that Z(E I) = [al : a e Rj. 

8.12 Let R = Fj x • • • x R m be a direct product of rings. 

(i) Prove that R op — F° p x ■ ■ • x R°f. 

(ii) Prove that Z(R) = Z(R\) x ■ • ■ x Z(R m ). 

(iii) If k is a field and 

R = Mat„j (k) x • • • x Mat„ m (k), 
prove that dim£(Z(F)) = m. 

8.13 If A is a division ring, prove that A°P is also a division ring. 

8.14 An idempotent in a ring A is an element e e A with e~ = e. If R is a ring and M is a left 
F-module, prove that every direct summand SQM determines an idempotent in End^(M). 
Hint. See Corollary 7.17. 

8.15 Let R be a ring. 

(i) (Peirce Decomposition). Prove that if e is an idempotent in a ring R. then 

R = Re © R{1 — e). 

(ii) Let R be a ring having left ideals / and J such that R — I © J. Prove that there are 
idempotents eel and / e J with 1 = e + /; moreover, / = I e and J = J f . 

Hint. Decompose 1 = e + f, and show that ef — 0 = fe. 

8.16 An element a in a ring R has a left inverse if there is u e R with ua — 1, and it has a right 
inverse if there is t v e R with aw = 1. 

(i) Prove that if a e R has both a left inverse u and a right inverse w , then u — w. 

(ii) Give an example of a ring R in which an element a has two distinct left inverses. 

Hint. Define R = End^(V), where V is a vector space over a field k with basis 
[bn : n > lj, and define a e R by a{b n ) = b n+ \ for all n > 1. 

(iii) (Kaplansky) Let R be a ring, and let a, u,veR satisfy ua = 1 = va. If u v. prove 
that a has infinitely many left inverses. 

Hint. Are the elements u + a"( 1 — au) distinct? 
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8.17 Let k be a field, let G be a finite group, and let J-(G, k) denote the vector space of all functions 
G k. 

(i) Define <p : kG —> T{G, k) as follows: If u = ci x x G kG. then <p u : x a x . Prove 
that 

<Pu+v — Vu + <Pv 

and 

<Puv(.y)= ^2v>uM(p v (x~ l y). 

xe G 

(This last operation is called the convolution of <p u and i p v .) 

(ii) Prove that T{G, k) is a ring and that <I>: kG —>■ T(G, k), given by ip M , is a ring 
isomorphism. 

8.18 (i) For k a field and G a finite group, prove that (kG )°P = kG. 

(ii) Prove that H°P = H, where H is the division ring of real quaternions. 

Exercise 8.30 on page 549 asks for a ring R that is not isomorphic to f?°P. 

8.19 (i) If R is a ring, if r G R, and if k C Z(R) is a subring, prove that the subring generated 

by r and k is commutative. 

(ii) If A is a division ring, if r G R, and if k C Z(A) is a subring, prove that the subdivision 
ring generated by r and k is a (commutative) field. 

8.20 Write the elements of the group Q of quaternions as 

1, 1, i, i, j, j, k, k, 

and define a linear transformation (p : RQ —>■ II by removing the bars: 

< p(x ) = (p{x ) = x for x = 1, i, j , k. 


Prove that <p is a surjective ring map. and conclude that there is an isomorphism of rings 
RQ/ ker (p = EL (See Example 9.113 for a less computational proof.) 

8.21 If R is a ring in which .v 2 = x for every x G R. prove that R is commutative. (A Boolean ring 
is an example of such a ring.) 

8.22 Prove that there is an equivalence of categories ^Mod —»■ Mod^op. 

Hint. Given a left ^-module (M, a), where M is an additive abelian group and a : 8xM-> 
M is its scalar multiplication, consider the right /?°P-module (M, a'), where o' : M x R° P — > 
M is defined in Proposition 8.11. Define F : ^Mod Mod^op on objects by (M.o) 

(M, cr'). 


8.2 Chain Conditions 


This section introduces chain conditions for modules over an arbitrary ring, as well as the 
Jacobson radical, J(R), a two-sided ideal whose behavior has an impact on a ring R. For 
example, semisimple rings R are rings that generalize the group ring CG of a finite group 
G, and we will characterize them in the next section in terms of J (R ) and chain conditions. 
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We will also prove a theorem of Wedderburn that says that every finite division ring is a 
field; that is, it is commutative. 

We have already proved the Jordan-Holder theorem for groups (see Theorem 5.52). 
Here is the version of this theorem for modules. We can prove both of these versions 
simultaneously if we introduce the notion of operator groups (see Robinson, A Course in 
the Theory of Groups, page 65). 

Theorem 8.14 (Zassenhaus Lemma). Given submodules A C A* and B C B* of a 
module M (over any ring), there is an isomorphism 

A + (A* n B*) _ B + (B* n A*) 

A + (A* n B) ~ B + [B* n A) 

Proof A straightforward adaptation of the proof of Lemma 5.49. • 


Definition. A series (or a filtration) of a module M (over any ring) is a finite sequence 
of submodules M = Mq, M\, Mi, .... M n = {0} for which 


M = M 0 3Mi3M 2 3"OM„ = {0}. 


The factor modules of this series are the modules Mo/ M\, Mi/M 2 , ..., M n -\/M n — 
M n - 1 , and the length is the number of strict inclusions; equivalently, the length is the 
number of nonzero factor modules. 

A refinement of a series is a series M = Mq, Mj,..., M' k — {0} having the original 
series as a subsequence. Two series of a module M are equivalent if there is a bijection 
between the sets of nonzero factor modules of each so that corresponding factor modules 
are isomorphic. 


Theorem 8.15 (Schreier Refinement Theorem). Any two series 

M = M 0 2 M { 2 • • • D M„ = {0} and M = N 0 2 Ni D • • • D N k = {0} 
of a module M have equivalent refinements. 

Proof A straightforward adaptation of the proof of Theorem 5.51. • 


Definition. A left W-module is simple (or irreducible) if M f {0} and M has no proper 
submodules. 

As with modules over a commutative ring, the correspondence theorem shows that an 
R-submodule A of a module M is a maximal submodule if and only if M/N is a simple 
module. The proof of Corollary 7.14 can be adapted to show that a left R-module S is 
simple if and only if S = R/I, where / is a maximal left ideal. 
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Definition. A composition series is a series all of whose nonzero factor modules are 
simple. 

Notice that a composition series admits only insignificant refinements; we can merely 
repeat terms (if M, /Af,-+1 is simple, then it has no proper nonzero submodules and, hence, 
there is no intermediate submodule L with M, D L D ;V/, + i). More precisely, any refine¬ 
ment of a composition series is equivalent to the original composition series. 

A module need not have a composition series; for example, the abelian group Z, con¬ 
sidered as a Z-module, has no composition series. 

Definition. A left /(-module M, over any ring R, has the ascending chain condition , 
abbreviated ACC, if every ascending chain of left submodules stops: If 

Si c S 2 c S 3 c • • • 

is a chain of submodules, then there is some t > 1 with 

St — S t +1 = S t + 2 — ■ ■ ■ ■ 

A left /(-module M, over any ring R. has the descending chain condition , abbreviated 
DCC, if every descending chain of left submodules stops: If 

Si 2 s 2 2 s 3 2 • • • 

is a chain of submodules, then there is some t > 1 with 

St = S t +1 = S t + 2 = ■ ■ ■ . 

Most of the theorems proved in Chapter 6 for commutative noetherian rings (for ex¬ 
ample, Proposition 6.38: The equivalence of the ACC, the maximum condition, and finite 
generation of ideals) can be generalized, and with the same proofs, to left modules having 
the ACC. 

Proposition 8.16. 

(i) If a left module M has DCC, then every nonempty family T of submodules contains 
a minimal element ; that is, there is a submodule So € IF for which there is no S e IF 
with S C Sq. 

(ii) If a left module M has ACC, then every nonempty family T of submodules contains 
a maximal element ; that is, there is a submodule Sq e IF for which there is no S e IF 
with S D Sq. 

Proof. Choose S e IF. If S is a minimal element of IF, we are done. Otherwise, there is a 
submodule Si e IF with S f S\. If .S'i is a minimal element, we are done; otherwise, there 
is a submodule S 2 e T with S f Si f Sj. The DCC says that this sequence must stop; 
that is, there is S t e T that is a minimal element of T (for the only obstruction to finding a 
smaller submodule is that S t is minimal). The proof of the second statement is similar. • 
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Proposition 8.17. A module M over any ring R has a composition series if and only if it 
has both chain conditions on submodules. 

Proof If M has a composition series of length n, then no sequence of submodules can 
have length > n, or we would violate Schreier’s theorem (refining a series cannot shorten 
it). Therefore, M has both chain conditions. 

Let T\ be the family of all the proper submodules of M. By Proposition 8.16, the 
maximum condition gives a maximal submodule M\ e T\. Let Ti be the family of all 
proper submodules of M\, and let Mi be maximal such. Iterating, we have a descending 
sequence 

M 2 M\ D M 2 2 ''' • 

If M„ occurs in this sequence, the only obstruction to constructing M n+ \ is if M n — 0. 
Since M has both chain conditions, this chain must stop, and so M, — 0 for some t. This 
chain is a composition series of M, for each M, is a maximal submodule of its predeces¬ 
sor. • 

Theorem 8.18 (Jordan-Holder Theorem). Any two composition series of a module 
M are equivalent. In particular, the length of a composition series, if one exists, is an 
invariant of M, called the length of M. 

Proof. As we remarked earlier, any refinement of a composition series is equivalent to 
the original composition series. It now follows from Schreier’s theorem that any two com¬ 
position series are equivalent; in particular, they have the same length. • 

Let V be a vector space over a field k; if V has dimension n, then V has length n, for if 
vi,..., v„ is a basis of V, then a composition series is 

V = <U1, . . . , V lt ) D (V2, .... V n ) 2 • • • 2 (Vn) 2 {0} 

(the factor modules are one-dimensional, and hence are simple A'-modules). 

Corollary 8.19. If a module M has length n, then every chain of submodules of M has 
length < n. 

Proof. By Schreier’s theorem, there is a refinement of the given chain that is a composi¬ 
tion series, and so the length of the given chain is at most n. • 

The Jordan-Holder theorem can be regarded as a kind of unique factorization theorem; 
for example, we saw in Corollary 5.53 that it gives a new proof of the fundamental theorem 
of arithmetic. 

If A is a division ring, then a left A-module V is called a left vector space over A. The 
following definition from linear algebra still makes sense here. 

Definition. If V is a left vector space over a division ring A, then a list X — x\,..., x m 
in V is linearly dependent if 

Xj e <xi,... Si, • ■ -,x m ) 

for some i ; otherwise, X is called linearly independent. 
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The reader should check that if x\, ..., x m is linearly independent, then 


{x\, ..., x m ) = <xi) © • • • ® ( x m ). 


Proposition 8.20. Every finitely generated left vector space V — {v\,... ,v„) over a 
division ring A is a direct sum of copies of A', that is, every finitely generated left vector 
space over a division ring has a basis. 

Proof. Consider the series 

V = (Ul, 2 ("2, • ■ ■ , Vn) 2 (V3, • • ■ , V n ) 2 ■ ■ ■ 2 (v n ) 2 {0}. 

Denote (i>,+i v n ) by Ui, so that (Vj,... , v n ) — (v,) + U,. By the second isomorphism 
theorem. 


{Vi .... v„)/(vj+l .... Vn) = (( Vi ) + Uj)/Uj = {Vi)/({Vi) n Ui). 

Therefore, the /th factor module is isomorphic to a quotient of (vf = A if i f 0. Since 
A is a division ring, its only quotients are A and {0}. After throwing away those v, cor¬ 
responding to trivial factor modules {0}, we claim that the remaining v’s, denote them by 
v\,..., v m , form a basis. For all j, we have Vj f ( Vj \ i,..., v„). The reader may now 
show, by induction on m, that ..., (v m ) generate a direct sum. • 

Another proof of this proposition, using dependency relations, is sketched in Exer¬ 
cise 8.23(ii) on page 548. 

The next question is whether any two bases of V have the same number of elements. 
The proper attitude is that theorems about vector spaces over fields have true analogs for 
left vector spaces over division rings, but the reader should not merely accept the word of 
a gentleman and a scholar that this is so. 

Corollary 8.21. If V is a finitely generated left vector space over a division ring A, then 
any two bases ofV have the same number of elements. 

Proof. As in the proof of Proposition 8.20, a basis of V gives a series 

V = {v U V 2 ,..., v n ) 2 (v 2 , v n ) 2 (v 3 , ..., Vn) 2 • • • 2 ( Vn) 2 { 0 }- 

This is a composition series, for every factor module is isomorphic to A, which is simple 
because A is a division ring. By the Jordan-Holder theorem, the composition series arising 
from any other basis of V must have the same length. • 

Another proof of this corollary is sketched in Exercise 8.23(iii) on page 548. 

It now follows that every finitely generated left vector space V over a division ring A 
has a left dimension, which will be denoted by dim(V). 

If an abelian group V is a left vector space and a right vector space over a division ring 
A, must its left dimension equal its right dimension? There is an example (see Jacobson, 
Structure of Rings, page 158) of a division ring A and an abelian group V, which is a 
vector space over A on both sides, with left dimension 2 and right dimension 3. 
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We have just seen that dimension is well-defined for left vector spaces over division 
rings. Is the rank of a free left R -module F well-defined for every ring R\ that is, do any 
two bases of F have the same number of elements? In Proposition 7.50, we saw that rank 
is well-defined when R is commutative, and it can be shown that rank is well-defined when 
R is left noetherian; that is, if every left ideal in R is finitely generated (see Rotman, An 
Introduction to Homological Algebra , page 111). However, the next example shows that 
rank is not always well-defined. 

Example 8.22. 

Let k be a field, let V be a vector space over k having an infinite basis {v n : n € N}, 
and let R = Endfc(V). Let A be the left ideal consisting of all the linear transformations 
<p: V -> V for which <p{v2 n ) = 0 for all n, and let B be the left ideal consisting of all 
those linear transformations \[r: V —> V for which f(v 2 n +\) — 0 for all n. We let the 
reader check that A D B — {0} and A + B — R, so that R — A © B. 

Let W be the subspace of V with basis the odd V 2 n +\■ If /: V —> W is a k-isomorphism, 
then the map x/r i-> ffif~ X is an /(-isomorphism 

R — EiufifL) = LiKfi (VP) = A. 

Similarly, if Y is the subspace of V spanned by the even i> 2 „, then R = End*(F) = B. It 
follows that the free left /(-modules R and R © R are isomorphic. ◄ 

There is another useful unique factorization theorem. Call a left /(-module M, over any 
ring R , an indecomposable module if there do not exist nonzero submodules A and B with 
M = A © B. The Krull-Schmidt theorem says that if M has both chain conditions on 
submodules, then M is a direct sum of indecomposable modules: M = A\ © • • • © A„. 
Moreover, if M — B\ © • • • © B m is another decomposition into indecomposables, then 
m — n and there is a permution a e S n with A, = B r , { i ) for all i. A proof can be found in 
Rotman, An Introduction to the Theory of Groups, pages 144-150. 

Here is a surprising result of J. M. Wedderburn. 

Theorem 8.23 (Wedderburn). Every finite division ring D is a field', that is, multipli¬ 
cation in D is commutative. 

Proof. (E. Witt 2 ). If Z denotes the center of D, then Z is a finite field, and so it has q 
elements (where q is a power of some prime). It follows that D is a vector space over Z, 
and so \ D\ — q" for some n > 1; that is, if we define 

[D : Z] =dim z (Z)), 

then [D : Z] — n. The proof will be complete if we can show that n > 1 leads to a 
contradiction. 

If a e D, define C(a ) — {u e D : ua — au}. It is routine to check that C(a ) is a 
subdivision ring of D that contains Z: If u, v e D commute with a, then so do u + v, uv, 

- Wc shall give another proof of this in Theorem 9.123. 
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and u~ l (when u ^ 0). Consequently, |C(a)| = q d ^ for some integer d(a)\ that is, 
[ C(a ) : Z] = cl (a). We do not know whether C(a) is commutative, but Exercise 8.25 on 
page 548 gives 

[D:Z] = [D: C(a)][C(a) : Z], 

where [D : C(a)] denotes the dimension of D as a left vector space over C(a). That is, 
n = [D : C(a)\d(a), and so d(a) is a divisor of n. 

Since D is a division ring, its nonzero elements D x form a multiplicative group of order 
q" — 1. By Example 8.2(ii), the center of the group D x is Z x and, if a e D x , then its 
centralizer Cjjx (a) — C(a) x . Hence, |Z(Z) X )| — q — 1 and |Cqx (a)| = q d ( a ) — ] 5 where 
d(a ) | n. 

The class equation for D x is 

|T> X | = \Z x \ + J2lD x : C D x(a/)], 
i 

where one a, is chosen from each noncentral conjugacy class. But 

[D x : C D x(a,-)] = \D x \/\C D x(cii)\ = (q n - \)/{q d ^ - 1), 


so that the class equation becomes 


q n — i — q — i + ^2/ 


qdtcij) _ 2 


( 1 ) 


We have already noted that each d(ai) is a divisor of n. while the condition that a, is not 
central says that d {at ) < n. 

Recall that the nth cyclotomic polynomial is O,, (x) = nu- — f), where £ ranges over 
all the primitive nth roots of unity. In Corollary 1.41, we proved that <h n (q) is a common 
divisor of q n — 1 and (q n — 1 )/(q d(a '^ — 1 ) for all;, and so Eq. ( 1 ) gives 


'hn(q) I (q ~ 1). 


Ifn > 1 and f is a primitive nth root of unity, then £ ^ 1, and hence £ is some other point 
on the unit circle. Since q is a prime power, it is a point on the v-axis with q > 2, and so 
the distance \q — £| > q — 1. Therefore, 

\®n(q)\ = n Itf _ £1 > q ~ 1’ 

and this contradicts <f>„ (q) \ (q — 1). We conclude that n = 1; that is, I) — Z, and so D is 
commutative. • 

The next discussion will be used in the next section to prove the Wedderburn-Artin 
theorems classifying semisimple rings. Let us consider Horn/A 4, B), where both A and B 
are left R-modules that are finite direct sums: say, A — ^" =1 A, and B — Bj- 

Theorems 7.32 and 7.33, we have 

Hom^(A, B) = Horn/A A,. Bj). 
ij 
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More precisely, if a,: A, -> A is the ;th injection and pj : B —»■ Bj is the /'th projection, 
then each / e Homs (A, B) gives maps f)j — Pj fop e Homs(A ( , Bj). Thus, / defines 
a generalized n x m matrix [fij] (we call [/' f/ ] a generalized matrix because entries in 
different positions need not lie in the same algebraic system). The map / [ fij] is an 

isomorphism Homs(A, B) —»• Homs(A;, Bj). Similarly, if g: B —>■ C, where C = 

Cyfe, then g defines a generalized m x f matrix [gy*], where gjk = qkgPj '■ B j -> Ck, 
Pj : Bj -> B are the injections, and q^: C -> Q- are the projections. 

The composite gf : A -> C defines a generalized n x £ matrix, and we claim that it is 
given by matrix multiplication (g/),-* = Jfj gkjfji- 

Y^Skjfji = ^qkgPjPjfa-i 

j j 

= VkgC^PjPjlfUi 
j 

= qkgfui 
= (gf)ik, 

because f fjpj = 1 B - 

By adding some hypotheses, we can pass from generalized matrices to honest matrices. 

Proposition 8.24. Let V — Y^1=i Vi a ^ e ft B-module. If there is a left R-module L 
and, for each i, an isomorphism (pj : V; —> L, then there is a ring isomorphism 

Endfl(V) = Mat,, (End# (T)). 


Proof. Define 


9: Ends(V) —>■ Mat,, (Ends (L)) 


by 


0: f h* [(p j pjfa i (p i 1 ], 


where a,-: V,- -> V and p ,•: V —» V / / are injections and projections, respectively. That 9 
is an additive isomorphism is just the identity 


Hom(j2 Vi, J2 V ') = E Hom(l/ '’ v 0’ 

i > ij 


which holds when the index sets are finite. In the paragraph discussing generalized ma¬ 
trices, the home of the ij entries was Homs(V;, Vj), whereas the present home of these 
entries is the isomorphic replica Horn/A L, L) = End r{L). 

We now show that 9 preserves multiplication. If g, f e Ends(V), then 9(gf) = 
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[(pjPjgfaiVj *], while the matrix product is 

o(gMf) = [J2&jPj8<*kn l K<PkPkfcti<pr 1 )] 

L k 

= [EvjPjgakPkfaiVr 1 ] 

L k 

k 

- [pjPjgfwr 1 ]- • 

Corollary 8.25. If V is an n-dimensional left vector space over a division ring A, then 
there is an isomorphism of rings 

End A (V) = Mat„(A) op . 

Proof. The isomorphism Endr-(V) = Mat„(A op ) is the special case of Proposition 8.24 
for V = Vi © • • • © V„, where each V,- is one-dimensional, and hence is isomorphic to A. 
Note that End A (A) = A op , by Proposition 8.12. Now apply Proposition 8.13, which says 
that Mat„(A op ) = Mat,, (A) op . • 

The next result involves a direct sum decomposition at the opposite extreme of that in 
Proposition 8.24. 

Corollary 8.26. Let an R-module M be a direct sum M — B\ ® ■ • • © B m in which 
Hom^(Z?,-, Bj) — {0 } for all i j. Then there is a ring isomorphism 

End/;(M) = End«(Z?i) x • • • x End^lB,,,). 

Proof. If /, g e End/}(M), let [fj] and [^,-y] be their generalized matrices. It suffices to 
show that [gij][fij] is the diagonal matrix 

di*kg(.gllflti • • - i gmmfmm )• 

But if i j , then g ik f kj e Horn R (Bt, Bj) = 0; hence, (gf)ij = J2k Sikfkj = 0. • 

Definition. If k is a commutative ring, then a ring R is a k-algebra if R is a 
A:-module and scalars in k commute with everything: 

a(rs) — ( ar)s = r(as) 


for all a e k and r, s e R. 

If R and 5 are ^-algebras, then a ring homomorphism / : R —> S is called a k-algebra 
map if 

f{ar) = af(r) 

for all a e k and r e R\ that is, / is also a map of A-modules. 
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The reason that k is assumed to be commutative (in the definition of A-algebra) can be 
seen in the important special case when A: is a subring of R\ setting ,v = 1 and taking r e k 
gives ar = ra. 

Example 8.27. 

(i) If A = C[x], then A is a C-algebra, and <p: A —> A , defined by q>\ CjX J I-+- 

XC- cj(x — 1 ) J is a C-algebra map. On the other hand, the function 0: A -> A, defined by 
0 : j c j xJ XI / Cj ( x — l)- 7 (where c is the complex conjugate of c), is a ring map but 

it is not a C-algebra map. For example, 9(ix) = —i (x — 1) while i6(x) = i(x — 1). Now 
C[jc] is also an R-algebra, and 6 is an R-algebra map. 

(ii) Every ring R is a Z-algebra, and every ring homomorphism is a Z-algebra map. This 
example shows why, in the definition of /^-algebra, we do not demand that k be isomorphic 
to a subring of R. 

(iii) If A; is a subring contained in the center of R, then R is a A-algebra. 

(iv) If A: is a commutative ring, then Mat,, ( k ) is a A-algebra. 

(v) If A: is a commutative ring and G is a group, then the group algebra kG is a 
A-algehra. -4 

We have already defined the ACC for left modules over any ring. The next definition 
says that a ring R is left noetherian if it has the ACC when viewed as a left module over 
itself (recall that its submodules are the left ideals). When R is commutative, this definition 
specializes to our earlier definition of noetherian ring. 

Definition. A ring R is left noetherian if it has the ACC (ascending chain condition) 
on left ideals: every ascending chain of left ideals 

h c h c h c • • • 

stops: that is, there is some t > 1 with 

It = 4+1 = 4+2 = • • • . 

We define right noetherian rings similarly as those rings having the ACC on right ide¬ 
als. If A is a field, then every finite-dimensional A-algehra A is both left and right noethe¬ 
rian, for if dim(A) = n, then there are at most n strict inclusions in any ascending chain 
of left ideals or of right ideals. In particular, if G is a finite group, then kG is finite¬ 
dimensional, and so it is left and right noetherian. Exercise 8.28 on page 549 gives an 
example of a left noetherian ring that is not right noetherian. 

Proposition 8.28. The following conditions on a ring R are equivalent. 

(i) R is left noetherian. 

(ii) Every nonempty family of left ideals of R contains a maximal element. 

(iii) Every left ideal is finitely generated. 
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Proof. Adapt the proof of Proposition 6.38. • 


Definition. A ring R is left artinian if it has the DCC (descending chain condition ): 
Every descending chain of left ideals 


h 2 h 2 h 2 • • • 

stops: that is, there is some t > 1 with 

It — 4+1 — f+2 = '' ' ■ 

We define right artinian rings similarly, and there are examples of left artinian rings that 
are not right artinian (see Exercise 8.29 on page 549). If k is a field, then every finite¬ 
dimensional ^-algebra A is both left and right artinian, for if dim(A) = n, then there are 
at most n strict inclusions in any descending chain of left ideals or of right ideals. In 
particular, if G is a finite group, then kG is finite-dimensional, and so it is left and right 
artinian. We conclude that kG has both chain conditions (on both sides) when k is a field 
and G is a finite group. 

The ring Z is (left) noetherian, but it is not (left) artinian, because the chain 

Z 3 (2) D (2 2 ) D (2 3 ) 3 • • • 

does not stop. In the next section, we will prove that left artinian implies left noetherian. 

Definition. A left ideal L in a ring R is a minimal left ideal if L f {0} and there is no 
left ideal J with {0} C / C L. 

A ring need not contain a minimal left ideal. For example, Z has no minimal ideals: 
every nonzero ideal I in Z has the form I — (n ) for some nonzero integer n, and I — 
In) D (2n). 

Proposition 8.29. 

(i) Every minimal left ideal L in a ring R is a simple left R-module. 

(ii) If R is left artinian, then every nonzero left ideal I contains a minimal left ideal. 

Proof, (i) If L contained a submodule S with {()} C S C L. then S would be a left ideal 
of R, contradicting the minimality of L. 

(ii) If T is the family of all nonzero left ideals contained in I, then f / 0 because I is 
nonzero. By Proposition 8.16, T has a minimal element, and any such is a minimal left 
ideal. • 

We now define a special ideal, introduced by N. Jacobson, that is the analog of the 
Frattini subgroup in group theory. 
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Definition. If R is a ring, then its Jacobson radical J (R) is defined to be the intersection 
of all the maximal left ideals in R. A ring R is called Jacobson semisimple if J (R) = {0}. 

Clearly, we can define another Jacobson radical: the intersection of all the maximal 
right ideals. It turns out, however, that both of these coincide (see Proposition 8.36). 

The ring Z is Jacobson semisimple. The maximal ideals in Z are the nonzero prime 
ideals ( p ), and so J( Z) = Rp primed) = {0}- If R is a local ring (a commutative ring 
having a unique maximal ideal P), then .1 (R) = P. An example of a local ring is R — 
{a/b e Q : b is odd}; its unique maximal ideal is 

(2) = {2 a/b : bis odd}. 


Example 8.30. 

Let k be a field and let R — Mat,, (k). For any i between 1 and n, let COL(f) denote the fth 
columns; that is. 


col(£) = {A = [aij] e Mat„(£) : atj = 0 for all j l\. 


It is easy to see that COL(f) = REm, where En is the matrix having 1 as its It entry and 
Os everywhere else. We claim that COL(f) is a minimal left ideal in R. If we define 

COL*(f) = y>OL(i-), 

i+l 


then COL* (£) is a left ideal with 


R/COL*(£) = COL(f) 

as left /^-modules. Since COL(f) is a minimal left ideal, it is a simple left R -module, and 
hence COL*(f) is a maximal left ideal. Therefore, 

J(R) c P|col*(£) = {0}, 

i 

so that R = Mat„(T) is Jacobson semisimple. ◄ 


Proposition 8.31. Given a ring R, the following conditions are equivalent for x e R: 

(i) x e J(R)\ 

(ii) for every r e R, the element 1 — rx has a left inverse ; that is, there is u e R with 
u( 1 — rx) — 1; 

(iii) x{R/I) = {0} for every maximal left ideal I ( equivalently, xM = {0 } for every 
simple left R-module M). 
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Proof, (i) =>■ (ii) If there is r e R with 1 — rx not having a left inverse, then R{ 1 — rx) 
is a proper left ideal, for it does not contain 1. Hence, there is a maximal left ideal I with 
1 — rx e R{ 1 — rx) C /, for the proof of Theorem 6.46 (Every proper ideal is contained 
in some maximal ideal) does not use commutativity. Now rx e J (R) c /, because J (R) 
is a left ideal, and so 1 = (1 — rx) + rx el, a contradiction. 

(ii) =>■ (iii) As we mentioned when simple left /^-modules were defined earlier in this 
chapter, a left R-module M is simple if and only if M = R/I, where I is a maximal left 
ideal. 

Suppose there is a simple module M for which xM f {0}; hence, there is m e M with 
xm ^ 0 (of course, m 0). It follows that the submodule Rxm {0}, for it contains 
1 xm. Since M is simple, it has only one nonzero submodule, namely, M itself, and so 
Rxm — M. Therefore, there is r e R with rxm — m\ that is, (1 — rx)m = 0. By 
hypothesis, 1 — rx has a left inverse, say, u( 1 — rx) = 1. Hence, 0 = m(1 — rx)m = m, a 
contradiction. 

(iii) =>■ (i) If x{R/I) = {0}, then x(\ + I) = x + / = /; that is, x e I. Therefore, if 

x(R/I) — {0} for every maximal left ideal I, then x e p| 7 1 = J{R). • 

Notice that condition (ii) in Proposition 8.31 can be restated: x e J(R) if and only if 
1 — z has a left inverse for every z € Rx. 

The following result is frequently used in commutative algebra. 

Corollary 8.32 (Nakayama’s Lemma). If M is a finitely generated left R-module, and 
if JM = M, where J = J (R) is the Jacobson radical, then M — {0}. 

In particular, if R is a local ring, that is, R is a commutative ring with unique maximal 
ideal P, and if M is a finitely generated R-module with PM — M, then M — {0}. 

Proof. Let mi, ..., m n be a generating set of M that is minimal in the sense that no 
proper subset generates M. Since JM = M, we have m \ — ffl=\ r i m i> where r, e J. It 
follows that 

n 

(1 - r\)m\ = Yjinii. 

i=2 

Since r\ e J, Proposition 8.31 says that 1 — r\ has a left inverse, say, u, and so m \ = 
Yi =2 llr i m i- This is a contradiction, for now M can be generated by the proper subset 
{m 2 , ..., m n }. 

The second statement follows at once because J(R) = P when R is a local ring with 
maximal ideal P. • 


Remark. The hypothesis in Nakayama’s lemma that the module M be finitely generated 
is necessary. For example, it is easy to check that R — {a/b e Q : b is odd} is a local ring 
with maximal ideal P — (2), while Q is an /^-module with P Q = 2Q = Q. < 
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Remark. There are other characterizations of J(R). One such will be given in Proposi¬ 
tion 8.36, in terms of units in R (elements having two-sided inverses). Another character¬ 
ization is in terms of left quasi-regular elements: An element x e R is left quasi-regular 
if there is y e R with y o x — 0 (here, y o x — x + y — yx is the circle operation), and 
a left ideal is called left quasi-regular if each of its elements is left quasi-regular. It can 
be proved that J(R) is the unique maximal left quasi-regular ideal in R (see Lam, A First 
Course in Noncommutative Rings, pages 67-68). ◄ 

The next property of an ideal is related to the Jacobson radical. 

Definition. A left ideal A in a ring R is nilpotent if there is some integer m > 1 with 
A m = {0}. 

Recall that A m is the set of all sums of the form a\ ■ ■ ■ a m , where aj e A for all j ; that 
is, A m — { JL an ■ ■ ■ ai m ■ a ij £ A}. It follows that if A is nilpotent, then every a e A 
is nilpotent; that is, a m — 0. On the other hand, if a e R is a nilpotent element, it does 
not follow that Ra, the left ideal generated by a, is a nilpotent ideal. For example, let 
R — Mat 2 (fc), for some commutative ring k, and let a = [o o]- Now a 2 = [j] [}], but Ra 
contains 


o 

o 

'0 

f 


O 

O 

o 

_1 

0 

0 


O 


which is idempotent: e 2 — e. Therefore, e m — e 0 for all m, and so (. Re) m {0}. 

Corollary 8.33. If R is a ring, then I c J (R) for every nilpotent left ideal I in R. 

Proof Let I" — {0}, and let x e I. For every r e R, we have rx e I, and so ( rx) n — 0. 
The equation 

(1 + rx + (rx) 2 + • • • + (rx)” _1 )(l - rx) = 1 
shows that 1 — rx is left invertible, and so x e J(R), by Proposition 8.31. • 

Proposition 8.34. If R is a left artinian ring, then J(R) is a nilpotent ideal. 

Proof Denote J(R) by J in this proof. The descending chain of left ideals, 

J 2 J 2 2 J 3 2 "• , 

stops, because R is left artinian; say, J' n — J"' + 1 = • • •; define I — J'". It follows that 
I — I 2 . We will assume that I f {0} and reach a contradiction. 

Let T be the family of all nonzero left ideals B with IB f {0}; note that T f 0 
because I e T. By Proposition 8.16, there is a minimal element Bq e T. Choose b e Bq 
with lb ^4 {0}. Now 

I(Ib) = I 2 b = lb j=. {0}, 

so that lb c Bq e T, and minimality gives Bq = lb. Since b e Bq, there is x e I C 
J — J(R) with b — xb. Hence, 0 = (1 — x)b. But 1 — x has a left inverse, say, u, by 
Proposition 8.31, so that 0 = u( 1 — x)b — b, and this is a contradiction. • 
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The Jacobson radical is obviously a left ideal, but it turns out to be a right ideal as well; 
that is, J(R) is a two-sided ideal. We begin by giving another source of two-sided ideals. 

Definition. If R is a ring and M is a left /^-module, define the annihilator of M to be 

ann(M) = {a e R : am = 0 for all m e M}. 

Even though it is easy to see that ann(M) is a two-sided ideal in R, we prove that it 
is a right ideal. Let a e ann(M), r e R, and m e M. Since M is a left /^-module, we 
have rm e M; since a annihilates every element of M, we have a(rm) = 0. Finally, 
associativity gives ( ar)m = 0 for all m, and so ar e ann(M). 

Corollary 8.35. 

(i) J (R) — ("j ann(J?/ 1), and so J (R) is a two-sided ideal in R. 

I = maximal 
left ideal 

(ii) R/J(R) is a Jacobson semisimple ring. 

Proof, (i) Let A{R) denote p| ; ann(/?//), where the intersection is over all maximal left 
ideals I. For any left ideal /, we claim that ann (R/I) c I. If a e ann( R/1 ), then, for all 
r e R, we have a(r + /) = ar + I = /; that is, ar e I. In particular, if r = 1, then a e I. 
Hence, A(R) C J(R). 

For the reverse inclusion, assume that I is a maximal left ideal, and define S = R/I; 
maximality of I implies that S is a simple R -module. For each nonzero x e S. define 
< p x : R —> S by q> x : r i-> rx. It is easy to check that (p x is an /?-map, and it is surjective 
because S is simple. Thus, R / ker (p x = S, and simplicity of S shows that the left ideal 
ker <p x is maximal. But it is easy to see that aniif R/I) = p| (:s5 ker <p x . It follows that 
J(R) c A(R). Since J(R ) is equal to A(R), which is an intersection of two-sided ideals, 
J{R ) is a two-sided ideal. 

(ii) First, R/J{R ) is a ring, because J(R) is a two-sided ideal. The correspondence 
theorem for rings shows that if I is any two-sided ideal of R contained in J(R), then 
J(R/I) — J(R)/I; the result follows if I — J(R). • 

Let us now show that we could have defined the Jacobson radical using right ideals 
instead of left ideals. 

Definition. A unit in a ring R is an element u e R having a two-sided inverse; that is, 
there is v e R with 

uv = 1 = vu. 


Proposition 8.36. 

(i) If R is a ring, then 


J(R) = {x e R : 1 + rxs is a unit in R for all r, s e R}. 
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(ii) If R is a ring and J'(R) is the intersection of all the maximal right ideals of R, then 

J'(R) = J(R). 

Proof (i) Let W be the set of all x e R such that 1 + rxs is a unit for all r. s e R. If 
x e W, then setting s = — 1 gives 1 — rx a unit for all r e R. Hence, 1 — rx has a left 
inverse, and so x e J(R ), by Proposition 8.31. Therefore, W C J(R). For the reverse 
inclusion, let x e J{R). Since J(R) is a two-sided ideal, by Corollary 8.35, we have 
xs e J(R ) for all .v e R. Proposition 8.31 says that 1 — rxs is left invertible for all r e R\ 
that is, there is u e R with u( 1 — rxs) = 1. Thus, u — 1 + urxs. Now (-itr)xs e J(R ), 
since J(R) is a two-sided ideal, and so u has a left inverse (Proposition 8.31 once again). 
On the other hand, u also has a right inverse, namely, 1 — rxs. By Exercise 8.16, u is a 
unit in R. Therefore, 1 — rxs is a unit in R for all r. s e R. Finally, replacing r by —r, we 
have 1 + rxs a unit, and so J(R) C W. 

(ii) The description of J(R) in part (i) is left-right symmetric. After proving right-sided 
versions of Proposition 8.31 and Corollary 8.35, one can see that J'(R ) is also described 
as in part (i). We conclude that JfR ) = J(R). • 


Exercises 

8.23 (i) Generalize the proof of Lemma 6.69 to prove that if A is a division ring, then a < S, 

defined by a e (S ), is a dependency relation. 

(ii) Use Theorem 6.71 to prove that every left vector space over a division ring has a basis. 

(iii) Use Theorem 6.72 to prove that any two bases of a left vector space over a division ring 
have the same cardinality. 

8.24 If k is a field and A is a finite-dimensional A-algebra, define 

L = {A fl e End^-fA) : : x i-> ax} 


and 

R = {p a e End ir (A) : p a : x \-± xa}. 

Prove that there are A-algebra isomorphisms 

L = A and R = A°P. 

Hint. Show that the function A —> L defined by a X a is an injective A'-algebra map which 
is surjective because A is finite-dimensional. 

8.25 (i) Let C be a subdivision ring of a division ring D. Prove that D is a left vector space over 

C, and conclude that [D : C] = dimc(£)) is defined. 

(ii) If Z C C C D is a tower of division rings with [D : C] and [C : Z] finite, then [D : Z] 
is finite and 

[D :Z] = [D: C][C : Z], 

Hint. If;/ j, .... u m is a basis of D as a left vector space over C, and if ci, ..., is a 
basis of C as a left vector space over Z, show that the set of all CjUj (in this order) is a 
basis of D over Z. 
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8.26 ( Modular Law). Let A, B, and A' be submodules of a module M. If A' C A. prove that 
A n (B + A') = (A n B) + A'. 

8.27 (i) Let 0 —>■ A —> B —C 0 be an exact sequence of left ^-modules over some ring 

R. Prove that if both A and C have DCC. then B has DCC. Conclude, in this case, that 
A ® B has DCC. 

(ii) Let 0 —^ A —^ B —^ C —^ 0 be an exact sequence of left ^-modules over some ring 
R. Prove that if both A and C have ACC, then B has ACC. Conclude, in this case, that 
A ® B has ACC. 

(iii) Prove that every semisimple ring is left artinian. 

8.28 (L. Small) Prove that the ring of all matrices of the form [ ? 0 ], where a e Z and b, c e Q, is 
left noetherian but not right noetherian. 

8.29 Let R be the ring of all 2 x 2 matrices [ q * ] ■ where a e Q and b.c el. Prove that R is right 
artinian but not left artinian. 

Hint. There are only finitely many right ideals in R, but for every V C R that is a vector 
space over Q, 

'0 V' 

_° °. 

is a left ideal. 

8.30 Give an example of a ring R that is not isomorphic to f?°P. 

8.31 (i) If R is a commutative ring with J (R) = {0}, prove that R has no nilpotent elements. 

(ii) Give an example of a commutative ring R having no nilpotent elements and for which 

J(R) ^ {0}. 

8.32 Let k be a field and R = Mat 2 (L). Prove that a = q] is left quasi-regular, but that the 
principal left ideal Ra is not a left quasi-regular ideal. 

8.33 (i) If A is a division ring, prove that a finite subgroup of A x need not be cyclic. Compare 

with Theorem 3.30. (S. A. Amitsur has found all the finite subgroups of multiplicative 
groups of division rings.) 

(ii) If A is a division ring whose center is a field of characteristic p > 0, prove that every 
finite subgroup G of A x is cyclic. 

Hint. Consider F p G, and use Theorem 8.23. 

8.34 If R is a ring and M is a left /^-module, prove that Hom^ (R, M) is a left R -module, and prove 
that it is isomorphic to M. 

Hint. If /: R -»• M and r' e R, define r' f : r rr'. 

8.35 If k is a field of characteristic 0, then End,t(/r[f]) contains the operators 

/(f) £f«) and y: f(t) tf(t). 


= |[§sf»H 


(i) If A i (k) is the subalgebra of End^ (&[t]) generated by x and y, prove that 

yx = xy + 1. 

(ii) Prove that A ] (k) is a left noetherian ring having no proper nontrivial two-sided ideals 
that satisfies the left and right cancellation laws (if a ^0, then either equation ab = ac 
or ba = ca implies b = c). 
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Remark. Exercise 8.35 can be generalized by replacing fc[f] by k[t\, ..., t n \, the operator x by 
partial derivatives 

Xi : / 0 i. t „) i-» J 7 /O 1 . t n ), 

and the operator y by 

yi -fin . t„) m- .f„). 

The subalgebra A„ ( k ) of Endi(k[fi generated by x \, ..., x n , y \,..., y n is called the /ith 

Weyl algebra over k. H. Weyl introduced this algebra to model momentum and position operators in 
quantum mechanics. It can be shown that A n ( k) is a left noetherian simple domain for all n > I (see 
McConnell-Robson, Noncommutative Noetherian Rings, page 19). ◄ 


8.3 Semisimple Rings 

A group is an abstract object; we can picture it only as a “cloud,” a capital letter G. Of 
course, there are familiar concrete groups, such as the symmetric group S n and the general 
linear group GL(V) of all nonsingular linear transformations of a vector space V over a 
field k. Representations of a finite group G are homomorphisms of G into such familiar 
groups, and they are of fundamental importance for G. 

We begin by showing the connection between group representations and group rings. 

Definition. A k-representation of a group G is a homomorphism 

o-: G -> GL(V), 

where V is a vector space over a field k. 

Note that if dim(V) = n, then GL(V) contains an isomorphic copy of S„ [if tq,..., v„ 
is a basis of V and a e S„, then there is a nonsingular linear transformation T: V -> V 
with T(vi) — v a (i ) for all ; ]; therefore, permutation representations are special cases of 
^-representations. Representations of groups can be translated into the language of kG- 
modules (compare the next proof with that of Proposition 8.8). 

Proposition 8.37. Every k-representation a : G —> GL(V) equips V with the structure 
of a left kG-module; denote this module by V a . Conversely, every left kG-module V 
determines a k-representation o : G —> GL(V). 

Proof Given a homomorphism a : G —> GL(V), denote cr(g): V —» V by o g , and 
define an action kG x V —> V by 

(ZX#) v = Ew*')- 

geG geG 

A routine calculation shows that V, equipped with this scalar multiplication, is a left kG- 
module. 

Conversely, assume that V is a left AG-module. If g e G, then v i->- gv defines a linear 
transformation T g : V —> V ; moreover, 7„ is nonsingular, for its inverse is T g -i. It is easily 
checked that the function o : G —»• GL( V ), given by a : g i-> T g , is a k-representation. • 
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If r: G —> GL(V) is another A-representation, when is V T = V a , where V T and V a 
are the A'G-modules determined by r, cr, respectively, in Proposition 8.37? Recall that if 
T : V —»• V is a linear transformation, then we made V into a A[x]-module we denoted by 
V 1 , and we saw, in Proposition 7.3, that if S: V -> V is another linear transformation, 
then V s = V 1 if and only if there is a nonsingular q> : V -> V with S — <pTcp~ [ . 


Proposition 8.38. Let G be a group and let a. r: G —> GL(V) be k-representations, 
where k is afield. If V a and V T are the corresponding kG-modules defined in Propo¬ 
sition 8.37, then V a = V T as kG-modules if and only if there exists a nonsingular 
tp: V —> V with 

(pr(g) = cr(g)(p 


for every g e G. 


Remark. We often say that <p intertwines a and r. < 

Proof. If tp\ V T -» V n is a AG-isomorphism, then i p: V —> V is an isomorphism of 
vector spaces with 

Oggv ) = (J2 a gs) < P(v) 

for all v e V and all g e G. But the definition of scalar multiplication in V T is gv = 
r(g)(v), while the definition of scalar multiplication in V° is gv = o(g)(v). Hence, for 
all g e G and v e V. we have q>{r(g)(v)) — cr(g)iq>{v)). Therefore, 

(pr(g) = crig)(p 


for all g e G. 

Conversely, the hypothesis gives (prig) — crig)(p for all g e G, where q> is a nonsingular 
A-linear transformation, and so (p(r(g)v) = a(g)(p(v) for all g e G and v e V. It now 
follows easily that (p is a AG-isomorphism; that is, (p preserves scalar multiplication by 
J2g€G a g8- • 

Let us rephrase the last proposition in terms of matrices. 

Corollary 8.39. Let G be a group and let cr, r: G —> Mat,, (A) be k-representations. 
Then (A") CT = (A") T as kG-modules if and only if there is a nonsingular n x n matrix P 
with 

Prix)P~ l — crix) 

for every x e G. 


Example 8.40. 

If G is a finite group and V is a vector space over a field A, then the trivial homomorphism 
a : G —*■ GL(V ) is defined by cr(x) — ly for all x e G. The corresponding AG-module 
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V n is called the trivial £G-module: If v e V, then xv — v for all x e G. The trivial 
module k (also called the principal kG-module) is denoted by 

Vo(k). ◄ 


We now introduce an important class of rings; it will be seen that most group algebras 
kG are semisimple rings. 

Definition. A left R -module is semisimple if it is a direct sum of simple modules. A ring 
R is left semisimple if it is a direct sum of minimal left ideals. 3 

Recall that if a ring R is viewed as a left /G module, then its submodules are its left 
ideals; moreover, a left ideal is minimal if and only if it is a simple left R -module. 

The next proposition generalizes Example 8.30. 

Proposition 8.41. If a ring R is left semisimple, then it has both chain conditions on left 
ideals. 

Proof. Since R is left semisimple, it is a direct sum of minimal left ideals: R = Li- 
Let 1 = e;, where e, e If r — r, e JT L,, then r — lr and so r, = e, r,. 

Hence, if e,- = 0, then L, = 0. We conclude that there are only finitely many nonzero L, ; 
that is, R — L \ © • • • © L n . Now the series 

R = Lx ® • • • ® L n D L 2 © • • • © L n D • • • D L n 2. {0} 

is a composition series, for the factor modules are L i, ..., L n , which are simple. It follows 
from Proposition 8.17 that R (as a left R -module over itself) has both chain conditions. • 

We now characterize semisimple modules over any ring. 

Proposition 8.42. A left module M (over any ring ) is semisimple if and only if every 
submodule of M is a direct summand. 

Proof. Suppose that M is semisimple; hence, M = Y2j e j Sj , where each Sj is simple. 
For any subset I C /, define 

si = E s j- 

jel 

If B is a submodule of M , Zorn’s lemma provides a subset K C. J maximal with the 
property that Sk GB = {0}. We claim that M = B®Sk- We must show that M — B + Sk, 
for their intersection is {0} by hypothesis, and it suffices to prove that Sj c B + Sk for 


3 We can define a ring to be right semisimple if it is a direct sum of minimal right ideals. However, we shall 
see in Corollary 8.57 that a ring is a left semisimple ring if and only if it is right semisimple. 
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all j e J. If j e K, then Sj C Sk c B + Sk- If j f K, then maximality gives 
(S K + Sj) nfi / {0}. Thus, 

sk + Sj = b / 0, 

where sk e Sk. Sj e Sj, and b e B. Note that Sj ^ 0, lest sk = b e Sk n B = {0}. 
Hence, 

Sj = b — sk €= S j Cl (B S k), 

and so Sj fl (B + Sk) {0}. But Sj is simple, so that Sj — Sj fl (B + Sk), and so 
Sj c B + Sk, as desired. Therefore, M — B © Sk- 

Now assume that every submodule of M is a direct summand. 

(i) Every nonzero submodule B contains a simple summand. 

Let b e B be nonzero. By Zorn’s lemma, there exists a submodule C of II maximal 
with b £ C. By Corollary 7.18, C is a direct summand of B: There is some submodule 
D with B — C © D. We claim that D is simple. If D is not simple, we may repeat the 
argument just given to show that D — D' © D" for nonzero submodules />' and D". Thus, 

B = C®D = C®D' © D". 

We claim that at least one of C ® D' or C © D" does not contain the original element 
b. Otherwise, b = c' + d' = c" + d", where c', c" e C, d' e D', and d" e D". But 
d - c" = d" - d' e C O D = {0} gives d' = d" e D' O D" = {0}. Hence, d' = d" = 0, 
and so b = d e C, contradicting the definition of C. Finally, either C ® />' oi' C ® D" 
contradicts the maximality of C. 

(ii) M is left semisimple. 

By Zorn’s lemma, there is a family {Sj : j e 1} of simple submodules of M maximal 
such that the submodule U they generate is their direct sum: U — ;e / Sj. By hypothesis, 

U is a direct summand: M — U © V for some submodule V of M. If V — {0}, we are 
done. Otherwise, by part (i), there is some simple submodule S contained in V that is a 
summand: V — S © V' for some V' C V. The family [Sj : j e /} U {5} violates the 
maximality of the first family of simple submodules, for this larger family also generates 
its direct sum. Therefore, V — {0} and M is left semisimple. • 


Corollary 8.43. 

(i) Every submodule and every quotient module of a semisimple module M is itself left 
semisimple. 

(ii) If R is a (left) semisimple ring, then every left R-module M is a semisimple module. 

(iii) If I is a two-sided ideal in a semisimple ring R, then the quotient ring R/I is also a 
semisimple ring. 
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Proof, (i) Let B be a submodule of M. Every submodule C of B is, clearly, a submodule 
of M. Since M is left semisimple, C is a direct summand of M and so, by Corollary 7.18, 
C is a direct summand of B. Hence, B is left semisimple, by Proposition 8.42. 

Let M/H be a quotient of M. Now H is a direct summand of M, so that M — H © H' 
for some submodule H' of M. But //' is left semisimple, by the first paragraph, and 
M/H = H'. 

(ii) There is a free left /(-module F and a suijective /(-map <p: F M. Now R is a 
semisimple module over itself (this is the definition of semisimple ring), and so F is a 
semisimple module. Thus, M is a quotient of the semisimple module F, and so it is itself 
semisimple, by part (i). 

(iii) First, R/I is a ring, because / is a two-sided ideal. The left R -module R // is semisim¬ 
ple, by (i), and so it is a direct sum R/I = Sj, where the Sj are simple left /(-modules. 
But each Sj is also simple as a left (/(//(-module, for any (/(//(-submodule of Sj is also 
an /(-submodule of Sj. Therefore, R/I is semisimple. • 

Corollary 8.44. 

(i) A finitely generated left semisimple R-module M ( over a ring R) is a direct sum of 
a finite number of simple left modules. In particular, a left semisimple ring R is a 
direct sum of a finite number of minimal left ideals. 

(ii) The direct product R — R[ x • • • x R m of left semisimple rings R\, ..., R m is also 
a left semisimple ring. 

Proof, (i) Let xi, , x n be a generating set of M. Since M is left semisimple, it is a 
direct sum of simple left modules, say, M = j Sj. Now each xj = J Zj s ij > where 
Sjj e Sj, has only a finite number of nonzero components. Hence, {x\, ..., x„) involves 
only finitely many Sj ’s, say, Sj t ,..., Sj,. Therefore, 

M c <*i,.. x„) c S jl © • • • © Sj, c M. 

As a left semisimple module over itself, R is cyclic, hence finitely generated. Therefore, 
R is a direct sum of only finitely many simple left submodules; that is, R is a direct sum of 
finitely many minimal left ideals. 

(ii) Since each /(, is left semisimple, it is a direct sum of minimal left ideals, say, /(; = 
7/1 © • • • © Ji t(i ). Each J,k is a left ideal in R, not merely in /(,-, as we saw in Example 8.5. 
It follows that J !k is a minimal left ideal in R. Hence, R is a direct sum of minimal left 
ideals, and so it is a left semisimple ring. • 

It follows that a finite direct product of fields is a commutative semisimple ring (we will 
prove the converse later in this section). For example, if n is a squarefree integer, then the 
Chinese remainder theorem implies that I„ is a semisimple ring. Similarly, if k is a field 
and f(x) e £[;t] is a product of distinct irreducible polynomials, then k[x]/(f (x)) is a 
semisimple ring. 
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We can now generalize Proposition 8.20: Every, not necessarily finitely generated, left 
vector space over a division ring A has a basis. Every division ring is a left semisimple 
ring, and A itself is the only minimal left ideal. Therefore, every left A-module M is a 
direct sum of copies of A; say, M = A,. If x, e A, is nonzero, then X = {x,- : i e /} 
is a basis of M. This observation explains the presence of Zorn’s lemma in the proof of 
Proposition 8.42. 

The next result shows that left semisimple rings can be characterized in terms of the 
Jacobson radical. 

Theorem 8.45. A ring R is left semisimple if and only if it is left artinian and J(R) — {0}. 

Proof. If R is left semisimple, then there is a left ideal I with R — J (R) © /, by 
Proposition 8.42. It follows from Exercise 8.15(ii) on page 532 that there are idempotents 
e e J(R) and / e I with 1 = e + f. Since e e J(R), Proposition 8.31 says that f — 1—e 
has a left inverse; there is u e R with uf = 1. But / is an idempotent, so that f — f 2 . 
Hence, 1 = uf — uf 2 — (uf)f — /, so that e — 1 — / = 0. Since J(R)e — J{R), by 
Exercise 8.15(ii) on page 532, we have J (R) — {0}. Finally, Proposition 8.41 shows that 
R is left artinian. 

Conversely, assume that R is left artinian and J (R) = {0}. We show first that if I is a 
minimal left ideal of R , then I is a direct summand of R. Now I f {0}, and so I £ J{R)', 
therefore, there is a maximal left ideal A not containing I. Since I is minimal, it is simple, 
so that I fl A is either I or {0}. But I fl A = I implies / C A, a contradiction, and so 
I fl A = {0}. Maximality of A gives I + A = R, and so R — I © A. 

Choose a minimal left ideal I\, which exists because R is left artinian. As we have just 
seen, R — I\ © B\ for some left ideal By. Now B\ contains a minimal left ideal, say, A, by 
Proposition 8.29(ii), and so there is a left ideal Ih with B\ = h(& /C- This construction 
can be iterated to produce a strictly decreasing chain of left ideals B\ D Ih f f B r+ \ 
as long as B, f {0}. If B, f {0} for all r, then the DCC is violated. Therefore, B r = {0} 
for some r, so that /? = /[©•••©/,. and R is semisimple. • 

Note that the chain condition is needed. For example, Z is Jacobson semisimple, that 
is, J fl) — {0}, but Z is not a semisimple ring. 

We can now prove the following remarkable result. 

Theorem 8.46 (Hopkins-Levitzki). If a ring R is left artinian, then it is left noetherian. 

Proof. It suffices to prove that R. regarded as a left module over itself, has a composition 
series, for then Proposition 8.17 applies at once to show that R is left noetherian as a 
module over itself; that is, R has the ACC on left ideals. 

If / = J (R ) denotes the Jacobson radical, then J m — {0} for some m > 1, by Proposi¬ 
tion 8.34, and so there is a chain 

fi = /°D/3/ 2 3J 3 D..O J m = {0}. 

Since each J q is an ideal in R, it has the DCC, as does its quotient J q /J q+l . Now R/ J 
is a semisimple ring, by Theorem 8.45 [it is left artinian, being a quotient of a left artinian 
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ring, and Jacobson semisimple, by Corollary 8.35(ii)]. The factor module J q / J q+l is an 
(/(//(-module; hence, by Corollary 8.43, J q / J q+1 is a semisimple module, and so it can 
be decomposed into a direct sum of (possibly infinitely many) simple (/(/./(-modules. But 
there can be only finitely many summands, for every (/(//(-submodule of J q /J q+l is 
necessarily an /(-submodule, and J q /J q+l has the DCC on /(-submodules. Hence, there 
are simple (/(// (-modules .S', with 

j q /j q + l = S l ®S 2 ®---®S p . 

Throwing away one simple summand at a time yields a series of J q / J q+1 whose ith factor 
module is 

(Si © S i+ 1 0 • • • 0 S p )/(S i+ 1 © • • • © S p ) = Si. 

Now the simple (/(//(-module .S', is also a simple /(-module, for it is an /(-module an¬ 
nihilated by /, so that we have constructed a composition series for J q / J q+l as a left 
/(-module. Finally, refine the original series for R in this way, for every q , to obtain a 
composition series for R. • 

Of course, the converse of Theorem 8.46 is false. 

The next result is fundamental. 

Theorem 8.47 (Maschke’s Theorem). If G is a finite group and k is a field whose 
characteristic does not divide |G|, then kG is a left semisimple ring. 

Remark. The hypothesis always holds if k has characteristic 0. ◄ 

Proof By Proposition 8.42, it suffices to prove that every left ideal I of kG is a direct 
summand. Since k is a field, kG is a vector space over k and / is a subspace. By Corol¬ 
lary 6.49, / is a (vector space) direct summand: There is a subspace V (which may not be 
a left ideal in kG) with kG = I © V. There is a A-linear transformation d: kG —»■ I with 
d ( b ) = b for all b e I and with ker d — V [each u e kG has a unique expression of the 
form u = b + v, where he! and v e V, and d(u ) = b]. Were d a kG- map, not merely a 
A'-map, then we would be done, by the criterion of Corollary 7.17: / is a summand of kG 
if and only if it is a retract; that is, there is a AG-map D : kG —»• I with D{u) — u for all 
u e I. We now force d to he a AG-map by an “averaging” process. 

Define D : kG —> kG by 

1 x ^ 

D(u ) = - y, x d(x u) 

^ xeG 

for all u e kG. Note that |G| f 0 in A, by the hypothesis on the characteristic of A, and so 
it is invertible. It is obvious that D is a A-map. 

(i) im DC/. 

If a e kG and x e G, then d(x~ l u ) e I (because im d c /), and xd(x~ l u ) e I 
because / is a left ideal. Therefore, D(u) e /, for each term in the defining sum of D(u ) 
lies in I. 
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(ii) If b e I. then l)(b) = b. 

Since b e /, so is x~ l b , and so d(x~ l b ) = x~ l b. Hence, xd(x~ l b ) = xx~ l b = b. 
Therefore, xd(x~ l b) = \G\b, and so D(b ) = b. 

(iii) D is a kG- map. 

It suffices to prove that D(gu) = gD(u) for all g e G and all u e kG. But 

1 \ - -i 

gD(u) - — 2_^ gxd(x u ) 

' ' xeG 

= gxd{x~ l g _1 gu) 

1 1 xeG 

1 x 

= y d( y su) 

1 1 y=gxeG 

= D(gu) 

(as x ranges over all of G, so does y — gx). • 

The converse of Maschke’s theorem is true: If G is a finite group and k is a field whose 
characteristic p divides |G|, then kG is not left semisimple; a proof is outlined in Exer¬ 
cise 8.37 on page 573. 

Before analyzing left semisimple rings further, let us give several characterizations of 
them. 

Proposition 8.48. The following conditions on a ring R are equivalent. 

(i) R is left semisimple. 

(ii) Every left R-module is a semisimple module. 

(iii) Every left R-module is injective. 

(iv) Every short exact sequence of left R-modules splits. 

(v) Every left R-module is projective. 

Proof, (i) =>■ (ii). This follows at once from Corollary 8.43(ii), which says that if R is a 
semisimple ring, then every /^-module is a semisimple module. 

(ii) => (iii). If £ is a left /^-module, then Proposition 7.64 says that E is injective if every 
exact sequence 0—>C—>0 splits. By hypothesis, B is a semisimple module, 
and so Proposition 8.42 implies that the sequence splits; thus, E is injective. 

(iii) =>■ (iv). If 0 -> A —> B -> C -> 0 is an exact sequence, then it must split because, as 
every module, A is injective (see Proposition 7.64). 

(iv) =>• (v). Given a module M. there is an exact sequence 
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where F is free. This sequence splits, by hypothesis, and so F = M ® F'. Therefore, M 
is a direct summand of a free module, and hence it is projective (see Theorem 7.56). 

(v) => (i). If I is a left ideal of R, then 

0 ->/->/?-> /?// —> 0 

is an exact sequence. By hypothesis, R/I is projective, and so this sequence splits (see 
Proposition 7.54); that is, I is a direct summand of R. By Proposition 8.42, R is a semisim¬ 
ple left R-module. Therefore, R is a left semisimple ring. • 

Modules over semisimple rings are so nice that there is a notion of global dimension 
of a ring R that measures how far removed R is from being semisimple; we will discuss 
global dimension in Chapter 11. 

Here are more examples of left semisimple rings; the Wedderburn-Artin theorem will 
say that there are no others. 

Proposition 8.49. 

(i) If A is a division ring and V is a left vector space over A with dim(V) = n, then 
EndA( V) = Mat„(A op ) is a left semisimple ring. 

(ii) If Ai, ..., A„, are division rings, then 


Mat,,, (Ai) x • • • x Mat„ m (A,„) 


is a left semisimple ring. 

Proof, (i) By Proposition 8.24, we have 

EndA(V) = Mat„(EndA(A)); 

by Proposition 8.12, EikIa(A) = A op . Therefore, EndA(V) = Mat„(A op ). 

Let us now show that EiuIaIT) is semisimple. If iq..... v n is a basis of V, define 

Col O') = {T e End A (V) : T(v t ) = 0 for all i f j}. 

It is easy to see that Col(y) is a left ideal in EndA(V): If S € EndA(V), then S(Ti >,) = 0 
for all i f j. Recall Example 8.30: If we look in Mat„(A op ) = EndA(V), then Col(j) 
corresponds to COL(y), all those matrices whose entries off the /th column are 0. It is 
obvious that 

Mat„(A op ) = COL(l) ® • • • ® COL(h). 

Hence, EndA(V) is also such a direct sum. We asserted, in Example 8.30, that each COLO) 
is a minimal left ideal, and so EiuIaIT) is a left semisimple ring. Let us prove minimality 
ofColO). 

Suppose that I is a nonzero left ideal in End a (L) with I C Col(/). Choose a nonzero 
F e I; now F(vj) — u f 0, for otherwise F would kill every basis element and, hence. 
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would be 0. If T e Col(;), write T(vj) = w. Since u ^ 0, there is S e EndA(V) with 
S(u) = w. Now 


SFM = 


! ° 

| S(u) — w 


if' + 

if i = j. 


Therefore, T = SF, because they agree on a basis, and so T el, because I is a left ideal. 
Therefore, Coif/) = I , and Coif /) is a minimal left ideal. 

(ii) This follows at once from part (i) and Proposition 8.44(h), for if A is a division ring, 
then so is A op , by Exercise 8.13 on page 532. • 


Corollary 8.50. If V is an n-dimensional left vector space over a division ring A, then 
the minimal left ideals Coif;), for 1 < j < n, in EikIa(V) are all isomorphic. 

Proof. Let tq,..., v n be a basis of V. For each j, define pj : V —> V to be the linear 
transformation that interchanges Vj and uj and that fixes all the other i>,-. It is easy to see 
that T i-^ Tpj is an isomorphism Colf 1) —> Colfy). • 

We will see, in Lemma 8.61(h), that all the minimal left ideals in End a (C) are isomor¬ 
phic. 


Definition. A ring R is simple if it is nonzero and it has no proper nonzero two-sided 
ideals. 

In Proposition 8.59, we will see that every left artinian simple ring is semisimple. 


Proposition 8.51. If A is a division ring, then R — Mat,, (A) is a simple ring. 

Proof. A matrix unit E pq is the n x n matrix all of whose entries are 0 except the p. q 
entry, which is 1. The matrix units form a basis for Mat,, (A) viewed as a left vector space 
over A, for each matrix A — [a,/] has a unique expression 


A — ^ ^ at j Eij. 
ij 

[Of course, this says that dim(Mat„(A)) = n 2 .\ A routine calculation shows that matrix 
units multiply according to the following rule: 


Eij Em 


j 0 if; 

\Eu if j=k. 


Suppose that A is a nonzero two-sided ideal in Mat,, (A). If A is a nonzero matrix in 
N, it has a nonzero entry; say, a,-; f 0. Since A is a two-sided ideal, A contains E jv A E jq 
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for all p, q. But 


EpiAEjq — Epi ^auEuEjq 

ki 

— E pi ^ ' Clj Ekq 
k 

— ^ ' Ukj E pj E^g 
k 

= Clij Epq. 

Since ^ 0 and A is a division ring, a~j l e A, and so E pq e N for all p, q. But the 
collection of all E pq span the left vector space Mat,, (A) over A, and so N — Mat,, (A). • 

We are now going to prove the converse of Proposition 8.49(h): Every left semisimple 
ring is isomorphic to a direct product of matrix rings over division rings. The first step 
shows how division rings arise. 

Theorem 8.52 (Schur’s Lemma). Let M and M’ be simple left R-modules, where R is 
a ring. 

(i) Every nonzero R-map f: M —> M’ is an isomorphism. 

(ii) Endft(M) is a division ring. In particular, if L is a minimal left ideal in a ring R, 
then Ends(L) is a division ring. 

Proof, (i) Since M is simple, it has only two submodules: M itself and {0}. Now the 
submodule ker f M because / / 0, and so ker / = {0}; that is, / is an injection. 
Similarly, the submodule imf^f {0}, so that im f = M' and / is a surjection. 

(ii) If /: M —> M and / f 0, then / is an isomorphism, by part (i), and hence it has an 
inverse f~ l e Ends(M). Thus, the ring End^(M) is a division ring. • 

Lemma 8.53. If L and L' are minimal left ideals in a ring R, then each of the following 
statements implies the one below it: 

(1) LL' {0}; 

(2) Hom«(L, L') {0}, and there exists b' e L' with L' = Lb'; 

(3) L = L' as left R-modules. 

If also L 2 {0}, then (3) implies (1), and the three statements are equivalent. 

Proof. Let L and IJ be minimal left ideals. 

(1) =4 (2) 

If LL' f {0}, then there exists b e L and b' e L’ with hh' f 0. Thus, the function 
f:L -> L' , defined by i h- xb' ’, is a nonzero /?-map, and so Hom^(L, L') {0}. 

Moreover, Lb' = L' , for it is a nonzero submodule of the minimal left ideal L'. 
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(2) =► (3) 

If Hom^(L, L') ^ {0}, then there is a nonzero /: L —»■ L', and / is an isomorphism, 
by Schur’s lemma; that is, L = L'. 

(3) and L 2 ± {0} =► (1) 

Assume now that L 2 ^ {0}, so there are x,y e L with xy ^0. If g: L -* L' is an 
isomorphism, then 0 ^ g(xy) — xg(y) e LL', and so LL' ^ {0}. • 

Note that if J (R) = {0}, then L 2 ^ {0}. Otherwise, L is a nilpotent left ideal and 
Corollary 8.33 gives L C .H R) — {0}, a contradiction. 

Proposition 8.54. If R = J2j L j is a left semisimple ring, where the L j are minimal left 
ideals, then every simple R-module S is isomorphic to some Lj. 

Proof. Now S = Hom^(/C S ) ^ {0}, by Exercise 8.34 on page 549. If Hom^(L ; -, S) — 
{0} for all j, then Hom^(/?, S ) = {0} (for R — Lj©- • -®L m ). Hence, Hom/?(Z. j, S) {0} 
for some j. Since both L j and 5 are simple. Theorem 8.52(i) gives Lj = S. • 

Here is a fancier proof. 

Proof. By Corollary 7.14, there is a left ideal I with S = R/I, and so there is a series 

#2 / 2 {0}. 

In Proposition 8.41, we saw that 

R = Li © • • • © L n 2 L 2 © • • • © L n 2 • • • 2 L„ 2 {0} 

is a composition series with factor modules L \, ..., L n . The Schreier refinement theorem 
(Theorem 8.15) now says that these two series have equivalent refinements. Since a com¬ 
position series admits only refinements that repeat a term, the factor module S occurring 
in the refinement of the first series must be isomorphic to one of the factor modules in the 
second series; that is, S = Li for some i. • 

Example 8.55. 

The trivial A'G-module Xfik) (see Example 8.40) is a simple AG-module (for it is 
one-dimensional and so has no subspaces other than {0} and itself). By Proposition 8.54, 
Vo (k) is isomorphic to some minimal left ideal L of kG. We shall find L by searching for 
elements u — a g8 i n kG with hu — u for all h e G. For such elements n, 

hu — aghg = a g g — u. 

g€G geG 

Since the elements in G form a basis for the vector space kG, we may equate coefficients, 
and so a g = ai lg for all g e G; in particular, a\ = ah. As this holds for every h e G, all 
the coefficients a g are equal. Therefore, if we define y e kG by 

y = 12 S’ 

geG 
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then u is a scalar multiple of y. It follows that L = (y) is a left ideal isomorphic to the 
trivial module Vo(k); moreover, (y) is the unique such left ideal. -4 

An abstract left semisimple ring R is a direct sum of minimal left ideals: R — . Lj. 

and we now know that End r ( L ; ) is a division ring for every j. The next step is to find 
the direct summands of R that will ultimately turn out to be matrix rings; they arise from a 
decomposition of R into minimal left ideals by collecting isomorphic terms. 

Definition. Let R be a left semisimple ring, and let 

R — L i © * * * © L n , 

where the L j are minimal left ideals. Reindex the summands so that no two of the first m 
ideals L[,..., L m are isomorphic, while every Lj in the given decomposition is isomor¬ 
phic to some Lj for 1 < i < m. The left ideals 

Bi E L J 

Lj=Lj 

are called the simple components of R relative to the decomposition R — ^ • L j. 

We shall see, in Corollary 8.62, that the simple components do not depend on the par¬ 
ticular decomposition of R as a direct sum of minimal left ideals. 

We divide the Wedderburn-Artin 4 theorem into two parts: an existence theorem and a 
uniqueness theorem. 

Theorem 8.56 (Wedderburn-Artin I). A ring R is left semisimple if and only if R is 
isomorphic to a direct product of matrix rings over division rings. 

Proof. Sufficiency is Proposition 8.49. 

For necessity, assume that R is left semisimple. Now R is the direct sum of its simple 
components: 

R — B\ © • • • © B m , 

where each Bi is a direct sum of isomorphic minimal left ideals. Proposition 8.12 says that 
there is a ring isomorphism 

R op = Endfl(R), 

where R is regarded as a left module over itself. Now Hom^(Z?/, If ) = {0} for all i f j, 
by Lemma 8.53, so that Corollary 8.26 applies to give a ring isomorphism 

R op = End^(R) = End^(Z?i) x • • • x End^(B m ). 

By Proposition 8.24, there is an isomorphism of rings 

End/Afi/) = Mat„,(Endfl(L/)), 

AVedderburn proved the theorem for semisimple ^-algebras, where k is a field; Artin generalized the theorem 
as it is stated here. This theorem is why artinian rings are so called. 
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because B , is a direct sum of isomorphic copies of L,-. By Schur’s lemma. End r(L{) is a 
division ring, say. A,, and so 


R op = Mat„j (Ai) x • • • x Mat„ m (A,„). 


Hence, 

R = [MaMAOfP x ■ ■ ■ x [Mat„ m (A,„)] op . 

Finally, Proposition 8.13 gives 

R = Mat,,, (A° p ) x • • • x Mat„ m (A°f). 

This completes the proof, for A° p is also a division ring for all i, by Exercise 8.13 on 
page 532. • 


Corollary 8.57. A ring R is left semisimple if and only if it is right semisimple. 

Proof. It is easy to see that a ring R is right semisimple if and only if its opposite ring 
R op is left semisimple. But we saw, in the middle of the proof of Theorem 8.56, that 


R op = Mat ni (Aj) x • • • x Mat„ m (A m ), 


where A, = End^(L, ). • 

As a consequence of this corollary, we say that a ring is semisimple without the adjec¬ 
tives left or right. 

Corollary 8.58. A commutative ring R is semisimple if and only if it is isomorphic to a 
direct product of finitely many fields. 

Proof. A field is a semisimple ring, and so a direct product of finitely many fields is also 
semisimple, by Corollary 8.44(ii). Conversely, if R is semisimple, it is a direct product of 
matrix rings over division rings. Since R is commutative, all the matrix rings must be of 
size lxl and all the division rings must be fields. • 

Even though the name suggests it, it is not yet clear that a simple ring is semisimple. In¬ 
deed, this is false without assuming the DCC (see Lam, A First Course in Noncommutative 
Rings , page 43, for an example of a simple ring that is not semisimple). 

Proposition 8.59. A simple left artinian ring R is semisimple. 

Proof. (Rieffel) First, we show that if L is any nonzero left ideal in R and A = End/AT), 
then R = End^(T). Now L is a left A-module [with scalar multiplication A x T —> L 
given by (/, a) i-> f(a) for all / e A and a e L], 

Define <p: R —> EndA (T) by (p, being left multiplication by r: 


(p r (a) — ra 
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for all r e R and a e A. Note that <p r is a A-map: If / e A = End^(A), then 
<Pr(f(a )) = rf(a) = f(ra) = f<p r (a). 

It is easy to check that q> is a ring homomorphism; in particular, <p\ is the identity function 
on A. Since <p is not the zero map, ker <p f R. But R is a simple ring and ker <p is a 
two-sided ideal, so that ker <p — {0} and (p is an injection. 

Proving that (p is a surjection is more subtle. If b e A, define pb : L -> L to be right 
multiplication by b: 

Pb'. a ab. 

Now pb : L —> L is an A 1 -map: If r e R and a <= A, then 

Pb(ra) = ( ra)b = r(ab ) = rpb(a). 

Hence, pb e Ends(A) = A. If h e End a(A) and a, b e A, then 

h(Pb(a )) = Pbh(a). 

The left side is h(pb(a )) = h(ab ) = h((p a (b )), and the right side is Pbh(a) = h(a)b = 
< Ph(a)(b )• Therefore, 

hipa — (Ph(a) ^ (p(R). 
and so (p(L) is a left ideal in EndA(T). 

Now LR — {^ ( Vjrj : v/ e L and r, e R} is a two-sided ideal in R. and LR ^ 
{0} because R has a unit element. Simplicity of R gives LR = R. Therefore, <p(R) = 
<p(LR) = <p(L)(p(R) is a left ideal in EikIa(T) (because ip(L) is a left ideal). But <p(R) 
contains <p(\) — 1, and so the left ideal <p(R) contains 1. We conclude that cp(R) = 
EndA(T) and R = End A (A) . 

Since R is left artinian, we may assume that L is a minimal left ideal, that A = End r (L) 
is a division ring (by Schur’s lemma), and that L is a left vector space over A. If L is 
finite-dimensional, say, dim a (A) = n, then R = EndA(A) = Mat„(A op ), and we are 
done. If, on the other hand, A is infinite-dimensional, then there is an infinite independent 
set t>i, t> 2 ,... , v n , ... that is part of a basis. If 

Ij — {T e End A (A) : A(ui) = 0 = • • • = T( Vj )}, 

then it is easy to see that ft 2 /2 2 • ■ ■ is a strictly decreasing sequence of left ideals, 
contradicting R being left artinian. • 

The following corollary follows at once from Proposition 8.59 and the Wedderburn- 
Artin theorem. 

Corollary 8.60. If A is a simple left artinian ring, then A = Mat,, (A) for some n > 1 
and some division ring A. 

The next lemma, which gives some interesting properties enjoyed by left semisimple 
rings, will be used to complete the Wedderburn-Artin theorem by stating uniqueness of 
its constituent parts. In particular, it will say that the integer n and the division ring A in 
Corollary 8.60 are uniquely determined by A. 
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Lemma 8.61. Let R be a left semisimple ring, and let 

R = ©•••©/, H = ©•••© B m , 

where the L j are minimal left ideals and the Bj’s are the corresponding simple components 

of R. 

(i) Each B, is a ring that is also a two-sided ideal in R, and BjB j — {0} if j ^ i. 

(ii) If L is any minimal left ideal in R, not necessarily occurring in the given decompo¬ 
sition of R, then L = L, for some i and L C If. 

(iii) Every two-sided ideal D in R is a direct sum of Bi ’s. 

(iv) Each B , is a simple ring. 

Proof, (i) Each Bj is a left ideal. To see that it is also a right ideal, consider 


BjR — Bj(B[ © • • • © B m ) C BjB i + • • • + BjB m . 

Recall, for each i, that If is a direct sum of left ideals L isomorphic to Lj. If L = L, 
and L’ = Lj, then the contrapositive not (3) =>• not (1) in Lemma 8.53 applies to give 
LL' = {0} if j ^ i. Hence, if j ^ i. 


BjBj = (J2L)( = 

L=Lj L'=Lj 

Thus, BjB !+••• + Bj B m C Bj Bj. Since If is a left ideal. If If C R If C If . Therefore, 
Bj R C Bj, so that If is a right ideal and, hence, is a two-sided ideal. 

In the last step, proving that If is a right ideal, we saw that If Bj C If ; that is. If is 
closed under multiplication. Therefore, to prove that If is a ring, it now suffices to prove 
that it contains a unit element. If 1 is the unit element in R , then 1 = e\ + • • • + e m , where 
ej e Bj for all i. If h, e Bj , then 

bj = Ibj — (e\ H-h e m )bj = ejbj, 

for BjBj — {0} whenever j f i, by part (i). Similarly, the equation bj = bjl gives 
bjej — bj, and so e, is a unit in Bj. Thus, If is a ring. 5 

(ii) By Proposition 8.54, a minimal left ideal L is isomorphic to Lj for some i. Now 
L — RL — ( B\ ©•••© B m )L B\L © • • • © B m L. 

If j i, then BjL — {0}, by Lemma 8.53, so that 

L c BjL c Bj, 


because Bj is a right ideal. 

5 Bj is not a subring of R because its unit Cj is not the unit I in R. 
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(iii) A nonzero two-sided ideal D in R is a left ideal, and so it contains some minimal left 
ideal L, by Proposition 8.29(ii). Now L = L, for some i, by Proposition 8.54; we claim 
that Bj C D. By Lemma 8.53, if L' is any minimal left ideal in If, then L' — Lb' for some 
b' e L'. Since L c D and D is a right ideal, we have L' — Lb' C LL' C DR C D. We 
have shown that D contains every left ideal isomorphic to L,-; as /(, is generated by such 
ideals, B ,• C D. Write R — Bj © Bj, where Bi — Bj with If C D and Bj = ff, Bj 
with B j g D. By Corollary 7.18 (which holds for modules over noncommutative rings), 
D — Bj © (D fl Bj). But D fl Bj — {0}; otherwise, it would contain a minimal left ideal 
L = Lj for some j & J and, as above, this would force B j C D. Therefore, D = B\. 

(iv) A left ideal in Bj is also a left ideal in R: If a e R, then a — ^ • aj, where a j e Bj\ 
if bj e Bj , then 

abj — (aj H-h a m )bj — a,bj e Bj, 

because B j Bj = {0} for j g i. Similarly, a right ideal in Bj is a right ideal in R, and so a 
two-sided ideal D in Bj is a two-sided ideal in R. By part (iii), the only two-sided ideals 
in R are direct sums of simple components, and so D c Bj implies I) = {0| or I) = /(,. 
Therefore, Bj is a simple ring. • 

Corollary 8.62. If R is a semisimple ring, then the simple component containing a mini¬ 
mal left ideal Lj is the left ideal generated by all the minimal left ideals that are isomorphic 
to Lj. Therefore, the simple components of a semisimple ring do not depend on a decom¬ 
position of R as a direct sum of minimal left ideals. 

Proof. This follows from Lemma 8.61 (ii). • 

Corollary 8.63. 

(i) If A is a simple artinian ring, then A = Mat,, (A) for some division ring A. If L 
is a minimal left ideal in A, then every simple left A-module is isomorphic to L\ 
moreover, A op = End,r(L). 

(ii) Two left A-modules M and N are isomorphic if and only if dim.\{M) — dimA((V). 
In particular, if A = Mat,,, (A), then M = N if and only if (l\m a (M) — diniA((V). 

Proof. Since A is a semisimple ring, every left module M is isomorphic to a direct sum 
of minimal left ideals. But, by Lemma 8.61(ii), all minimal left ideals are isomorphic, say, 
to L, and so dim a (A/) is the number of summands in a decomposition. If A7 A N as 
left Mat„(A)-modules, then M = N as left A-modules, and so dimA(M) = dimA((V). 
Conversely, if dimA(M) — d — dimA((V), then both M and N are direct sums of d copies 
of L, and hence M = N as left A-modules. 

We may now assume that A = Mat,, (A) and that L = Col(l), the minimal left ideal 
consisting of all the n x n matrices whose last n — 1 columns are 0 (see Proposition 8.49). 
Define (p: A -* End,i (C) as follows: if d e A and i e L, then <pd'- (- *-»■ Id. Note that 
<Pd is an A-map: it is additive and, if a e A and l e L, then (pd(at) — (al)d — a(Id) = 
aqtd(I). Next, <p is a ring antihomomorphism: <p i = 1/,, it is additive, and <pdd' — Vd'Vd'- 
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if i e L. then (pd'tpdif ) = <Pd(td') — Id'd — tpdd’(£)\ that is, (p is a ring homomorphism 
A°P -» End/i(L). To see that <p is injective, note that each t e L c Mat,, ( A) is a matrix 
with entries in A; hence, id — 0 implies l = 0. Finally, we show that <p is surjective. 
Let / e End^(L). Now L = AE n, where E\\ is the matrix unit (every simple module 
is generated by any nonzero element in it). If m, e A, let [mi,..., u n ] denote the n x n 
matrix in L whose first column is (mi, ..., u,,) 1 and whose other entries are all 0. Write 
/(En) = [d\_, ..., d n \. If l e L, then l has the form [mi, ..., m„], and using only the 
definition of matrix multiplication, it is easy to see that [mi, ..., u n ] = [mi, ..., u„]En- 
Since / is an A- map, 

/([mi, ..., M„]) = /([mi-- m„]Eh) 

= [mi, , u n ]f{E n) 

= [mi,- u n ][d\ -- d n ] 

- [Ml, • • ■ , = <Pdi ([mi, . • ■ , M„ ]). 

Therefore, f — cpd l e im <p, as desired. • 

The number m of simple components of R is an invariant, for it is the number of non¬ 
isomorphic simple left E-modules. However, there is a much stronger uniqueness result. 

Theorem 8.64 (Wedderburn-Artin II). Every semisimple ring R is a direct product, 


R = Mat„! (Ai) x • • • x Mat„ m (A m ), 

where n, > 1 and A; is a division ring, and the numbers m and n,-, as well as the division 
rings Ai, are uniquely determined by R. 

Proof. Let R be a left semisimple ring, and let R = B\ © • • • © B m be a decomposition 
into simple components arising from some decomposition of R as a direct sum of minimal 
left ideals. Suppose that R = B[ x • • • x B' t , where each If is a two-sided ideal that is also 
a simple ring. By Lemma 8.61, each two-sided ideal //' is a direct sum of Bf s. But //' 
cannot have more than one summand Bj, lest the simple ring // contain a proper nonzero 
two-sided ideal. Therefore, t — m and, after reindexing, B'- — B, for all i. 

Dropping subscripts, it remains to prove that if B = Mat,,(A) = Mat,/(A') = B', then 
n = n and A = A'. In Proposition 8.49, we proved that Col(f), consisting of the matrices 
with jth columns 0 for all j / l, is a minimal left ideal in B, so that Col(£) is a simple 
B -module. Therefore, 

{0} c Col(l) c Col(l) © Col(2) c • • • c Col(l) © • • • 0 Co1(m) = B 

is a composition series of B as a module over itself. By the Jordan-Holder theorem (Theo¬ 
rem 8.18), n and the factor modules Cold::) are invariants of B. Now Cold) A Col(l) for 
all i, by Corollary 8.63, and so it suffices to prove that A can be recaptured from Col(l). 
But this has been done in Corollary 8.63(i): A = End£(Col(l)) op . • 

The description of the group algebra kG simplifies when the field k is algebraically 
closed. 
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Corollary 8.65 (Molien). If G is a finite group and A is an algebraically closed field 
whose characteristic does not divide |G|, then 

kG = Mat,,, (A) x • • • x Mat„ m (k). 

Proof. By Maschke’s theorem, kG is a semisimple ring, and its simple components are 
isomorphic to matrix rings of the form Mat„(A), where A arises as End^e (L) op for some 
minimal left ideal L in kG. Therefore, it suffices to show that Endjc(L) op = A — k. 

Now EndtG(L) op C Eiufi-(L) op , which is finite-dimensional over k because L is; hence, 
A = EndA-G(L) op is finite-dimensional over k. Each / e End( L) is a AC/’-map, hence is 
a A-map; that is, f(au) — af(u) for all a e A and u e L. Therefore, the map tp a : L —> L, 
given by u i-> an, commutes with /; that is, A (identified with all tp a ) is contained in 
Z( A), the center of A. If A e A, then S commutes with every element in A, and so A(<5), the 
subdivision ring generated by A and S. is a (commutative) field. As A is finite-dimensional 
over A, so is A(S); that is, k(S) is a finite extension of the field A, and so <5 is algebraic over 
A, by Proposition 3.117. But A is algebraically closed, so that S e A and A — k. • 
Example 8.66. 

There are nonisomorphic finite groups G and H having isomorphic complex group alge¬ 
bras. If G is an abelian group of order n. then CG is a direct product of matrix rings over 
C, because C is algebraically closed. But G abelian implies CG commutative. Hence, CG 
is the direct product of n copies of C. It follows that if H is any abelian group of order n, 
then CG A CH. In particular, I 4 and It © I 2 are nonisomorphic groups having isomorphic 
complex group algebras. It follows from this example that certain properties of a group G 
get lost in the group algebra CG. ◄ 

Corollary 8.67. If G is a finite group and A is an algebraically closed field whose charac¬ 
teristic does not divide |G|, then |G| = n^ + n^-f- • • + «„, where the ith simple component 
Bj ofkG consists of n, x n, matrices. Moreover, we may assume thatn\ = l . 6 
Remark. Theorem 8.149 says that all the n,- are divisors of |G|. ◄ 

Proof. As vector spaces over A, both kG and Mat,,, (A) x • • • x Mat„ m (A) have the same 
dimension, for they are isomorphic, by Corollary 8.65. But dim(AG) = |G|, and the 
dimension of the right side is JT dim(Mat„ ( . (A)) = nj. 

Finally, Example 8.55 shows that there is a unique minimal left ideal isomorphic to the 
trivial module Vo(A); the corresponding simple component, say. If, is one-dimensional, 
and so n 1 = 1 . • 

The number m of simple components in CG has a group-theoretic interpretation; we 
begin by finding the center of the group algebra. 

Definition. Let C\,... ,C r be the conjugacy classes in a finite group G. For each Cj, 
define the class sum to be the element z / e CG given by 

Z J =J2s- 

_ geCj 

6 By Example 8.55, the group algebra kG always has a unique minimal left ideal isomorphic to Vq(A:), even 
when k is not algebraically closed. 
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Here is a ring-theoretic interpretation of the number c of conjugacy classes. 

Lemma 8.68. If r is the number of conjugacy classes in a finite group G, then 

r — dimc(Z(CG)), 

where Z(CG)) is the center of the group algebra. In fact, a basis of Z(CG) consists of all 
the class sums. 

Proof. If Zj — J2 g eCj 8 ' s a c ^ ass sum, then we claim that Zj e Z(CG). If h e G, then 

hzjh~ l — Zj, because conjugation by any element of G merely permutes the elements 
in a conjugacy class. Note that if j f I, then z.j and z.t have no nonzero components in 
common, and so zi, . ■ ■, Z r is a linearly independent list. It remains to prove that the z. j 
span the center. 

Let u — JfgfzQ a g g e Z(CG). If h e G, then huh~ l = u, and so a hf , h -1 = a g for all 
g e G. Thus, if g and g' lie in the same conjugacy class of G, then their coefficients in u 
are the same. But this says that u is a linear combination of the class sums Zj . • 

Theorem 8.69. If G is a finite group, then the number m of simple components in CG is 
equal to the number r of conjugacy classes in G. 

Proof. We have just seen, in Lemma 8.68, that r — dimc(Z(CG)). On the other hand, 
Z(Mat„ ; (C)), the center of a matrix ring, is the subspace of all scalar matrices, so that 
m — dimc(Z(CG)), by Exercise 8.12(iii) on page 532. • 

We began this section by seeing that A:-representations of a group G correspond to kG- 
modules. Let us now return to representations. 

Definition. A A:-representation of a group G is irreducible if the corresponding kG- 
module is simple. 

For example, a one-dimensional (necessarily irreducible) A:-representation is a group 
homomorphism X: G —»• k x , where k x is the multiplicative group of nonzero elements of 
k. The trivial AG-module \fiik) corresponds to the representation /.„ = 1 for all g e G. 
The next result is basic to the construction of the character table of a finite group. 

Theorem 8.70. If G is a finite group, then the number of its irreducible complex repre¬ 
sentations is equal to the number r of its conjugacy classes. 

Proof. By Proposition 8.54, every simple CG-module is isomorphic to a minimal left 
ideal. Since the number of minimal left ideals is m [the number of simple components of 
CG], we see that m is the number of irreducible C-representations of G. But Theorem 8.69 
equates m with the number r of conjugacy classes in G. • 
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Example 8.71. 

(i) If G = S 3 , then CG is six-dimensional. There are three simple components, for S 3 has 
three conjugacy classes (by Theorem 2.9, the number of conjugacy classes in S„ is equal to 
the number of different cycle structures), having dimensions 1,1, and 4, respectively. (We 
could have seen this without Theorem 8.69, for this is the only way to write 6 as a sum of 
squares aside from a sum of six 1 ’s.) Therefore, 

CS 3 = C x C x Mat 2 (C). 

One of the one-dimensional irreducible representations is the trivial one; the other is 
sgn (signum). 

(ii) We now analyze kG for G — Q, the quaternion group of order 8 . If k = C, then 
Corollary 8.65 gives 

CQ = Mat,,, (C) x • • • x Mat„ r (C), 

while Corollary 8.67 gives 

IQI = 8 = n\ + n\ + ■ ■ • + n J?, 

where 11 \ = 1. It follows that either all n, = 1 or four n,- = 1 and one n,- = 2. The first 
case cannot occur, for it would imply that CQ is a commutative ring, whereas the group Q 
of quaternions is not abelian. Therefore, 

CQ = CxCxCxCx Mat 2 (C). 

We could also have used Theorem 8.69, for Q has exactly five conjugacy classes, namely, 

{i},{T},{i,7},{./J},{*,*}. 

The group algebra IRQ is more complicated because R is not algebraically closed. Exer¬ 
cise 8.20 on page 533 shows that H is a quotient of RQ, hence H is isomorphic to a direct 
summand of RQ because RQ is semisimple. It turns out that 

RQ = RxRxRxRxI. < 

Here is an amusing application of the Wedderburn-Artin theorems. 

Proposition 8.72. Let R be a ring whose group of units U — U (R ) is finite and of odd 
order. Then U is abelian and there are positive integers nij with 

t 

\u\ =n<2-' -1). 

! = 1 

Proof First, we note that 1 = — 1 in R, otherwise —1 is a unit of even order. Con¬ 
sider the group algebra kU , where k — F 2 . Since k has characteristic 2 and \U\ is odd, 
Maschke’s theorem says that kU is semisimple. There is a ring map <41 ; kU -> R carrying 
every /.'-linear combination of elements of U to “itself.” Now R' = mup is a finite sub¬ 
ring of R containing U (for kU is finite); since dropping to a subring cannot create any new 
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units, we have U = U (/?')• By Corollary 8.43(iii), the ring R' is semisimple, so that the 
Wedderburn-Artin Theorem I gives 

t 

r ' =n Ma MA,), 

i=i 


where each A, is a division ring. 

Now A, is finite, because R' is finite, and so A, is a finite division ring. By the “other” 
theorem of Wedderburn, Theorem 8.23, each A, is a field. But —1 = 1 in R implies 
— 1 = 1 in A, , and so each field A, has characteristic 2; hence, 

I A; I = 2 m ‘ 


for integers m,- > 1. All the matrix rings must be 1 x 1, for any matrix ring of larger size 
must contain an element of order 2, namely, I + K, where K has entry 1 in the first position 
in the bottom row, and all other entries 0. For example. 


'1 

o' 

2 

'1 

O' 

1 

1 


2 

1 


Therefore, R' is a direct product of finite fields of characteristic 2, and so U — U ( R’) is an 
abelian group whose order is described in the statement. • 

It follows, for example, that there is no ring having exactly five units. 

The Jcicobson-Chevcilley density theorem , an important generalization of Wedderburn’s 
theorem for certain nonartinian rings, was proved in the 1930s. Call a ring R left primitive 
if there exists a faithful simple left /?-module S ; that is, S is simple and, if r e R and 
rS = {0}, then r = 0. It can be proved that commutative primitive rings are fields, while 
left artinian left primitive rings are simple. Assume now that R is a left primitive ring, that 
S is a faithful simple left /^-module, and that A denotes the division ring Ends (S). The 
density theorem says that if R is left artinian, then R A Mat,,(A), while if R is not left 
artinian, then for every integer n > 0, there exists a subring R n of R with R n A Mat,,(A). 
We refer the reader to Lam, A First Course in Noncommutative Rings , pages 191-193. 

The Wedderburn-Artin theorems led to several areas of research, two of which are de¬ 
scriptions of division rings and of finite-dimensional algebras. Division rings will be con¬ 
sidered in the context of central simple algebras in Chapter 9 and crossed product algebras 
in Chapter 10. Let us discuss finite dimensional algebras now. 

Thanks to the theorems of Maschke and Molien, the Wedderburn-Artin theorems ap¬ 
ply to ordinary representations of a finite group G; that is, to kG-modules, where k is a 
field whose characteristic does not divide |G|. We know kG is semisimple in this case. 
However, modular representations, that is, kG-modules for which the characteristic of k 
does divide |G|, arise naturally. For example, if G is a finite p-group, for some prime 
p, then a minimal normal subgroup A is a vector space over ¥ p . Now G acts on N (by 
conjugation), and so N is an F^G-module. Modular representations are used extensively 
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in the classification of the finite simple groups. In his study of modular representations, 
R. Brauer observed that the important modules M are indecomposable rather than irre¬ 
ducible. Recall that a module M is indecomposable if there are no nonzero modules A 
and B with M = A ® B (in the ordinary case, a module is indecomposable if and only 
if it is irreducible [i.e., simple], but this is no longer true in the modular case). When 
kG is semisimple. Proposition 8.54 says that there are only finitely many indecomposable 
modules (corresponding to the minimal left ideals). This is not true in the modular case, 
however. For example, if k is an algebraically closed field of characteristic 2, k\ and kA^ 
have infinitely many nonisomorphic indecomposable modules. 

A finite-dimensional ^-algebra R over a field k is said to have finite representation 
type if there are only finitely many nonisomorphic finite-dimensional indecomposable R- 
modules. D. G. Higman proved that if G is a finite group, then kG has finite representation 
type for every field k if and only if all its Sylow subgroups G are cyclic. In the 1950s, the 
following two problems, known as the Brauer-Thrall conjectures , were posed. Let R be 
a ring not of finite representation type. 

(I) . Are the dimensions of the indecomposable /^-modules unbounded? 

(II) . Is there a strictly increasing sequence n\, ni, ... with infinitely many nonisomorphic 
indecomposable R -modules of dimension n, for every il 

The positive solution of the first conjecture, by A. V. Roiter in 1968, had a great im¬ 
pact. Shortly thereafter, P. Gabriel introduced graph-theoretic methods, associating finite¬ 
dimensional algebras to certain oriented graphs, called quivers. He proved that a connected 
quiver has a finite number of nonisomorphic finite-dimensional representations if and only 
if the quiver is one of the Dynkin diagrams A n , D n , EV„ E-j, or E& ( Dynkin diagrams are 
multigraphs that describe simple complex Lie algebras; see the discussion on page 778). 
Gabriel’s result can be rephrased in terms of hereditary ^-algebras A (one-sided ideals are 
projective ,4-modules). V. Dlab and C. Ringel extended Gabriel’s result to all Dynkin dia¬ 
grams (of any type A through G). They proved that a finite-dimensional hereditary algebra 
is of finite representation type if and only if its graph is a finite union of Dynkin dia¬ 
grams. Moreover, using Coxeter functors (which were introduced by I. N. Bernstein, I. M. 
Gelfand, and V. A. Ponomarev to give a new proof of Gabriel’s result), they extended the 
classification to hereditary algebras of tame representation type in terms of the so-called 
extended Dynkin diagrams (algebras of infinite representation type are divided into those 
of tame type and those of wild type). A confirmation of the second Brauer-Thrall con¬ 
jecture for all hereditary algebras followed. A positive solution of Brauer-Thrall II for all 
(not necessarily hereditary) finite-dimensional algebras over an algebraically closed field 
follows from the multiplicative basis theorem of R. Bautista, P. Gabriel, A. V. Roiter, and 

L. Salmeron: Every finite-dimensional ^-algebra A of finite representation type has a mul¬ 
tiplicative basis B: a vector space basis of A such that the product of two basis vectors 
lies in B U {0}. In fact, they proved that there exist multiplicative bases that contain a 
complete set of primitive orthogonal idempotents and a basis of each power of the radical. 

M. Auslander and I. Reiten created a theory involving almost split sequences (defined in 
Chapter 10) and Auslander-Reiten quivers. This theory, which generalizes the concept of 
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Coxeter functors, provides a construction of new indecomposable representations of (ar¬ 
bitrary) finite-dimensional algebras. As of this writing, Auslander-Reiten theory is the 
most powerful tool in the study of representations of finite-dimensional algebras. For a 
discussion of these ideas, we refer the reader to Artin-Nesbitt-Thrall, Rings with Mini¬ 
mum Condition, Dlab-Ringel, Indecomposable Representations of Graphs and Algebras, 
Memoir AMS #173, 1976, Jacobson, The Theory of Rings, Jacobson, Structure of Rings, 
and Drozd-Kirichenko, Finite Dimensional Algebras. 

Exercises 

8.36 Let A be an n-dimensional L-algebra over a field k. Prove that A can be imbedded as a 
f'-subalgebra of Mat,, (k). 

Hint. If a € A, define L a : A —>■ A by L a : x ax. 

8.37 Let G be a finite group, and let k be a commutative ring. Define e: kG — *■ k by 

«(!]«£<?) = J2 a g 

geG geG 

(this map is called the augmentation, and its kernel, denoted by Q, is called the augmentation 
ideal). 

(i) Prove that e is a LG-map and that kG/Q = k as f -algebras. Conclude that Q is a 
two-sided ideal in kG. 

(ii) Prove that kG/Q = V()(k), where Vq( k) is k viewed as a trivial fcG-module. 

Hint. Q is a two-sided ideal containing xu — u = (x — 1 )u e Q. 

(iii) Use part (ii) to prove that if kG = Q © V, then V = (v), where v = a ffg^G 8 - 
Hint. Argue as in Example 8.55. 

(iv) Assume that k is a field whose characteristic p does divide | G|. Prove that kG is not left 
semisimple. 

Hint. First show that e(u) = 0, and then show that the short exact sequence 

Q^Q-^kG-Uk^O 

does not split. 

8.38 If A is a division ring, prove that every two minimal left ideals in Mat„(A) are isomorphic. 
(Compare Corollary 8.50.) 

8.39 An element a in a ring R is called a zero divisor if a 0 and there exists a nonzero b e R 
with ab = 0 (more precisely, we call a a left zero divisor and b a right zero divisor). Prove 
that a left artinian ring R having no zero divisors must be a division ring. 

8.40 Let T : V V be a linear transformation, where V is a vector space over a field k. and let 
k[T] be defined by 

k[T] = jfc[*]/(m(jc)), 

where m(x) is the minimum polynomial of T . 

(i) If m{x) = ]”[/, p{x) e P, where the p{x) e k[x] are distinct irreducible polynomials and 
e p > 1. prove that k[T] = \\ p k[x]/(p(x) ep ). 
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(ii) Prove that fc[T] is a semisimple ring if and only if m(x) is a product of distinct linear 
factors. (In linear algebra, we show that this last condition is equivalent to T being 
diagonalizable ; that is, any matrix of T [arising front some choice of basis of T] is 
similar to a diagonal matrix.) 

8.41 Find C G if G = Dg, the dihedral group of order 8 . 

8.42 Find CG if G = A 4 . 

Hint. Aq has four conjugacy classes. 

8.43 (i) Let k be a field, and view sgn: S n — * (±1) < k. Define Sig(Ar) to be k made into a 

kS, ,-module (as in Proposition 8.37): If y e S n and a e k, then ya = sgn(}/)a. Prove 
that Sis(k) is an irreducible kS n -module, and if k does not have characteristic 2, then 

Sig(fc) £ V 0 (k). 

(ii) FindCSs- 

Hint. There are five conjugacy classes in S 5 . 

8.44 Let G be a finite group, and let k and K be algebraically closed fields whose characteristics p 
and q, respectively, do not divide |G|. 

(i) Prove that kG and K G have the same number of simple components. 

(ii) Prove that the degrees of the irreducible representations of G over k are the same as the 
degrees of the irreducible representations of G over K. 


8.4 Tensor Products 


We now introduce a new notion, 7 tensor products, that is used to construct induced rep¬ 
resentations (which extend representations of subgroups to representations of the whole 
group). Tensor products are also useful in other areas of algebra as well; for example, 
they are involved in bilinear forms, the adjoint isomorphism, free algebras, exterior alge¬ 
bra, and determinants. The reader who wishes to see the impact of the Wedderburn-Artin 
and Maschke theorems on groups without this interruption can proceed directly to the next 
section, for the first application we shall give—Burnside’s theorem—does not use induced 
representations in its proof. On the other hand, we shall also prove a theorem of Frobenius 
that does use induced representations. 

If k is a field and H be a subgroup of a group G, then a k-representation of H is the 
same thing as a k//-module, and a k-representation of G is the same thing as a /cG-module. 
If we could force a kH -module M to be a /rG-module, then we would be able to create a 
representation of the big group G from a representation of a subgroup. More generally, if 
A is a subring of a ring R , we may want to force an A-module M to be an /{-module. If M 
is generated as an A-module by a set X , then each m e M has an expression of the form 
m — UjXi, where a; e A and x, e X. Perhaps we could create an /^-module containing 
M by taking all expressions of the form r, x/ for r, e R. This naive approach is doomed 

7 Tensor products of A'-modules, where R is commutative, could have been presented in Chapter 7. How¬ 
ever, I believe that the best exposition delays the introduction of noncommutative rings to the present chapter. 
Consequently, putting tensor products earlier would have forced me to construct them in two stages: first over 
commutative rings in Chapter 7, then over general rings now. This is not a good idea. 
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to failure. For example, a cyclic group G — (g } of finite order n is a Z-module; can we 
make it into a Q-module? A Q-module V is a vector space over Q, and it is easy to see 
that if v e V and q e Q, then qv — 0 if and only if q = 0 or v = 0. If we could create a 
rational vector space V containing G in the naive way described in the previous paragraph, 
then ng — 0 would imply g = 0 in VI Our goal of adjoining scalars to obtain a module 
over a larger ring still has merit but, plainly, we cannot be so cavalier about its construction. 
The proper way to deal with such matters is with tensor products. 

One of the most compelling reasons to introduce tensor products comes from algebraic 
topology, where we assign to every topological space X a sequence of homology groups 
H„(X) for all n > 0 that are of basic importance. The Kunneth formula computes the 
homology groups of the cartesian product X x Y of two topological spaces in terms of the 
tensor product of the homology groups of the factors X and Y. 

Definition. Let R be a ring, let A r he a right R -module, let rB be a left R -module, and 
let G be an (additive) abelian group. A function /: A x B -* G is called R-biadditive if, 
for all a , a' e A, b , b' e B , and r e R, we have 

f{a + a', b) — f(a, b) + f{a', b)\ 
f(a,b + b') = f (a, b) + f (a, b')\ 
f (ar , b) = f(a,rb). 

An R -biadditive function is also called a pairing. 

If R is commutative and A, B. and M are /^-modules, then a function /: A x B -» M 
is called R-bilinear if / is R -biadditive and also 

f(ar, b) = /(a, rb) — rf(a, b). 


Example 8.73. 

(i) If R is a ring, then its multiplication p. : R x R —>• R is A 1 -hiadditive; the first two 
axioms are the right and left distributive laws, while the third axiom is associativity: 

p{ar , b) — ( ar)b = a(rb) — p(a , rb). 

If R is a commutative ring, then p is R -bilinear, for (ar)b — a(rb) — r(ab). 

(ii) If rM is a left R -module, then its scalar multiplication o : R x M -> M is R -biadditive; 
if R is a commutative ring, then a is R -bilinear. 

(iii) If Mr and Nr are right /^-modules, then Hom/A/V/. N) is a left R -module if, for 
/ e Hom^(M, N) and r e R. we define rf: M —> N by 

rf: m i —> f(mr). 

The reader may show that this does make Horn into a left R -module; moreover, we can 
now see that evaluation e: M x Homfi(M, N) -> N , given by (m, f) f(m), is R- 
biadditive. 
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The dual space V* of a vector space V over a field k gives a special case of this con¬ 
struction: Evaluation V x V* -> k is R -bilinear. 

(iv) If G* — Horn z(G, Q/Z) is the Pontrjagin dual of an abelian group G. then evaluation 
GxG'-> Q/Z is Z-bilinear. ◄ 

Tensor products convert biadditive functions into linear ones. 

Definition. Given a ring R and modules Ar and r 11 , then their tensor product is an 
abelian group A <S)r B and an A 1 -hiadditive function 

h: A x B A B 

such that, for every abelian group G and every R -biadditive /: A x B -> G. there exists 
a unique Z-homomorphism f: A (&r B -> G making the following diagram commute. 

A x B --->- A ®r B 

A f 

G 

If a tensor product of A and B exists, then it is unique to isomorphism, for it has been 
defined as a solution to a universal mapping problem (see the proof of Proposition 7.27 on 
page 448). 

Quite often, we denote A <E)r B by A (g) B when R — Z. 

Proposition 8.74. If R is a ring and Ar and rB are modules, then their tensor product 
exists. 

Proof. Let F be the free abelian group with basis A x B \ that is, F is free on all ordered 
pairs (a, b), where a e A and b e B. Define S to be the subgroup of F generated by all 
elements of the following types: 

(i a , b + b') — (a, b) — (a, b')\ 

(i a + a', b) — ( a , b) — (a', b)\ 

(ar. b) — (a, rb). 

Define A (&r B = F/S. denote the coset (a, b) + S by a ® b. and define 

h: A x B —»• A <&r B by h: (a, b) i-> a ® b 

(thus, h is the restriction of the natural map F —> F/S). We have the following identities 
in A <g> r B : 

a ® (b + b r ) = a ® b + a ® b '; 

(a + a') ® b = a <g> b + a' <g> b\ 
ar <g> b — a ® rb. 
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It is now obvious that h is A 1 -hiadditive. 

Consider the following diagram, where G is an abelian group and / is R -biadditive: 
Ax B ---A® r B 



where i: Ax B -* F is the inclusion. Since F is free abelian with basis Ax B. there exists 
a homomorphism rp: F -> G with <p(a,b) = f (a, b) for all (a, b)\ now S C ker ip because 
/ is R -biadditive, and so rp induces a map /: A ®r B —*■ G (because A®r B = F/S) by 

f(a ® b ) = /((a, b ) + S) = <p{a, b ) = /(a, b). 

This equation may be rewritten as fh = /; that is, the diagram commutes. Finally, / is 
unique because A (&r B is generated by the set of all a < 8 > b’ s. • 

Remark. Since A B is generated by the elements of the form a (gib, every u e A <S)r B 
has the form 

a — cij ® bj. 
i 

This expression for u is not unique; for example, there are expressions 

0 = a ® (b + b' ) — a ® b — a ® b' 

= (a + a 1 ) ® b — a ® b — a 1 <g> b 
— ar ® b — a <g> rb. 

Therefore, given some abelian group G, we must be suspicious of a definition of a map 
g. A ®r B -» G that is given by specifying g on the generators a <S) b\ such a “function” 
g may not be well-defined because elements have many expressions in terms of these 
generators. In essence, g is only defined on F (the free abelian group with basis A x B ), 
and we must still show that g(S) = {0}, because A®rB = F/S. The simplest (and safest!) 
procedure is to define an R -biadditive function on A x B, and it will yield a (well-defined) 
homomorphism. We illustrate this procedure in the next proof. ■* 

Proposition 8.75. Let f : Ar —> A p and g: rB -> rB' be maps of right R-modules 
and left R-modules, respectively. Then there is a unique Z-homomorphism, denoted by 
f <g> g: A® r B —> A' ®r B', with 


f ® g: a ®b i-> f(a ) <g> g(b). 
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Proof. The function tp : A x B —*■ A 1 <g> r B', given by (a, b) i-> f (a) ® gib), is easily 
seen to be an R -biadditive function. For example, 

cp: (ar,b) i-> f(ar) <g> gib) = f (a)r <g> g(b) 


and 


<p: (a,rb) !->■ f{a)®girb) = f(a)®rg(b); 


these are equal because of the identity a'r <g> b' — a’ ® rb ’ in A ’ ®r B'. The biadditive 
function cp yields a unique homomorphism A <S)r B —> ,4' <S)r B' taking 


a <g> b fia) <g> gib). • 


/ f 

Corollary 8.76. Given maps of right R-modules, A -> A —> A", and maps of left 
R-modules, B -%■ B' B", 

if ®g’)if®g) = f'f®g'g. 

Proof. Both maps take a b i-> f fia) < 8 > g'gib), and so the uniqueness of such a 
homomorphism gives the desired equation. • 


Theorem 8.77. Given Ar, there is an additive functor Fa : n Mod -> Ab, defined by 
F A {B) = A® r B and F A ig) = 1 A <8> g, 
where g : B —> B 1 is a map of left R-modules. 

Proof. First, note that F A preserves identities: /^(lg) = 1 a ® 1b is the identity 1 a®s- 
because it fixes every generator a <g> b. Second, F A preserves composition: 

F A ig’g ) — U <S> g'g = (1a <S> g')(lA < 8 >g) = F A (g')F A (g), 

by Corollary 8.76. Therefore, F A is a functor. 

To see that F A is additive, we must show that F A ig + h) — F A (g) + F A ih), where 
g, h: B —> B'-, that is, 1 A ® ig + h) = 1 A ® g + 1 A ®> h. This is also easy, for both these 
maps send a <8> b a ® gib) + a ® h(b). • 

We denote the functor F A by A<S)r . Of course, there is a similar result if we fix a left 
/^-module B: There is an additive functor ®rB : Mod/; —*■ Ab. 

Corollary 8.78. Iff'. M —> M' and g: N —> N' are, respectively, isomorphisms of right 
and left R-modules, then f ® g: M ®b N M' ®R N' is an isomorphism of abelian 
groups. 
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Proof. Now / <g> 1 jv' is the value of the functor Frr on the isomorphism /, and hence 
/ ® 1^' is an isomorphism; similarly, 1 m <S> g is an isomorphism. By Corollary 8.76, we 
have f<S>g = (f®lN')(lM < 8>g)- Therefore, f<S>g is an isomorphism, being the composite 
of isomorphisms. • 

Before continuing with properties of tensor products, we pause to discuss a technical 
point. In general, the tensor product of two modules is only an abelian group; is it ever a 
module? If so, do the tensor product functors then take values in a module category, not 
merely in Ab? That is, is 1 ® f always a map of modules? 

Definition. Let R and S be rings and let M be an abelian group. Then M is an ( R. S)- 
bimodule , denoted by rM$. if M is a left R -module and a right 5-module, and the two 
scalar multiplications are related by an associative law: 

r(ms ) = ( rm)s 

for all r e R. m e M , and s e S. 

If M is an (R, 5)-bimodule, it is permissible to write rms with no parentheses, for the 
definition of bimodule says that the two possible associations agree. 

Example 8.79. 

(i) Every ring R is an (R. R )-bimodule; the extra identity is just the associativity of multi¬ 
plication in R. 

(ii) Every two-sided ideal in a ring R is an ( R. R )-bimodule. 

(iii) If M is a left R -module (i.e., if M — rM). then M is an (R. Z)-bimodule; that is, 
M = rMz- Similarly, a right /^-module A is a bimodule zNr. 

(iv) If R is commutative, then every left (or right) /^-module is an (R. /?)-bimodule. In 
more detail, if M — rM , define a new scalar multiplication M x R -> M by (m, r) 
rm. To see that M is a right /^-module, we must show that m{rr') — (mr)r', that is, 
(rr')m — r'(rm), and this is so because rr' — r'r. Finally, M is an (R, /?)-bimodule 
because both r(mr') and (rm)r' are equal to (rr')m. 

(v) In Example 8.6, we made any left A'G-module M into a right A6/’-module by defining 
mg — g~ 1 m for every m e M and every g in the group G. Even though M is both 
a left and right AG-module, it is usually not a (AG, AG)-bimodule because the required 
associativity formula may not hold. In more detail, let g, h e G and let m e M. Now 
g(mh ) = g(h~ l m) = ( gh~ l )m ; on the other hand, ( gm)h = h~ l {gm) = (h~ l g)m. To 
see that these can be different, take M = kG, m = 1, and g and h noncommuting elements 
of G. ◄ 

The next lemma solves the problem of extending scalars. 

Lemma 8.80. Given a bimodule $Ar and a left module rB, then the tensor product 
A (&r B is a left S-module, where 


s(a <S> b) — (so) ® b. 
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Similarly, given Ar and rBr, the tensor product A ®r B is a right S-module, where 
(i a <S> b)s — a ® ( bs ). 

In particular, if k is a commutative ring and A is a k-algehra, then A B is a left 
A-module. 

Proof For fixed s e S , the multiplication p s : A -> A, defined by a i— sa, is an A 1 -map, 
for A being a bimodule gives 

p, s {ar) = s(ar) = (sa)r — gb s (a)r. 

If F = ®rB : Mod^ —> Ab, then F(p, s ): A (&r B —»■ A ®r B is a (well-defined) 
Z-homomorphism. Thus, F(p s ) = p, s ® 1 r : a®b m* (sa) <g> b , and so the formula in the 
statement of the lemma makes sense. It is now straightforward to check that the module 
axioms do hold for A ®r B. 

The last statement follows because a k-algebra A is an (A, k)-bimodule. • 

For example, if V and W are vector spaces over a field k, then their tensor product 
V <8>jt W is also a vector space over k. 

After a while, we see that proving properties of tensor products is just a matter of show¬ 
ing that the obvious maps are, indeed, well-defined functions. 

We have made some progress in our original problem: Given a left k-module M, where 
k is a subring of a ring K, we can create a left K -module from M by extending scalars; 
that is. Lemma 8.80 shows that K <E)k M is a left K -module, for K is a (K, k)-bimodule. 
However, we must still investigate, among other things, why a left k-module M may not 
be imbedded in K <S)k M, where k is a subring of a ring K. 

The following special case of extending scalars is important for representations. If H 
is a subgroup of a group G and if p: H -> GL( V ) is a ^-representation, then p: H -> 
GL( V) equips V with a left ^//-module structure. We call V G — kG ®kH V the induced 
module. Note that kG is a right ^//-module (it is even a right kG-module), and so the 
tensor product V G = kG ®k.H V makes sense; moreover, V G is a left /t'G-module, by 
Lemma 8.80. We will investigate this construction more carefully later in this chapter (see 
induced modules on page 624). 

Corollary 8.81. 

(i) Given a bimodule sAr, then the functor Fa — A®r : ^Mod -* Ab actually takes 
values in s Mod. 

(ii) If R is a commutative ring, then A(&r B is an R-module, where 

r(a ® b) — (ra) <g> b — a <S> rb 
for all r e R, a e A, and b e B. 

(iii) If R is a commutative ring, r e R, and /x, : B —> B is multiplication by r, then 

\ A ® p r \ A B A ®r B 


is also multiplication by r. 
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Proof, (i) By the lemma, we know that A <S)r B is a left 5-module, where s(a 0> b) = 
(, sa ) <g> b, and so it suffices to show that if g : B -* B' is a map of left /^-modules, then 
Fa (g) — 1a ® g is an 5-map. But 

(1a ® g)LAa 0 b)] - (1a 0 g)[(sa) <g> b] 

— (sa) 0 gb 

— s(a0gb) by Lemma 8.80 

= s(1a 0 g)(a 0 b). 


(ii) Since R is commutative, we may regard A as an (R. f?)-bimodule by defining ar = ra. 
Lemma 8.80 now gives 

r(a 0 b) — (ra) ® b — (ar) 0 b — a 0 rb. 


(iii) This statement merely sees the last equation a 0 rb — r(a 0 b) from a different 
viewpoint: 

(1a 0 Fr)(a 0 b) — a 0 rb = r(a 0 b). • 


We have defined R -biadditive functions for arbitrary, possibly noncommutative, rings 
R , whereas we have defined /^-bilinear functions only for commutative rings. Tensor 
product was defined as the solution of a certain universal mapping problem involving 
R -biadditive functions; we now consider the analogous problem for R -bilinear functions 
when R is commutative. 

Here is a provisional definition, soon to be seen unnecessary. 

Definition. If k is a commutative ring, then a k-bilinear product is a ^-module X and a 
^-bilinear function h : AxB —> X such that, for every ^-module M and every ^-bilinear 
function g: A x B -> M, there exists a unique ^-homomorphism g~: X —> M making the 
following diagram commute. 


AxB - - - 

A * 

M 

The next result shows that ^-bilinear products exist, but that they are nothing new. 

Proposition 8.82. If k is a commutative ring and A and B are k-modules, then the 
k-module A 0 k B is a k-bilinear product. 
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Proof. We show that X — A ® k B provides the solution if we define h ( a ,b) = a®b\ note 
that h is also A:-bilinear, thanks to Corollary 8.81. Since g is A-bilinear, it is A-bi additive, 
and so there does exist a Z-homomorphism 'g: A ® k B M with g~(a ® b) — g(a, b) for 
all (a. b) e A x B. We need only show that g is a A-map. If u e A, 

g(u(a ® b)) — g’i(ua) ® b ) 

= g{ua. b) 

— ug(a, b ) for g is A-bilinear 

= ug{a®b). • 

As a consequence of the proposition, the term bilinear product is unnecessary, and we 
shall call it the tensor product instead. 

In contrast to the Horn functors, the tensor functors obey certain commutativity and 
associativity laws. 

Proposition 8.83 (Commutativity). If A is a commutative ring and M and N are 
k-modules, then there is a k-isomorphism 

r : M ® k N -* N ®a- M 


with m. 

Proof First, Corollary 8.81 shows that both M®^N and NtEp-M are A-modules. Consider 
the diagram 

M x N - - -> M ®ic N 

• r 
A' 

N ® k M, 

where / (m ,n) — n®m. It is easy to see that / is A-bilinear, and so there is a unique A-map 
r: M® k N —> with r: m®n n®m. A similar diagram, interchanging the roles 

of M® k N and N® k M, gives a A-map in the reverse direction taking n®m i—► m®n. Both 
composites of these maps are obviously identity maps, and so r is a A-isomorphism. • 

Proposition 8.84 (Associativity). Given Ar.r B$, and $C, there is an isomorphism 
0: A ®r ( B ®s C) = (A ®r B) ®$ C 



given by 


a ® (b ®c) !->■ (a ® b) ® c. 


Proof. Define a triadditive function /: AxBx C —> G. where G is an abelian group, 
to be a function that is additive in each of the three variables (when we fix the other two). 


f{ar,b,c) — f(a,rb.c), and f(a,bs,c) — f(a,b,sc ), 
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for all r e R and s e S. Consider the univeral mapping problem described by the diagram 
A x B x C--->■ T(A, B, C), 


G 

where G is an abelian group, / is triadditive, and / is a Z-homomorphism. As for biaddi¬ 
tive functions and tensor products of two modules, define T (A, B. C) — F/N , where F is 
the free abelian group on all ordered triples (a, b, c) e A x B x C, and N is the obvious 
subgroup of relations. Define h: A x B x C -> T(A . B. C) by 

h : ( a , b, c) h> ( a , b, c) + N 

(denote (a. b, c) + N by a ® b ® c). A routine check shows that this construction does give 
a solution to the universal mapping problem for triadditive functions. 

We now show that A (B ®$ C) is another solution to this universal problem. Define 
a triadditive function ?/:AxBxC-*A ®^ (B ®$ C) by ry. (a.b.c) i-> a ® (b ® c); 
we must find a homomorphism /: A ®^ (B ®s C) —> G with ft] = f. For each 
a e A, the 5-biadditive function f a : B x C —> G, defined by (b, c) i-> f (a, b , c), gives 
a unique homomorphism f a : B <S)s C —>• G taking b <S) c f(a, b, c). If a, a' e A, then 
fa+a>(b ® c) = f(a + a', b, c) = f(a, b, c) + f(a', b, c) = f a (b <8> c) + f a Ab <g> c). It 
follows that the function (p: A x (B C) —> G, defined by (p(a, b ® c) = f a (b ® c), is 
additive in both variables. It is R -biadditive, for if r e R. then <p(ar, b®c) = f ar (b®c) = 
f(ar, b, c) — f(a, rb, c) = f a (rb ® c) = q>(a, r{b ® c)). Therefore, there is a unique 
homomorphism /: A (B ®s C) —»• G with a ® ® c) i —> (p(a, b ® c) = /(a, /?, c); 

that is, ft] — f. Uniqueness of solutions to universal mapping problems shows that there 
is an isomorphism T(A, B,C) —»■ A®^ (B ®^C) with a®f>®c i-> a® (f>®c). Similarly, 
T (A, B, C) = (A ®j? B) ®5 C via a ® b ® c i-> (a ® fo) ® c, and so A ®^ (B ®5 C) = 
(A ®s B) ®jC via a ® (b ® c) i-> (a ® b) ® c. • 



Remark. That the elements a®£>®c e T(A, B, C) have no parentheses will be exploited 
in the next chapter when we construct tensor algebras. ■* 

We now present properties of tensor products that will help us compute them. First, we 
give a result about Horn, and then we give the analogous result for tensor. 

Recall Exercise 8.34 on page 549: For any left R-module M, for any / e Horn/A R. M), 
and for any r, ,v e R. define 

rf:sH> f (sr ). 

Using the fact that a ring R is an ( R. R )-bimodule, we can check that rf is an R-map and 
that Horn/A R. M) is a left R-module. We incorporate this into the next result. 
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Proposition 8.85. If M is a left R-module, then Horn ft (/A M) is a left R-module, and 
there is an R-isomorphism (pM'- Hom^fS, M) —>■ M, given by tpM(f) — /(1). Indeed, 
q> = {(Pm) is a natural equivalence between Homj?(S, ) and the identity functor on 

A Mod. 

Proof Adapt the proof of Proposition 7.102. • 


Proposition 8.86. For every left R-module M, there is an R-isomorphism 

9m • R ®ft M ~M 

with 9m'- r ® m i—>• rm. Indeed, 9 — {9m} is a natural equivalence between R®r and 
the identity functor on ftMod. 

Proof The function R x M -> M, given by (r, m) i-> rm, is A-hiadditive, and so there 
is an R -homomorphism 9: R ®r M —>• M with r ® m i-> rm [we are using the fact 
that R is an (R, R) -bimodule]. To see that 9 is an R -isomorphism, it suffices to find a 
Z-homomorphism f: M — >• R ®r M with 9f and f9 identity maps (for it is now only a 
question of whether the function 9 is a bijection). Such a Z-map is given by /: m m* 1 ®m. 

To see that the isomorphisms 9m constitute a natural equivalence, we must show, for 
any module homomorphism h: M N , that the following diagram commutes. 


R®rM —^ R® r N 


6m 


8n 


M 


-> N 


It suffices to look at a generator r ® m of R®rM. Going clockwise, r®m i->- r®h(m) i->- 
rh(m), while going counterclockwise, r ® m i->- rm i->- h(rm). These agree, for h is an 
/?-map, so that h(rm) — rh(m). • 

The next theorem says that tensor product preserves arbitrary direct sums. 


Theorem 8.87. Given a right module Ar and left R-modules { r If : i e I], there is a 
Z- isomorphism 

<P'- A ®r Y2 B i ^(A B i) 
iel iel 

with (p \ a® (bi) h (a® bj ). Moreover, if R is commutative, then (p is an R-isomorphism. 

Proof. Since the function /: A x (JT Bj) -> £T(A ®r Bi), given by /: (a, (bi)) i-> 
(a ® bi) is R -biadditive, there exists a Z-homomorphism 

(P'. A ® R (^2 Bj) -> ^P(A ® R Bi) 
i i 
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with <p\ a 0 (bj) i-> (a 0 bj). If R is commutative, then A 0s (£T g/ £?,) and 
Y2i e i(A 0 r Bj ) are /(-modules and ip is an /(-map (for <p is the function given by the 
universal mapping problem in Proposition 8.82). 

To see that q> is an isomorphism, we give its inverse. Denote the injection B j —> /(, 

by kj [where kj(bj) e JT Bj has y'th coordinate bj and all other coordinates 0], so that 
1 a 0 kj : A 0s Bj -> A 0s (JL Bj). That direct sum is the coproduct in sMod gives a 
homomorphism 6: JL (A 0 s Bj) —> A 0 s Bj) with 9: (a 0 bj) i-> a 0 ''-i(bti¬ 

lt is now routine to check that 9 is the inverse of (p, so that <p is an isomorphism. • 

There is a theorem of C. E. Watts (see Rotman, An Introduction to Homological Algebra, 
page 77) saying that if T: sMod -> Ab is a (covariant) right exact functor that preserves 
direct sums, then there is a right /(-module A so that F is naturally equivalent to A0s . 

Example 8.88. 

Let k be a field and let V and W be fc-modules; that is, V and W are vector spaces over 
k. Now W is a free k-module; say, W = (Wj), where {u>j : i e /} is a basis of W. 

Therefore, V 0 a W = £L g/ V 0 a (wj)- Similarly, V — where {vj : je/) 

is a basis of V and, for each i, V 0a (u>ti = IZjeJ i v j) ( w i)- But the one-dimensional 
vector spaces (u ; -) and (wj) are isomorphic to k, and Proposition 8.86 gives (u/) 0 a (u),} = 
(vj 0 Wj Hence, V 0a W is a vector space over k having {u ; - 0 Wj : i el and j e J} as 
a basis. In case both V and W are finite-dimensional, we have 

dim(V 0 a W) = dim(V) dim(W). ◄ 


Example 8.89. 

We now show that there may exist elements in a tensor product V 0* V that cannot be 
written in the form u 0 w for u, w e V. 

Let v\ , i >2 be a basis of a two-dimensional vector space V over a field k. As in Exam¬ 
ple 8.88, a basis for V 0a- V is 

V\ 0 V\, Vl 0 U2, V2 0 Vi, V2 0 U2- 

We claim that there do not exist u,weV with t>i 0 U 2 + U 2 0 v\ = »0 w. Otherwise, 
write u and w in terms of iq and 1 ) 2 : 

t’l 0 t>2 + l>2 0 Vj — U 0 W 

= (av 1 + bV 2 ) 0 (ctq +di> 2 ) 

— acvi 0 tq + adv\ 0 tq + bcv 2 0 tq + bd i >2 0 tq. 

By linear independence of the basis. 


ac — 0 — bd and ad = 1 = be. 


The first equation gives a — 0 or c — 0, and either possibility, when substituted into the 
second equation, gives 0=1. ◄ 
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As a consequence of Theorem 8.87, if 

0 -> B' -4 B 4 - B" -* 0 

is a split short exact sequence of left /^-modules, then, for every right R -module A, 

0 -+ A ® R B' ^ A ® R B A ® R B" -* 0 
is also a split short exact sequence. What if the exact sequence is not split? 

Theorem 8.90 (Right Exactness). Let A be a right R-module, and let 

B' A B A B" -±0 

be an exact sequence of left R-modules. Then 

A ® R B' A® r B ^ A ®r B" -> 0 
is an exact sequence of abelian groups. 

Remark. 

(i) The absence of 0 -> at the beginning of the sequence will be discussed later; clearly 
this has something to do with our initial problem of imbedding a group G in a vector 
space over Q. 

(ii) We will give a nicer proof of this theorem once we prove the adjoint isomorphism 

(see Proposition 8.100) < 


Proof There are three things to check. 

(i) im(l <g> i) C ker(l <g> p). 

It suffices to prove that the composite is 0; but 

(1 ® p)(l ® 0 = 1 ® pi = 1 ® 0 = 0. 

(ii) ker(l <g> p) C im(l <g> i). 

Let E = im(l <g> i). By part (i), E C ker(l ® p), and so 1 ® p induces a map 
p : (A <g> B)/E —>■ A <g> B" with 

p\ pb, 

where a e A and b e B. Now if 7r : A ® B —> (A ® B)/E is the natural map, then 
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for both send a ® b i-> a ® pb. 



Suppose we show that p is an isomorphism. Then 

ker(l <g> p) — ker/wr = ker7r = E = im(l <g> i), 

and we are done. To see that p is, indeed, an isomorphism, we construct its inverse 
A® B" (A® B)/E. Define 

/: Ax B" -* (A® B)/E 

as follows. If b" e B", there is b e B with pb — //', because p is surjective; let 

/: ( a , b") h -+ a ® b. 

Now / is well-defined: If pb\ = b", then p(b — b\) — 0 and b — b\ e ker p = im i. Thus, 
there is b' e B' with ib' = b — b\, and hence a ® (b — b i) = a ® ib' e im(l ® i) = E. 
Clearly, / is R -biadditive, and so the definition of tensor product gives a homomorphism 
/; A® B" (A ® B)/E with f(a ® b") — a ® b + E. The reader may check that / is 
the inverse of /?, as desired. 

(iii) 1 ® p is surjective. 

If J2 a i ® b" e A ® B", then there exist h, e B with pb t = b'( for all i, for p is 
suijective. But 

1 ® p : ^ cij ® b, ^ cij ® pbi = ^ at ® b". • 



is exact. 
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Definition. A (covariant) functor T : a* Mod —> Ab is called right exact if exactness of a 
sequence of left R -modules 

o 

implies exactness of the sequence 

T(B') T(B) T -^l T(B") -> 0. 

There is a similar definition for covariant functors Mods -> Ab. 

In this terminology, the functors A<E)r and 0 r B are right exact functors. 

The next example illustrates the absence of “0 —*” in Theorem 8.90. 

Example 8.91. 

Consider the exact sequence of abelian groups 

0 -* Z -4 Q Q/Z 0, 

where i is the inclusion. By right exactness, there is an exact sequence 

I 2 <® Z —I 2 ® Q —> I 2 <S> (Q/Z) —>■ 0 

(in this proof, we abbreviate to <g>). Now I 2 <8> Z = I 2 , by Proposition 8.86. On the 
other hand, if a ® q is a generator of I 2 ® Q, then 

a ® q — a ® ( 2q/2) — 2a ® (q/2) = 0 ® (q/ 2) = 0. 

Therefore, I 2 ® Q = 0, and so 1 <g> i cannot be an injection. < 

The next proposition helps compute tensor products. 

Proposition 8.92. For every abelian group B, we have H„ B = B/nB. 

Proof. If A is a finite cyclic group of order n, there is an exact sequence 

o-*z-^>z4a-*o, 

where /x„ is multiplication by n. Tensoring by an abelian group B gives exactness of 

Z B ^ B Z ® z B P -^> A ® z B 0. 


Consider the diagram 


„ /x n <8>lfl ^ P<8)Ib 

Z 0^ B ——>■ Z 0^ B ->■ A 1 


B 


i 

B 


P'tt 


0 


e 


v 

B 


B/nB 


0 , 
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where 0 : T,®z It —»■ B is the isomorphism of Proposition 8.86, namely, 0 : m <g> b \—> mb. 
where m e Z and b e B. This diagram commutes, for both composites take m(&b m* nmb. 
The next, very general, proposition will apply to this diagram, yielding 

A <g>z B = B/nB. • 


Proposition 8.93. Given a commutative diagram with exact rows in which the vertical 
maps f and g are isomorphisms, 


A' —^ A —^ A" -0 

f S \h 

T j V q V 
B' —B -> B" -*- 0, 

there exists a unique isomorphism h : A" —> B” making the augmented diagram commute. 

Proof. If a" e A", then there is a e A with p(a) — a " because p is surjective. Define 
h(a") — qgia). Of course, we must show that h is well-defined; that is, if u e A satifies 
p{u) = a" , then qg(u) = qg(a). Since p(a) = p{u), we have p(a — u) = 0, so that 
a — u e kerp = im i, by exactness. Hence, a — u — i(a r ), for some a' e A'. Thus, 

qg(a -u) = qgi(a') = qjf(a') = 0, 

because qj = 0. Therefore, h is well-defined. 

To see that the map h is an isomorphism, we construct its inverse. As in the first para¬ 
graph, there is a map h' making the following diagram commute: 



We claim that h' — li 1 . Now h'q = pg 1 . Hence, 

h'hp = h'qg = pg~ l g = p; 

since p is surjective, we have h'h = 1^". A similar calculation shows that the other 
composite hh' is also the identity. Therefore, h is an isomorphism. If It: A" —> It" 
satisfies h! p — qg and if a" e A", choose a e A with pa = a". Then h'pa — h'a" = 
qga — ha" , and so h is unique. • 

The proof of the last proposition is an example of diagram chasing. Such proofs appear 
long, but they are, in truth, quite routine. We select an element and, at each step, there 
is essentially only one thing to do with it. The proof of the dual proposition is another 
example of this sort of thing. 
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Proposition 8.94. Given a commutative diagram with exact rows in which the vertical 
maps g and h are isomorphisms, 


0 

0 



\ P - A 

/i 

^ r 

i > A 

\f 


8 h 

\ J 

\ 

/ y 

. <1 . ", 


'j'q 

B’ —^ B -> B , 


there exists a unique isomorphism f : A’ -> B' making the augmented diagram commute. 
Proof. A diagram chase. • 

A tensor product of two nonzero modules can be zero. The following proposition gen¬ 
eralizes the computation in Example 8.91. 


Proposition 8.95. IfT is an abelian group with every element of finite order and if D is 
a divisible abelian group, then T ®i D = {0}. 

Proof. It suffices to show that each generator t (gj d, where t e T and d e D, is 0 in 
T ®i D. Since t has finite order, there is a nonzero integer n with nt = 0. As D is 
divisible, there exists d' e D with d — nd 1 . Hence, 


t <g> d = t <g> nd' = nt ® d’ — 0 <g) d’ = 0. • 

We now understand why we cannot make a finite cyclic group G into a Q-module, for 
Q ®z G = {0}. 

Corollary 8.96. If D is a nonzero divisible abelian group with every element of finite 
order (e.g., D — Q/Z), then there is no multiplication D x D —> D making D a ring. 

Proof. Assume, on the contrary, that there is a multiplication p,: DxD -> D making D a 
ring. If 1 is the identity, we have 1 f 0, lest D be the zero ring, which has only one element. 
Since multiplication in a ring is Z-bilinear, there is a homomorphism Jl: D <S) 7 , D -> D 
with Jl(d ® d') — p(d, d') for all d, d' e D. In particular, if if ^0, then Jl(d ® 1) = 
p{d, 1 ) — d ^0. But D ®z D — {0}, by Proposition 8.95, so that jl(d <g> 1) = 0. This 
contradiction shows that no multiplication /i on D exists. • 

The next modules arise from tensor products in the same way that projective and injec¬ 
tive modules arise from Horn. Investigation of the kernel of A B' —> A B is done 
in homological algebra; it is intimately related with a functor called Tor. 

Definition. If R is a ring, then a right R -module A is flat 8 if, whenever 

0 -> B’ B 4 B" -» 0 

8 This term arose as the translation into algebra of a geometric property of varieties. 
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is an exact sequence of left R -modules, then 

0 A ® R B' A ® R B ^ A ® R B" -> 0 

is an exact sequence of abelian groups. Flatness of a left .R-module is defined similarly. 

In other words, A is flat if and only if A®r is an exact functor. Because the functor 
A ® r is right exact, we see that A is flat if and only if, whenever i : B' —> B is an 
injection, then 1,4 <g> i: A ® R B' — > A ® R B is also an injection. 

Lemma 8.97. If every finitely generated submodule of a right R-module M is flat, then 
M is flat. 

Remark. Another proof of this lemma is given in Corollary 8.103. ◄ 

Proof. Let i: A —> B he an injective /(-map between left R-modules, and assume that 
u = xj ® yj e ker(l^ ® i), where xj e M and yj e A. As u e M ® R A, we have 


n 

0 = (1 m ® i)u = ’^2,Xj ® iyj. 

7=1 

Let F be the free abelian group with basis M x A, and let S be the subgroup of F con¬ 
sisting of the relations of F/S = M ® R A (as in the construction of the tensor product in 
Proposition 8.74); thus, S is generated by all elements in F of the form 

(m, a + a') — (m, a) — (m, a'); 

(m + m', a) — (m, a) — (in ', a); 

(mr, a) — (m, ra). 

Let M' be the submodule of M generated hv v i, ..., x n together with the (finite number 
of) first “coordinates” in M exhibiting ffflxj. iy f) as a linear combination of relators 
just displayed. Of course, M' is a finitely generated submodule of M. The element u' = 
J2xj® yj e M' ® R A (which is the version of u lying in this new tensor product M' ® R A) 
lies in ker 1 M’®f f° r we have taken care that all the relations making (1 \.j ®i){u) — 0 are 
still present. But M' is a finitely generated submodule of M, so that it is flat, by hypothesis, 
and so (1 a/' f)( m) = 0 implies if — 0 in M'®rA. Finally, if t: M' —> M is the inclusion, 
then (l ® 1 a)( if ) = u, and so u — 0. Therefore, 1 m <S> i is injective and M is flat. • 

We will use this lemma to prove that an abelian group is a flat Z-module if and only if 
it has no nonzero elements of finite order (see Corollary 9.6). Here are some examples of 
flat modules. 

Lemma 8.98. Let R be an arbitrary ring. 


(i) The right R-module R is aflat R-module. 



592 


Algebras Ch. 8 


(ii) A direct sum of right R-modules E j M j is flat if and only each M j is flat. 

(iii) Every projective right R-module F is flat. 

Proof, (i) Consider the commutative diagram 

A - - ->■ B 

O X 

Y V 

R®rA ->■ R®rB 

1 R®1 

where i: A -> B is an injection, a : a i->- 1 0 a, and 1 0 b. Now both a and r 

are isomorphisms, by Proposition 8.86, and so 1 r <® i — r/cr -1 is an injection. Therefore, 
R is a flat module over itself. 

(ii) By Proposition 7.30, any family of R-maps {/,•: Uj —> V,} can be assembled into an 
R-map q>\ E / Uj E / K/’ where cp: ( uj ) (Jj (u j )), and it is easy to check that q> is 
an injection if and only if each fj is an injection. 

Let i: A -> B he an injection. There is a commutative diagram 

(Ej M j ) ®R A (Ej M l) ®R B 


Ej(Mj ®R A) Ej(Mj ®R B ), 

where <p\ (mj ® a) i-> (mj ® ia ), where 1 is the identity map on E/ Mj- an d where the 
downward maps are the isomorphisms of Proposition 8.87. 

By our initial observation, 10/ is an injection if and only if each 1 v/ , 0/ is an injection; 
this says that E / Mj is flat if and only if each Mj is flat. 

(iii) Combining the first two parts, we see that a free /^-module, being a direct sum of 
copies of R, must be flat. Moreover, since a module is projective if and only if it is a direct 
summand of a free module, part (ii) shows that projective modules are always flat. • 

We cannot improve this lemma without further assumptions, for there exist rings R for 
which every flat R -module is projective. 

There is a remarkable relationship between Horn and 0. The key idea is that a func¬ 
tion of two variables, say, /; A x B -> C, can be viewed as a one-parameter family of 
functions of one variable: If we fix a e A , then define f a : B -> C by h f (a, b). 
Recall Lemma 8.80: If R and S are rings and Ar and rB$ are modules, then ,4 0 r B 
is a right 5-module, where (a 0 b)s = a 0 (bs). Furthermore, if Cs is a module, then 
it is easy to see that Homs(ZL C) is a right R-module, where ( f r){b ) = f(rb ); thus 
Horn/,* (4, Hom >, (/L C)) makes sense, for it consists of R-maps between right R-modules. 
Finally, if F e Hom^fA, Homs(B, C)), we denote its value on a & A by F a , so that 
F a : B —> C, defined by F c : b \—r F(a)(b), is a one-parameter family of functions. 
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Theorem 8.99 (Adjoint Isomorphism). Given modules Ar, rBs, and Cs, where R 
and S are rings, there is an isomorphism 

xa.b.c '■ Homs (A ®R B, C ) -» Horrid (A, Homs(B, C)), 

namely, for f: A ®r B C and a e A and b e B, 

*a,b,c :/!->■/*, where f*:b\-+ f (a <g> b ). 

Indeed, fixing any two of A, B, C, the maps Xa.b.C constitute natural equivalences 

Homs( ®rB, C) —> Hom^( , Homs(B, C)), 


Homs(A®« , C) —> Homs(A, Homs( , C)), 

and 

Homs(A B, )-> Hom^(A, Homs(B, )). 

Proof. To prove that r = r a.b.c is a Z-homomorphism, let fig: A <g )r B —> C. The 
definition of f + g gives, for all a e A, 

r(f + g)a :bt-+ (f + g)(a <g> b) = f(a ® b) + g(a ® (?) 

= T(f)a(b) + z(g) a (b). 


Therefore, r(f + g) = r(f) + r(g). 

Next, r is injective. If r (f) a = 0 for all a e A, then 0 = r(f) a (b) — f(a ® b) for all 
a e A and b e B. Therefore, f — 0 because it vanishes on every generator of A <S)r B. 

We now show that r is surjective. If F: A -> Homs(B, C) is an /Nmap, define 
(p: A x B -* C by <p(a, b) = F a (b). Now consider the diagram 

Ax B ---^ A® r B 



C 

It is straightforward to check that <p is R -biadditive, and so there exists a Z-homomorphism 
\p: A ®r B -> C with tp(a ® b) = q>(a, b) = F a (b) for all a e A and b e B. Therefore, 
F — r(<p), so that r is surjective. 

We let the reader prove that the indicated maps are natural transformations; diagrams 
and the proof of their commutativity must be given. • 

Given any two functors F : C V and G: V —> C, we called the ordered pair (F, G ) 
an adjoint pair if, for each pair of objects C e C and D e V, there are bijections 

r C.D'- Horn x>(FC, D) — > Homc(C, GD ) 

that are natural transformations in C and in D. It follows from Theorem 8.99 that 
( ®rB, Homf/i, )) is an adjoint pair. 
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As promised earlier, here is another proof of Theorem 8.90, the right exactness of tensor 
product. Since ( <S) r B ■ Hom(R ,)) is an adjoint pair of functors, the right functor <g> must 
preserve all direct limits, by Theorem 7.105. But cokernel is a direct limit, and a functor is 
right exact if it preserves cokernels. Here is this proof in more concrete terms. 

Proposition 8.100. Let Abe a right R-module, and let 

B' -4 B 4 - B" -* 0 

be an exact sequence of left R-modules. Then 

A ® R B' A® r B ^ A ® R B" 0 
is an exact sequence of abelian groups. 

Proof Regard a left R-module B as a (R, Z)-bimodule, and note, for any abelian group 
C, that Horn z(B, C ) is a right R-module, by Exercise 8.45 on page 603. In light of Propo¬ 
sition 7.48, it suffices to prove that the top row of the following diagram is exact for every 
C: 


0 h- Hom z (A <g ) R B", C) ^ Hom z (A ® R B. C ) Hom z (A <g ) R B', C) 


Z A,C 

0->■ Hom^fA, H") 


r 

Hom^efA, H) 



Hom^fA, //'), 


where H" = Horn z(B", C), H = Hornz(B, C), and H' — Horn z(B' , C ). By the adjoint 
isomorphism, the vertical maps are isomorphisms and the diagram commutes. The bottom 
row is exact, for it arises from the given exact sequence B' —> B -> B" -> 0 by first 
applying the left exact (contravariant) functor Horn^i , C), and then applying the left exact 
(covariant) functor Hom^(A, ). Exactness of the top row now follows from Exercise 8.51 
on page 604. • 

In Theorem 7.92, we proved that Hom(A, ) preserves inverse limits; we now prove that 
A® preserves direct limits. This, too, follows from Theorem 7.105. However, we give 
another proof based on the construction of direct limits. 


Theorem 8.101. If A is a right R-module and { If , <pi} is a direct system of left R-modules 
(over any not necessarily directed index set I), then 

A®r lim Bj = lim(A R ( ). 

Proof Note that Exercise 7.66 on page 517 shows that {A <g>^ R,, 1 ® ip'■} is a direct 
system, so that lim(A <g>^ Bf) makes sense. 

We begin by constructing lim B, as the cokernel of a certain map between sums. For 
each pair i, j e I with i < j in the partially ordered index set /, define Ifj to be a module 
isomorphic to B, by a map bj i-»- bjj, where bj e Bj , and define a : • Bjj —> Bj by 

cr: bj i-> kjtp'jbj - X/bj, 
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where A.,- is the injection of B, into the sum. Note that imcr — S, the submodule arising in 
the construction of Urn B , in Proposition 7.94. Thus, coker a — (ff Bj)/S = lim Bj, and 
there is an exact sequence 

J2 B u Y ^ B > °- 

Right exactness of A®r gives exactness of 

A ( T, Bjj^J A ® R ( y Bj^J —> A (lim B,) -> 0. 

By Theorem 8.87, the map r: A <S)r (y,i Bj^J -> £T(A Bj ), given by 

r: a ® ( bi) (a 0 bj), 

is an isomorphism, and so there is a commutative diagram 

A ® Bij ll8)tT >- At&JfBi - A ® lim Bj ->. q 

r r' 

„ V 

£(A 0 Bij) —J2(A ® Bj) -s- lhn(A ® Bj) -^ o, 

where r' is another instance of the isomorphism of Theorem 8.87, and 

a : a 0 bjj m* (1 ® A/) (a ® <p l jbi) — (1 0 A,)(a' ® bi). 

By Proposition 8.93, there is an isomorphism A lim B, -> coker ct = lim(A Bj), 
the direct limit of the direct system {A Bj, 1 ® ^'}. • 

The reader has probably observed that we have actually proved a stronger result: any 
right exact functor that preserves sums must preserve all direct limits. The dual result 
also holds, and it has a similar proof; every left exact functor that preserves products must 
preserve all inverse limits. In fact, if ( F. G) is an adjoint pair of functors (defined on 
module categories), then F preserves direct limits and G preserves inverse limits. 


Corollary 8.102. If {F, , <p l j\ is a direct system of flat right R-modules over a directed 
index set /, then lim Fj is also flat. 

k 

Proof. Let 0 —>• A —> B be an exact sequence of left R-modules. Since each Fj is flat, 
the sequence 

_ . 1 /®* 

0 —> Fj ®r A — » Fj ®r B 

is exact for every i, where 1/ abbreviates 1 f,. Consider the commutative diagram 


0->■ lim(C,- ® A) —lim(F,- ® B) 


l 


(limR,) ® B, 


0 


^ (lim Fj) ® A 
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where the vertical maps <p and f are the isomorphisms of Theorem 8.101, the map k is 
induced from the transformation of direct systems {1; <8> k}, and 1 is the identity map on 
lim Fj. Since each F, is flat, the maps 1/ ® k are injections; since the index set I is directed, 
the top row is exact, by Proposition 7.100. Therefore, 1® k: ( lim Fj ) ® A -> ( lim Ty) ® R 

is an injection, for it is the composite of injections \jrk(p ~ 1 . Therefore, lim Fj is flat. • 

Corollary 8.103. 

(i) If R is a domain with Q — Frac(R), then Q is aflat R-module. 

(ii) If every finitely generated submodule of a right R-module M is flat, then M is flat. 

Proof (i) In Example 7.97(v), we saw that Q is a direct limit, over a directed index set, 
of cyclic submodules, each of which is isomorphic to R. Since R is projective, hence flat, 
the result follows from Corollary 8.102. 

(ii) In Example 7.99(iii), we saw that M is a direct limit, over a directed index set, of its 
finitely generated submodules. Since every finitely generated submodule is flat, by hypoth¬ 
esis, the result follows from Corollary 8.102. We have given another proof of Lemma 8.97. 


Corollary 7.75 can be extended from abelian groups to modules over any ring. 

Theorem 8.104. For every ring R, every left R-module M can be imbedded as a sub- 
module of an injective left R-module. 

Proof. Regarding R as a bimodule zRr and an abelian group D as a left Z-module, we 
use Exercise 8.45 on page 603 to see that Hornz(R, D) is a left R-module; the scalar 
multiplication R x Horn z(R. D) -> Horn z(R, D) is given by (a. (p) a<p, where 
acp: r i—>• (pira ). 

If now D is a divisible abelian group, we claim that H — Horn z(R- D) is an injec¬ 
tive R-module; that is, we show that Hom/d , H) is an exact functor. Since Horn is 
left exact, it suffices to show that if i : ,4' -> A is an injection, then the induced map 
i *: Horn a>M. H) -> HoiriftM', H) is a surjection. Consider the following diagram. 

Homft(A, Hornz(R, D)) —Hom^lA', Horn z(R- D)) 

Hornz(A ®ft R, D) -^ Homz(A' ®r R, D) 

Homz(A, D) - 5 - Homz(A', D) 

The adjoint isomorphism gives commutativity of the top square. The bottom square arises 
from applying the contravariant functor HomjJ , D) to the following diagram, which 
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commutes because the isomorphism A —>• A R. given by a m* a <g> 1, is natural. 

A R <■- A' 7? 


A -<■-A' 

Since D is divisible. Corollary 7.73 says that D is an injective Z-module. Therefore, 
Homz( , D) is an exact functor and the bottom row in the large diagram is surjective. 
Since all the vertical maps in the large diagram are isomorphisms, commutativity now 
gives i* surjective. We conclude that Homj : ( R. D) is an injective left R -module. 

Finally, regard M as an abelian group. By Corollary 7.75, there is a divisible abelian 
group D and an injective Z-homomorphism j : M —> D. It is now easy to see that there is 
an injective /?-map M —> Horn z(R, D ), namely, m i-> f m , where f m (r ) = j(rm ) e Z); 
this completes the proof. • 

This last theorem can be improved, for there is a smallest injective module containing 
any given module, called its injective envelope (see Rotman, An Introduction to Homolog¬ 
ical Algebra , page 73). 

We have already seen, in Proposition 7.69, that if R is a noetherian ring, then every 
direct sum of injective modules is injective; we now prove the converse. 

Theorem 8.105 (Bass). If R is a ring for which every direct sum of injective left R- 
modules is injective, then R is left noetherian. 

Proof. We show that if R is not left noetherian, then there is a left ideal I and an R-map 
to a sum of injectives that cannot be extended to R. Since R is not left noetherian, there 
is a strictly ascending chain of left ideals f C I 2 C • • •; let I — [J hi- We note that 
///„ f {0} for all n. By Theorem 8.104, we may imbed ///„ in an injective left /?-module 
E„; we claim that E — E„ is not injective. 

Let 7 r„ : I —v I //„ be the natural map. For each a e I , note that Tt n (a) — 0 for large n 
(because a e I„ for some n), and so the R-map /:/—>■ ]~[(///„), defined by 

/: a h* (jr n (a)), 

does have its image in (///„); that is, for each a e /, almost all the coordinates of 
f(a) are 0. Composing with the inclusion ^(///„) —>• E n = E, we may regard / as 
a map I -» E. If there is an R-map g: R —» E extending /, then g ( I) is defined; say, 
g(l) = (x n ). Choose an index m and choose a e I with a f since a j. I m , we have 
7 x m (a) 0, and so g(a) — f(a) has nonzero wth coordinate n m (a). But g(a) — ag(l) = 
a(x n ) — ( ax n ), so that Tt m (a) = ax m . It follows that x m 0 for all m, and this contradicts 
g(l) lying in the direct sum E = E n . • 


We are now going to give a connection between flat modules and projective modules. 
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Definition. If B is a right R -module, define its character module B* as the left R -module 

B* = Horn Z (B, Q/Z). 

Recall that B* is a left R-module if one defines rf , for r e R and / : B —> Q/Z, by 

rf-b i-> /(for). 

The next lemma improves Proposition 7.48: If i: A f —>• A and p: A -> A" are maps 
and, for every module B, 

0 Hom(A", B) -C Hom(A, B) -4- Hom(A', B) 
is an exact sequence, then so is 

a'-4a-4a% o. 

Lemma 8.106. A sequence of right R-modules 



is exact if and only if the sequence of character modules 

04C'4b'4a*^0 


is exact. 

Proof If the original sequence is exact, then so is the sequence of character modules, for 
the contravariant functor Hom z ( , Q/Z) is exact, because Q/Z is an injective Z-module, 
by Corollary 7.73. 

For the converse, it suffices to prove that ker a = i m ft without assuming either a* 
suijective or f>* is injective. 
imofC ker ft. 

If x e A and oix f ker/, then fiaix) 0. By Exercise 7.57(i) on page 488, there is a 
map /: C —* Q/Z with ffa(x) /= 0. Thus, f e C* and ffa /= 0, which contradicts the 
hypothesis that a* ft* = 0. 
ker / Cimcf. 

If y e ker / and y </ im a. then y + ima is a nonzero element of B/ima. Thus, there 
is a map g: B/ima -> Q/Z with g(y + ima) ^ 0, by Exercise 7.57(i) on page 488. 
If v: B —> B/ima is the natural map, define g' = gv e B*; note that g\y) 0, for 
g'(y) = gv(y) = g(y + ima). Now g'lfma) — {0}, so that 0 = g'a — a*(g f ) and 
g' e kera* = im /*. Thus, g' = fi*(h) for some h e C*; that is, g' — /;/. Hence, g'(y) = 
hfi(y), which is a contradiction, for g'(y) 0, while fo/(y) = 0, because y e ker/. • 
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Proposition 8.107. A right R-module B is flat if and only if its character module B* is 
an injective left R-module. 

Proof. As in the proof of Theorem 8.104 with B playing the role of R (so that flat¬ 
ness implies that the map A' <S)r B -> A ®r B is injective), the left R -module B* = 
Horn z(B, Q/Z) is injective. 

Conversely, assume that B* is an injective left /Cmodule and A! —>■ A is an injection 
between left /^-modules A' and A. Since Hornet,4. B*) — Horn^(,4. HomjfB. Q/Z)), 
the adjoint isomorphism. Theorem 8.99, gives a commutative diagram in which the vertical 
maps are isomorphisms. 

Horn*(A, B*) ->Hom s (T, B*) -^ 0 

Y 

Horn i(B ®r A, Q/Z)->■ Horn %(B <g> A', Q/Z)-0 


(B ®r A)* ->■ (.B ® R A')* --0 

Exactness of the top row now gives exactness of the bottom row. By Lemma 8.106, the 
sequence 0 —> B ®r A' —> B ®r A is exact, and this gives B flat. • 

Corollary 8.108. A right R-module B is flat if and only if for every finitely generated 
left ideal /, the sequence 0 —> B ®r I —> B ®r R is exact. 

Proof. If B is flat, then the sequence 0 —»• B <g )r I — > B ®r R is exact for every left 
/?-module /; in particular, this sequence is exact when I is a finitely generated left ideal. 
Conversely, the hypothesis of exactness ofO B ®r I B ®r R for every finitely 
generated left ideal I allows us to prove exactness of this sequence for every left ideal, 
using Proposition 7.100 and the fact that tensor product commutes with direct limits. There 
is an exact sequence (B®r R)* -* (B®rI)* —*■ 0 that, by the adjoint isomorphism, gives 
exactness of Hom^(/L B *) -» Homs(/, B *) —> 0. This says that every map from an ideal 
I to B* extends to a map R —> B* ; thus, B* satisfies the Baer criterion. Theorem 7.68, 
and so B* is injective. By Proposition 8.107, B is flat. • 

Lemma 8.109. Given modules (rX,r Z$), where R and S are rings, there is a natural 
transformation in X , Y, and Z 

rx.Y.z ■ Hom s (T, Z) ® R X -> Hom s (Hom^(A, F), Z). 

Moreover, Tx.Y.Z tin isomorphism whenever X is a finitely generated free left R-module. 

Proof. Note that both Homs(Y, Z) and Hom/QA, F) make sense, for F is a bimodule. If 
/ € HomQF, Z) and x e X, define t x,Y,z(f ® x) to be the 5-map Hom^l A. F) -> Z 
given by 


tx,Y,z(f ® x): g f(g(x)). 
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It is straightforward to check that tx.y.z is a homomorphism natural in X, that r r,y,z 
is an isomorphism, and, more generally, that r x.Y.z is an isomorphism when X is a finitely 
generated free left R -module. • 

Theorem 8.110. A finitely presented left R-module B is fiat if and only if it is projective. 

Proof All projective modules are flat, by Lemma 8.98, and so only the converse is sig¬ 
nificant. Since B is finitely presented, there is an exact sequence 

-» B -* B -> 0, 

where both F' and F are finitely generated free left B-modules. We begin by showing, 
for every left B-module Y [which is necessarily an (R — Z)-bimodule], that the map 
r B — tb.y.Q/Z- Y* (&r B -* Homfi(B, Y)* of Lemma 8.109 is an isomorphism. 
Consider the following diagram. 

Y* ® R F' -^ Y* <g ) R F -^ Y* ® R B ->■ 0 



HomflCB', Y j* ->■ Horn R {F, Y)* ->- Horn R (B, Y f -*- 0 

By Lemma 8.109, this diagram commutes [for Y* <&r F — Hom^CT, Q/Z) ®j f] and the 
first two vertical maps are isomorphisms. The top row is exact, because Y*<%> r is right ex¬ 
act. The bottom row is also exact, because Horn/A , Y)* is the composite of the contravari- 
ant functors Hom^( , T), which is left exact, and * = Homz( , Q/Z), which is exact. 
Proposition 8.93 now shows that the third vertical arrow, r B : Y* <S) r B -» Horn /A B, Y )*, 
is an isomorphism. 

To prove that B is projective, it suffices to prove that Hom( B, ) preserves surjections: If 
A —>■ A" -> 0 is exact, then Hom(B, A) -> Hom(B, A") —> 0 is exact. By Lemma 8.106, 
it suffices to show that 0 -> Homf/i. A")* Hom(B, A)* is exact. Consider the diagram 

0-*• A"* B -*■ A* ® R B 



0-^ Hom(B, A”)* -^ Hom(B, A)* 

Naturality of r gives commutativity of the diagram, while the vertical maps r are isomor¬ 
phisms, because B is finitely presented. Since A -* A" —> 0 is exact, 0 —> A"* —»■ A* 
is exact, and so the top row is exact, because B is flat. It follows that the bottom row is 
also exact; that is, 0 -* Hom(B, A")* -> Hom(B, A")* is exact, which is what we were 
to show. Therefore, B is projective. • 
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Corollary 8.111. If R is right noetherian, then a finitely generated right R-module B is 
flat if and only if it is projective. 

Proof. This follows from the theorem once we recall Proposition 7.59 (which holds for 
noncommutative rings): Every finitely generated module over a noetherian ring is finitely 
presented. • 

Here is a nice application of tensor products that helps to place the Wedderburn-Artin 
theorems in perspective. 

Definition. A module P is small if the covariant Horn functor Hom( If ) preserves 
(possibly infinite) direct sums. 

For example. Proposition 8.85 shows that every ring R is a small R -module. 

To say that P is small means more than that there is some isomorphism 


Hom(P, B d = Hom ( p ’ £/); 

iet iel 

it also means that Hom( P. ) preserves the coproduct diagram; if A.,-: B, —> B are the in¬ 
jections, where B — If. then the induced maps (A.,-)*: Hom( P. If ) Hom(/ J . B) 
are the injections of Hom(P, If). 

Example 8.112. 

(i) Any finite direct sum of small modules is small, and any direct summand of a small 
module is small. 

(ii) Since a ring R is a small /?-module, it follows from (i) that every finitely generated free 
/?-module is small and that every finitely generated projective R -module is small. 4 


Definition. A right R -module P is a generator of Mod r if every right /?-module M is a 
quotient of some direct sum of copies of P. 

It is clear that R is a generator of Mod r, as is any free right /^-module. However, a 
projective right /?-module may not be a generator. For example, if R = Ig, then R = 
P © Q, where P — {[0], [2], [4]} = I 3 , and the projective module P is not a generator (for 
Q = I 2 is not a quotient of a direct sum of copies of P). 

Theorem 8.113. Let R be a ring and let P be a small projective generator o/Mod^. If 
S — End^(P), then there is an equivalence of categories 


Modtf = Mods. 


Proof. Notice that P is a left 5-module, for if x e P and f,geS — End«(P), then 
(g o f)x = g(fx). In fact, P is a (5, f?)-bimodule, for associativity f{xr) = ( fx)r , 
where r e R, is just the statement that / is an /Amap. It now follows from Corollary 8.81 
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that the functor F: Mod s -► Ab, defined by F = ®sP> actually takes values in Mod/;. 
Exercise 8.45(h) on page 603 shows that the functor G : Horned P, ): Mod/; -> Ab 
actually takes values in Mods- As (F, G) is an adjoint pair. Exercise 7.75 on page 518 
gives natural transformations FG -» 1/; and 1 y —> GF, where 1/; and 1$ denote identity 
functors on the categories Mod/; and Mod s, respectively. It suffices to prove that each of 
these natural transformations is a natural equivalence. 

Since P is a projective right /^-module, the functor G — Hom//(7\ ) is exact; since P 
is small, G preserves direct sums. Now F — ®sP, as any tensor product functor, is right 
exact and preserve sums. Therefore, both composites GF and FG preserve direct sums 
and are right exact. 

Note that 

FG{P) = F(Hom R (P, P)) = F(S ) = S P = P. 

Since P is a generator of Mod/;, every right /^-module M is a quotient of some direct sum 

of copies of P : There is an exact sequence K —»■ E P —E M -> 0, where K = ker /. 
There is also some direct sum of copies of P mapping onto K , and so there is an exact 
sequence 

Hence, there is a commutative diagram (by naturality of the upward maps) with exact rows 

E p - 4 E p- -^ M -i, 0 

A f 

E FG{P) -^ E FG(P) -*- FG(M )-^ 0 

We know that the first two vertical maps are isomorphisms, and so a diagram chase (see 
the five lemma. Exercise 8.52 on page 604) gives the other vertical map an isomorphism; 
that is, FG(M) = M, and so 1// = FG. 

For the other composite, note that 

GF(S) = G(S <g> 5 P) = G{P) = Horn R (P, P) = S. 

If N is any left 5-module, there is an exact sequence of the form 

0 , 

because every module is a quotient of a free module. The argument now concludes as that 
just done. • 

Corollary 8.114. If R is a ring and n > 1, there is an equivalence of categories 




Mod/; = Modviai,,!/;). 
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Remark. There is an equivalence of categories gop Mod —> Mod/;, by Exercise 8.22 on 
page 533. In particular, if R is commutative, then 

tfMod = ModMat„(«)- ** 

Proof. For any integer n > 1, the free module P = ff' = \ If, where If = R, is a small 
projective generator of Mod^, and 5 = End^(P) = Mat,, (R). • 

We can now understand Proposition 8.49: Mat,, (A) is semisimple when A is a division 
ring. By Proposition 8.48, a ring R is semisimple if and only if every /^-module is projec¬ 
tive; that is, every object in Mod r is projective. Now every A-module is projective (even 
free), so that equivalence of the categories shows that every object in ModMat„(A) is also 
projective. Therefore, Mat,, (A) is also semisimple. 

There is a circle of ideas, usually called Morita theory (after K. Morita). The first 
question it asks is when an abstract category C is equivalent to Mod /,■ for some ring R. The 
answer is very nice: A category C is isomorphic to a module category if and only if it is an 
abelian category (this just means that the usual finitary constructions in the second section 
of Chapter 7 exist; see Mac Lane, Categories for the Working Mathematician , pages 187— 
206), it is closed under infinite coproducts, and it contains a small projective object P that 
is a generator. Given this hypothesis, then C = Mods, where S = End( P ) (the proof is 
essentially that given for Theorem 8.113). 

Two rings R and S are called Morita equivalent if Mod r A Mod s - For example, it 
follows from Theorem 8.113 that every commutative ring R is Morita equivalent to the 
ring Mat,, (R), where n > 1. Moreover, if R and S are Morita equivalent, then VAR) = 
Z(.S'); that is, they have isomorphic centers (the proof actually identifies all the possible 
isomorphisms between the categories). In particular, two commutative rings are Morita 
equivalent if and only if they are isomorphic. See Jacobson, Basic Algebra II, pages 177— 
184, Lam, Lectures on Modules and Rings, Chapters 18 and 19, and Reiner, Maximal 
Orders, Chapter 4. 

In the next chapter, we will see that the tensor product R Cfk S of two ^-algebras R and S 
is also a ^-algebra. Indeed, when R and S are commutative, then R Aik S is their coproduct 
in the category of commutative ^-algebras. 

Exercises 

8.45 This exercise is an analog of Corollary 8.81. 

(i) Given a bimodule rA$, prove that Hom^(A, ): ^Mod —> ^Mod is a functor, where 
Horn/? (A, B) is the left 5-module defined by sf: a f(as). 

(ii) Given a bimodule rA$, prove that Hom^fA, ): Mody -> Mod^ is a functor, where 
Homy(A, B) is the right R -module defined by fr: af(ra). 

(iii) Given a bimodule prove that Hom^( , B): Mod^ —*■ ^Mod is a functor, where 
Horn/;(A, B) is the left 5-module defined by sf: a s[/(a)]. 

(iv) Given a bimodule rBr , prove that Hom^fA, ): ^Mod — ► Mod^ is a functor, where 
Homy(A, B) is the right R-module defined by fr: a f(a)r. 
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Remark. Let /: A —> B be an I?-map. Suppose we write f(a) when A is a right .^-module 
and (a)f when A is a left f?-module (that is, write the function symbol / on the side opposite 
the scalar action). With this notation, each of the four parts of this exercise is an associative 
law. For example, in part (i) with both A and B left .R-modules, writing sf for s e S, we have 
a(sf) — (as)f. Similarly, in part (ii), we define fr, for r e R so that ( fr)a — f(ra). ◄ 

8.46 Let V and W be finite-dimensional vector spaces over a field F, say, and let U]...., v m and 
w\,, w n be bases of V and IV, respectively. Let S: V —> V be a linear transformation 
having matrix A = [a, ;], and let T: W —>■ W be a linear transformation having matrix 
B — [bk{\. Show that the matrix of S ®T \ V W —> V IV, with respect to a suitable 
listing of the vectors Vj ® wj, is the nm x nm matrix K, which we write in block form: 


a.\ 1 B 

a 12 s •• 

d\m B 

^21 B 

a2 2 S 

a 2m B 

_(i m 1 B 

1:1 m2 b 

a mm B _ 


The matrix A ® B is called the Kronecker product of the matrices A and B. 

8.47 Let R be a domain with Q = Frac(W). If A is an f?-module, prove that every element in 
Q A has the form q ® a for q e Q and a e A (instead of JT q, ® a, ). (Compare this 
result with Example 8.89.) 

8.48 Let m and n be positive integers, and let cl = (m, n). Prove that there is an isomorphism of 
abelian groups 

H/77 ® = I'd • 

8.49 Let k be a commutative ring, and let P and Q be projective k-modules. Prove that P Q is 
a projective fc-module. 

8.50 Let k be a commutative ring, and let P and Q be flat fc-modules. Prove that P ®£ Q is a flat 
^-module. 

8.51 Assume that the following diagram commutes, and that the vertical arrows are isomorphisms. 

0-s- -s- A- a"-^ 0 

I v 

0->- B' -^ B -»- B" -^ 0 

Prove that the bottom row is exact if and only if the top row is exact. 

8.52 (Five Lemma). Consider a commutative diagram with exact rows 


A i -s- A 2 ->- A3 ->■ A 4 -5- A5 


h\ 

h 2 


hs 








B-[ ->- B2 -^ ^3 ->- ->- /?5 


(i) If I 12 and /t 4 are surjective and /z 5 is injective, prove that h 3 is surjective. 

(ii) If I 12 and / 7 4 are injective and h \ is surjective, prove that ho, is injective. 
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(iii) If hi, I12,li A , and h 5 are isomorphisms, prove that h 3 is an isomorphism. 

8.53 Prove that a ring R is left noetherian if and only if every direct limit (with directed index set) 
of injective left ^-modules is itself injective. 

Hint. See Theorem 8 . 105 . 

8.54 Let A, B. and C be categories. A functor of two variables T : A x B —»■ C assigns, to each 

ordered pair of objects (A, B). where A e ob(A) and B e ob (B), an object T{A, B) e ob(C), 
and to each ordered pair of morphisms /: A -*■ A' in A and g: B —> B' in B, a morphism 
T(f. g): T(A , B ) T{A', B'), such that 

(a) Fixing either variable is a functor: for example, if A e ob(A), then 

T a = T (A, ): B-> C 

is a functor, where T A (B) = T(A, B) and T A (g) — T(\ A , g). 

(b) The following diagram commutes: 


T (A, /.') T(A, If) 



T(f. 1 B 0 


TUa'.s) 


T(A', S') 


(i) Prove that ®: Mod/; x //Mod —»• Ab is a functor of two variables. 

(ii) Modify the definition of a functor of two variables to allow contravariance in a variable, 
and prove that Horn is a functor of two variables. 


8.5 Characters 

Representation theory is the study of homomorphisms of abstract groups G into groups 
of nonsingular matrices; such homomorphisms produce numerical invariants whose arith¬ 
metic properties help to prove theorems about G. We now introduce this vast subject with 
one goal being a proof of the following theorem. 

Theorem 8.115 (Burnside). Every group G of order p m q n , where p and q are primes, 
is a solvable group. 

Notice that Burnside’s theorem cannot be improved to groups having orders with only 
three distinct prime factors, for A 5 is a simple group of order 60 = 2 2 • 3 • 5. 

Using representations, we will prove the following theorem. 

Theorem. If G is a nonabelian finite simple group, then {1} is the only conjugacy class 
whose size is a prime power. 
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Proposition 8.116. The preceding theorem implies Burnside’s theorem. 

Proof. Assume that Burnside’s theorem is false, and let G be a “least criminal”; that 
is, G is a counterexample of smallest order. If G has a proper normal subgroup H with 
H { 1}, then both H and G/H are solvable, for their orders are smaller than |G| and are 
of the form p' q J . By Proposition 4.24, G is solvable, and this is a contradiction. We may 
assume, therefore, that G is a nonabelian simple group. 

Let Q be a Sylow ^-subgroup of G. If Q = {1}, then G is a /;-group, contradicting G 
being a nonabelian simple group; hence, Q f {1}. Since the center of Q is nontrivial, by 
Theorem 2.103, we may choose a nontrivial element x e Z(Q). Now Q < C G (x), for 
every element in Q commutes with x, and so 

[G : Q] = [G ; C G (x)][C G (x) : Q]\ 

that is, [G : C G (x)] is a divisor of [G : Q] — p m . Of course, [G : C G (x)] is the number 
of elements in the conjugacy class x G of x (Corollary 2.100), and so the hypothesis says 
that \x (j = 1; hence, x € Z(G), which contradicts G being simple. • 

The proof that the hypothesis of the proposition is true will use representation theory 
(see Theorem 8.153). 

We now specialize the definition of /:-representation on page 550 from arbitrary fields 
k of scalars to the complex numbers C. 

Definition. A representation of a group G is a homomorphism 

a: G -» GL(V), 

where V is a vector space over C. The degree of o is dim( V). 

For the remainder of this section, we restrict ourselves to finite groups and represen¬ 
tations having finite degree. If a representation er: G -> GL( V) has degree n and one 
chooses a basis of V, then each cr(g) can be regarded as an n x n nonsingular matrix with 
entries in C. 

Representations can be translated into the language of modules. In Proposition 8.37, 
we proved that every representation o : G —> GL(V) equips V with the structure of a 
left CG-module (and conversely); If g e G. then o (g ): V —> V, and we define scalar 
multiplication gv, for g e G and v e V, by 

gv = a(g)(v). 


Example 8.117. 

We now show that permutation representations, that is, G-sets, give a special kind of 
representation. A G-set X corresponds to a homomorphism n : G -> Sx, where Sx is 
the symmetric group of all permutations of X. If V is the complex vector space having 
X as a basis, then we may regard Sx < GL(V) in the following way. Each permutation 

^Recall that if a group G acts on a set X, then X is called a G-set. 
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jr(g) of X, where g e G, is now a permutation of a basis of V and, hence, it determines a 
nonsingular linear transformation on V. With respect to the basis X, the matrix of Trig) is 
a permutation matrix: It arises by permuting the columns of the identity matrix I by Trig): 
thus, it has exactly one entry equal to 1 in each row and column while all its other entries 
are 0. ◄ 

One of the most important representations is the regular representation ; in terms of 
modules, the regular representation is the group algebra CG regarded as a left module over 
itself. 


Definition. If G is a group, then the representation p: G -» GL(CG) defined, for all 
g, h e G, by 

pig): h h* gh, 

is called the regular representation. 

Two representations o : G —»■ GL( V) and r: G —> GL( W) can be added. 


Definition. If a : G — > GL( V) and r: G —> GL( W) are representations, then their sum 
a + x : G -> GL(V © W ) is defined by 

(a + r)(g): (v , w) h* (a(g)v, r{g)w) 

for all g e G, v e V. and w e W. 

In matrix terms, if o : G -> GL(n, C) and r: G -> GL(m, C), then 

a + t : G —*■ GL (n + m, C), 


and if g e G, then (cr + r)(g) is the direct sum of blocks cr(g) © r(g); that is. 


(a + r )(g) = 


<r(g) 

0 


0 

T(g) 


The following terminology is the common one used in group representations. 


Definition. A representation a of a group G is irrreducible if the corresponding CG- 
module is simple; a representation cr is completely reducible if it is a direct sum of irre¬ 
ducible representations; that is, the corresponding CG-module is semisimple. 


Example 8.118. 

A representation a is linear if degree(cr) = 1. The trivial representation of any group G 
is linear, for the principal module Vo(C) is one-dimensional. If G = S „, then sgn: G -» 
{±1} is also a linear representation. 

Every linear representation is irreducible, for the corresponding CG-module must be 
simple; after all, every submodule is a subspace, and {0} and V are the only subspaces of a 
one-dimensional vector space V. It follows that the trivial representation of any group G 
is irreducible, as is the representation sgn of S„. < 
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Recall the proof of the Wedderburn-Artin theorem: There are pairwise nonisomorphic 
minimal left ideals L \,..., L r in CG and C G = B\ ® ■ • • ® B r , where B, is generated 
by all minimal left ideals isomorphic to L,. Now B, St Mat„ ; .(C), by Corollary 8.65. 
But all minimal left ideals in Mat„ ; (C) are isomorphic, by Lemma 8.61(ii), so that L, = 
COL(l) = C"' (see Example 8.30). Therefore, 

B, = End (Li), 

where we have abbreviated Endc(L,) to End(L,). 


Proposition 8.119. 

(i) For each minimal left ideal L; in CG, there is an irreducible representation A: G 
GL {Li), given by left multiplication: 

t-i(g ): uj i-> gut, 


where g e G and ui e L,-; moreover, degree(A,) = n, = dim(L, ). 

(ii) The representation A.,- extends to a C-algebra map A/: CG —*■ CG if we define 


Xi(g)Uj 


gu, if j = i 

0 if j f i 


( 2 ) 


for g e G and u j e If. 


Proof, (i) Since L, is a left ideal in CG, each g e G acts on Li by left multiplication, and 
so the corresponding representation A, of G is as stated; it is an irreducible representation 
because minimal left ideals are simple modules. 

(ii) If we regard CG and End(L;) as vector spaces over C, then A, extends to a linear 
transformation A;: CG -> End(L, ) (because the elements of G are a basis of CG): 

g g 

Let us show that A,: CG -> End( L,) is actually a C-algebra map. If m, e Li and g, h e G, 
then 

l-i(gh): Uj (gh)uj, 

while 

Xi(g)Xi(h): Ui huj g{hui)\ 

these are the same, by associativity. 

At the moment, A i(g)ut is defined only for Uj e Bj = End(L,). For each g e G, 
we now extend the map A , (g) to CG = B\ © • • • © B r by defining A i(g)u j = 0, where 
uj € Bj = End(L/) and j f i. The extended map A , (g) (we keep the same notation even 
though its target has been enlarged from If to CG) is also a CG-algebra map. If j f i, then 
UjU j e BjB j — {0}, so that A j(g)(UjU j) = 0; on the other hand, (Xi(g)uj)(Xj(g)uj) — 0, 
by definition. • 
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It is natural to call two representations equivalent if their corresponding modules are 
isomorphic. 

Definition. If G is a group and o. r: G —»■ GL (n, C) are representations, then er and r 
are equivalent , denoted by a ~ r, if there is a nonsingular n x n matrix P that intertwines 
them; that is, 

Po(g)P~ l = r(g) 

for every g e G. 

Of course, this definition comes from Corollary 8.39, which says that the CG-modules 
(C") <T and (C") T are isomorphic as CG-modules if and only if a ~ x. 

Corollary 8.120. 

(i) Every irreducible representation of a finite group G is equivalent to one of the rep¬ 
resentations Xj given in Proposition 8.119(i). 

(ii) Every irreducible representation of a finite abelian group is linear. 

(iii) If a : G —v GL( V) is a representation of a finite group G, then u(g) is similar to a 
diagonal matrix for each g e G. 

Proof, (i) If a : G —> GL(V) is an irreducible representation er, then the corresponding 
CG-module V a is a simple module. Therefore, V a = L, , for some i, by Proposition 8.54. 
But Li = V Xi , so that V a = V x ' and a ~ A.,-. 

(ii) Since G is abelian, CG = If is commutative, and so all n, = 1. But n, = 
degree (A.,). 

(iii) If a' — CT|(g), then ofg) — cr(g). Now a' is a representation of the abelian group 
(g), and so part (ii) implies that the module V^"' is a direct sum of one-dimensional sub- 
modules. If V<£> = (i>i) © • • • © (v m ), then the matrix of cr(g) with respect to the basis 
vi,..., v m is diagonal. • 


Example 8.121. 

(i) The Wedderburn-Artin theorem can be restated to say that every representation r : G —» 

GL(V) is completely reducible: r = cri + • • • + cpt, where each rx / is irreducible; moreover, 
the multiplicity of each Oj is uniquely determined by r. Since each Oj is equivalent to 
some Xi, we usually collect terms and write r ~ : niiXj, where the multiplicities m, are 

nonnegative integers. 

(ii) The regular representation p : G — > CG is important because every irreducible repre¬ 
sentation is a summand of it. Now p is equivalent to the sum 

p ~ n\X\ H-h n r X r , 

where «, is the degree of X, [remember that CG = If. where If = End(L,) = 
Mat„ ; (C); as a CG-module, the simple module L, can be viewed as the first columns of 
Hi x ii/ matrices, and so If is a direct sum of rij copies of /.,]. < 
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Recall that the trace of an n x n matrix A — [a, 7 ] with entries in a commutative ring k 
is the sum of the diagonal entries: tr(A) = 1 a a- 

When k is a field, then tr(A) turns out to be the sum of the eigenvalues of A (we will 
assume this result now, but it is more convenient for us to prove it in the next chapter). 
Here are two other elementary facts about the trace that we will prove now. 

Proposition 8.122. 

(i) If A — [aij] and B — [bij] are n x n matrices with entries in a commutative ring k, 
then 

tr (A + B) — tr(A) + tr(B) and tr(Afi) = tr(RA). 

(ii) If B = PAP~ l , then tr (B) — tr (A). 

Proof (i) The additivity of trace follows from the diagonal entries of A+B being an+bn. 
If (AB)a denotes the ii entry of AB, then 

(AB)n = ^ aijb Jt . 


and so 

tr(AB) = AB)u = 'Y^aijbji. 

‘ i,j 

Similarly, 

tr (BA) = Y^bjiaij. 

j,i 

The entries commute because they lie in the commutative ring k, and so atjbji = h^ajj 
for all i, j. It follows that tr (AB) — tr(BA), as desired. 

(ii) 

tr (B) = tr((PA)P _1 ) = tr(P _1 (PA)) = tr(A). • 

It follows from (ii) that we can define the trace of a linear transformation T: V 
where V is a vector space over a field k, as the trace of any matrix arising from it 
and B are matrices of T, determined by two choices of bases of V, then B = PAP 
some nonsingular matrix P , and so tr( B) = tr(A). 

Definition. If a : G —> GL(V) is a representation, then its character is the function 
Xcr : G —> (C defined by 

Xcrig) = tr(cr(g)); 

we call Xcr the character afforded by a. An irreducible character is a character afforded 
by an irreducible representation. The degree of Xa is defined to be the degree of ct; that is, 

degree(xo-) = degree(a) = dim(V). 


-* V, 
: If A 
1 for 
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Example 8.123. 

(i) The character 6 afforded by a linear representation (see Example 8.118) is called a 
linear character, that is, 0 = x<r- where degree(a) = 1. Since every linear representation 
is simple, every linear character is irreducible. 

(ii) The representation a, : G —>■ GL(L,) [see Proposition 8.119(i)] is irreducible. Thus, 
the character 

Xi = Xii 

afforded by A.,- is irreducible. 

(iii) In light of Proposition 8.119(ii), it makes sense to speak of /, (m) for every u e CG. 
Of course, Xi( u j ) = 0 for all uj e End(Ly) when j # i, so that 


Xi ( u j ) 


\tr(liiuj)) if j — i 
) 0 if./#/. 


(iv) If o : G -> GL( V ) is any representation, then XaO ) — n, where n is the degree of a. 
After all, er(l) is the identity matrix, and its trace is n = dim(V). 

(v) Let o : G —> Sx be a homomorphism; as in Example 8.117, we may regard a as a 
representation on V, where V is the vector space over C with basis X. For every g e G, 
the matrix G(g) is a permutation matrix, and its xth diagonal entry is 1 if G(g)x = x; 
otherwise, it is 0. Thus, 

Xa(g) = tr(er(g)) = Fix(cr(g)), 

the number of x e X fixed by a(g). In other words, if A is a G-set, then we may view 
each g e G as acting on X, and the number of fixed points of the action of g is a character 
value (see Example 8.144 for a related discussion). ■* 


Characters are compatible with addition of representations: If g : G —»• GL(V) and 
r: G GL(W), then g + r : G -> GL(V ® W), and 


tr((<r + x )(g)) = tr 


(\ °(g) 

U 0 



= tr(cr(g)) + tr(r(g)). 


Therefore, 


Xcr + T — Xa + Xr ■ 


If g and r are equivalent representations, then 


tr(<x(g)) = tr(Pa(g)P ') = tr(r(g)) 


for all g e G; that is, they have the same characters: /„ = y T . It follows that if a: G —» 
GL(V) is a representation, then its character x<? can be computed relative to any convenient 
basis of V. 
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Proposition 8.124. 

(i) Every character Xa is an N -linear combination of the irreducible characters Xi = 
Xx t afforded by a, : G GL(L,): there are integers mi > 0 with 

Xn = 

i 


(ii) Equivalent representations have the same character. 

(iii) The only irreducible characters of G are xi, ■ ■. , Xr- 

Proof (i) The character /n arises from a representation a of G, which, in turn, arises 
from a CG-module V. But V is a semisimple module (because C G is a semisimple ring), 
and so V is a direct sum of simple modules: V = / Sj- By Proposition 8.54, each 

Sj = Li for some minimal left ideal L, . If, for each i, we let m, > 0 be the number of S; 
isomorphic to L ( , then y n - = m/Xi- 

(ii) This follows from part (ii) of Proposition 8.122 and Corollary 8.120(i). 

(iii) This follows from part (ii) and Corollary 8.120(i). • 

As a consequence of the proposition, we call Xt> ■ ■ • > Xr the irreducible characters 
of G. 


Example 8.125. 

(i) The (linear) character y \ afforded by the Uivial representation o : G -> C with rr(g) = 
1 for all g e G is called the trivial character. Thus, y. \ ( g) = 1 for all g e G. 

(ii) Let us compute the regular character \jr = Xp afforded by the regular representation 
p: G -* GL(CG), where p(g ): u gu for all g e G and u e CG. Any basis of CG 
can be used for this computation; we choose the usual basis comprised of the elements of 
G. If g — 1, then Example 8.123(iv) shows that f(\) = dim(CG) = |G|. On the other 
hand, if g f 1, then for all h e G, we have gh a basis element distinct from h. Therefore, 
the matrix of pig) has 0’s on the diagonal, and so its trace is 0. Thus, 


fig) = 



if.? #1 
if ? = 1 • 


We have already proved that equivalent representations have the same character. The 
coming discussion will give the converse: If two representations have the same character, 
then they are equivalent. 

Definition. A function tp: G -» C is a class function if it is constant on conjugacy 
classes; that is, if h = xgx , then (p(h) — (pig). 
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Every character y n afforded by a representation o is a class function: If h = xgx 
then 

a(h) = er(xgx~ l ) — o (x)o (g)o {x)~ l , 
and so tr(er(/i)) = tr(er(g)); that is, 


la (h) = Xo(g)- 

Not every class function is a character. For example, if / is a character, then — y is a 
class function; it is not a character because — / (1) is negative, and so it cannot be a degree. 

Definition. We denote the set of all class functions G —> C by cf(G): 

cf(G) — {ep: G -> C ; <p(g) — <p(xgx~ l ) for all x, g e Gj. 

It is easy to see that cf(G) is a vector space over C. 

An element u = ^ ?sG c g8 e CG is an /;-tuple (c g ) of complex numbers; that is, u is a 
function u : G -> C with u(g) = c g for all g e G. From this viewpoint, we see that cf(G) 
is a subring of CG. Note that a class function is a class sum; therefore, Femma 8.68 says 
that cf(G) is the center Z(CG), and so 

dim(cf(G)) = r, 

where r is the number of conjugacy classes in G (see Theorem 8.69). 

Definition. Write CG = B\ ® • • • ® B r , where B, A End(L ( ), and let eg denote the 
identity element of B \; hence, 

1 = e\ H- e r , 

where 1 is the identity element of CG. The elements eg are called the idempotents in CG. 
Not only is each eg an idempotent, that is, eg = eg, but it is easy to see that 

Ci ej — 8j j Cj , 

where 8g is the Kronecker delta. 

Lemma 8.126. The irreducible characters y \ . y r form a basis o/cf(G). 

Proof. We have just seen that dim(cf(G)) = r, and so it suffices to prove that y \..... y r 
is a linearly independent list, by Corollary 3.89(ii). We have already noted that y, ( uj ) = 0 
for all j f ;; in particular, Xi(ef) — 0. On the other hand, y, (eg ) = u /, where ig is the 
degree of y A , for it is the trace of the /;, x ig identity matrix. 

Suppose now that CiXi = 0. It follows, for all j. that 

0 = (J2 c ‘X,)(ej) = CjXj(ef) = cjnj. 


Therefore, all cj — 0, as desired. • 
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Theorem 8.127. Two representations of a finite group G are equivalent if and only if they 
afford the same character: Xa — Xr- 

Proof We have already proved necessity, in Proposition 8.124(h). For sufficiency. Propo¬ 
sition 8.124(ii) says that every representation is completely reducible: There are nonneg¬ 
ative integers m, and with a ~ mf/.j and r ~ By hypothesis, the corre¬ 

sponding characters coincide: 

I>iXi = X<r = Xr = tiXi- 
i i 

As the irreducible characters xi, • • • > Xr are a basis of cf(G), m; = £,■ for all i, and so 
a ~ r. • 

There are relations between the irreducible characters that facilitate their calculation. 
We begin by finding the expression of the idempotents e, in terms of the basis G of CG. 
By Example 8.123(iv), X/(l) — the degree of a,-. On the other hand, by Eq. (2) in 
Proposition 8.119, we have Xi( e j) — 0 if j f=i, so that 

m = Xi( 1) = = Xii^i). (3) 

j 

We also observe, for all y e G, that 

Xi(e/y) = Xi(y), (4) 

for y = J2j ejy, and so Xi(y) = J2j Xi(ejy) = Xi(.e,y), because ejy e Bj. 

Proposition 8.128. Ifei = J2 g eG aigg, where a, g e C, then 

_ rtiXAgf^) 

Uig ~ |G| 

Proof Let i/r be the regular character; that is, f is the character afforded by the regular 
representation. Now ejg~ l — anfig~ l , so that 

fi(eig~ l ) - 1V^ajhfihg -1 ). 
heG 

By Example 8.125(ii), x/r (1) = |G| when h — g and fi{hg~ l ) — 0 when h g. Therefore, 

_ fi(ejg~ l ) 

U>g ~ \G\ 

On the other hand, since i// = , njXj, we have 

fi(eig~ l ) = ^njXjietg _1 ) = n/x/Oig -1 ), 
j 

by Eq. (2) in Proposition 8.119. But Xi( e i8~ l ) — XiQ> -1 )> by Eq. (4). Therefore, 
aig = niXi(g~ l )/\G\. • 

It is now convenient to equip cf(G) with an inner product. 
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Definition. If a, p e cf(G), define 

(a. P) = 77777 ^«(g)P(g), 

11 geG 

where c denotes the complex conjugate of a complex number c. 

It is easy to see that we have defined an inner product; 10 that is, for all c\, C 2 e C, 

(i) (cioi + c 2 a 2 , P) = c\(ai,P) + c 2 (a 2 , P); 

(ii) (P, a) = JoPJf). 

Note that (a, a) is real, by (ii), and the inner product is definite ; that is, (a, a) > 0 if a ^ 0. 

Theorem 8.129. With respect to the inner product just defined, the irreducible characters 
Xi, ..., Xr form an orthonormal basis', that is, 

(Xi> Xj) = Sij. 

Proof. By Proposition 8.128, we have 

1 x ^ 

e J = \q\ l^ n jXj(8 )§■ 

Hence, 

1 x . 

Xi ( ej)/n j = —- S xj (g )Xi (g) 

g 

1 \ - - 

= 7^7 Xl(g)Xj(g) 

g 

= (Xi, Xj)', 

the next to last equation follows from Exercise 8.56(ii) on page 632, for Xj is a character 
(not merely a class function), and so Xj(g~> — Xj(g)- The result now follows, for 
Xi (e j)/n j = Sij, by Eqs. (2) and (3). • 

The inner product on cf(G) can be used to check irreducibility. 

Definition. A generalized character tp on a finite group G is a linear combination 

<p = j: m rxi> 

i 

where xi, ■ ■ ■, Xr are the irreducible characters of G and the coefficients m; e Z. 

If 9 is a character, then 6 — mtXi, where all the coefficients are nonnegative inte¬ 
gers, by Proposition 8.124. 

10 This inner product is not a bilinear form because we have (ft. a) = (a. ft), not (ft, a) = (a, ft). Such a 
function is often called a Hermitian form or a sesquilinearform (sesqui means “one and a half”). 
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Corollary 8.130. A generalized character 9 of a group G is an irreducible character if 
and only if 6(1) > 0 and 

(e,e) = i. 

Proof If 9 is an irreducible character, then 9 = Xi f° r some i, and so (9, 9) = (Xi • Xi) = 
1. Moreover, 0(1) = deg(/, ) > 0. 

Conversely, let 9 = ^ j m jXj , where mj e Z, and suppose that (9,9) = 1. Then 
1 = ff j my, hence, some mj — 1 and all other mj — 0. Therefore, 9 = ±x;, and so 
0(1) = ±Xj( 1). Since //(l) = deg(x/) > 0, the hypothesis 0(1) > 0 gives m; = 1. 
Therefore, 9 — Xi^ and so 0 is an irreducible character. • 

Let us assemble the notation we will use from now on. 

Notation. If G is a finite group, we denote its conjugacy classes by 

C l ,...,C r , 

a choice of elements, one from each conjugacy class, by 

gi e Ci, ...,g r € C r , 

its irreducible characters by 

Xl> 

their degrees by 

ni = xt(l), • • • - n r = Xr( 1), 
and the sizes of the conjugacy classes by 

hi = \C\\,..., h r = \C T \. 


The matrix [/a (gj)] is a useful way to display information. 


Definition. The character table of G is the rxr complex matrix whose i j entry is Xi (g j ) • 

We always assume that C\ — {1} and that xi is the trivial character. Thus, the first 
row consists of all l’s, while the first column consists of the degrees of the characters: 
X; (1) = Hi for all i, by Example 8.123(iv). The ith row of the character table consists of 
the values 

Xid). Xi(gl) . Xi(gr)- 

There is no obvious way of labeling the other conjugacy classes (or the other irreducible 
characters), so that a finite group G has many character tables. Nevertheless, we usually 
speak of “the” character table of G. 

Since the inner product on cf(G) is summed over all g e G, not just the chosen gj (one 
from each conjugacy class), it can be rewritten as a “weighted” inner product: 


1 x - - 

(X/. X./) = ryy 2^, hk ^'^k)Xj(gk)- 


k= 1 


Theorem 8.129 says that the weighted inner product of distinct rows in the character table 
is 0, while the weighted inner product of any row with itself is 1. 
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Example 8.131. 

A character table can have complex entries. For example, it is easy to see that the character 
table for a cyclic group G = (x) of order 3 is given in Table 8.1, where to = e 2n '^ is a 
primitive cube root of unity. 


gi 

1 

X 


hi 

1 

1 

1 

XI 

1 

1 

1 

X2 

1 

CO 

CO 2 

X3 

1 

CO 2 

CO 


Table 8.1. Character Table of I 3 ◄ 


Example 8.132. 

Write the four-group in additive notation: 

Y = { 0 , a, b, a + b}. 

As a vector space over F 2 , V has basis a, b. and the “coordinate functions” on Y, which 
take values in {1, — 1} C C, are linear, hence irreducible, representations. For example, the 
character y _2 arising from the function that is nontrivial on a and trivial on b is 

1—1 if u = a or v = a + b 

/2(v) = 1 1 -r o , 

1 n i> = 0 or v — b. 

Table 8.2 is the character table. 


gi 

0 

a 

b 

a + b 

hi 

1 

1 

1 

1 

XI 

1 

1 

1 

1 

X2 

1 

-1 

1 

-1 

X3 

1 

1 

-1 

-1 

X4 

1 

-1 

-1 

1 


Table 8.2. Character Table of V ◄ 


Example 8.133. 

Table 8.3 on page 618 is the character table for the symmetric group G = S 3 . Since two 
permutations in S n are conjugate if and only if they have the same cycle structure, there 
are three conjugacy classes, and we choose elements 1, (1 2), and (12 3) from each. In 
Example 8.7l(i), we saw that there are three irreducible representations: ).\ = the trivial 
representation, a 2 = sgn, and a third representation A 3 of degree 2. We now give the 
character table, after which we discuss its entries. 





618 


Algebras Ch. 8 


gi 

hi 

1 

1 

( 12 ) 

3 

(12 3) 

2 

Xl 

1 

1 

1 

X2 

1 

-1 

1 

X3 

2 

0 

-1 


Table 8.3. Character Table of S 3 

We have already discussed the first row and column of any character table. Since xi = 
sgn, the second row records the fact that 1 and (1 2 3) are even while (1 2) is odd. The 
third row has entries 

2 a b, 

where a and b are to be found. The weighted inner products of row 3 with the other two 
rows gives the equations 


2 + 3a + 2b — 0 
2 — 3a + 2b — 0. 


It follows easily that a — 0 and b = — 1. ◄ 

The following lemma will be used to describe the inner products of the columns of the 
character table. 


Lemma 8.134. If A is the character table of a finite group G, then A is nonsingular and 
its inverse A~ l has ij entry 


(.A~ l )ij 


hiXjigi) 

\G\ 


Proof If B is the matrix whose i j entry is displayed in the statement, then 


1 x - 

(AB)ij = — > Xfgk)hkXj(gk) 

|G| k 

1 x . _ 

= 7^7 2_^Xi(g)Xj(g) 
g 

= (Xi. Xj) 


because hkXj(gk) = EveC* Xj(y)- Therefore, AB = I. • 
The next result is fundamental. 


Theorem 8.135 (Orthogonality Relations). Let G be a finite group of order n with con- 
jugacy classes C\,... ,C r of cardinalities hi, ... ,h r , respectively, and choose elements 
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gi G Ci. Let the irreducible characters of G be xi, ■ ■ •, Xr, and let Xi have degree iij. 
Then the following relations hold: 


(i) 


(ii) 


^2h k Xi(gk)Xj(gk) 

k= 1 


( 0 if I f j- 

(|G| ifi = j- 


^2xi(gk)Xi(gt) 
i — 1 


| 0 ifk + l\ 

j \G\/h k ifk = l. 


Proof, (i) This is just a restatement of Theorem 8.129. 

(ii) If A is the character table of G and B = [h, /j (gi )/1 G |], we proved, in Lemma 8.134, 
that AB — I. It follows that BA = /; that is, (BA) k e = &kt- Therefore, 

1 \ - - 

z^ hkXi( ' 8k ' )Xi ^ 8 ^ = Skl ’ 

and this is the second orthogonality relation. • 


In terms of the character table, the second orthogonality relation says that the usual 
(unweighted, but with complex conjugation) inner product of distinct columns is 0 while, 
for every k, the usual inner product of column k with itself is G | / h k . 

The orthogonality relations yield the following special cases. 


Corollary 8.136. 

(i) \G\ = EUnj 
© £'■=i niXiigk) = 0 if k > 1 

(iii) EL l h kXi(gk ) = 0 if i > 1 

(iv) J2k=i h k\Xi(gk)\ 2 = |G| 


Proof, (i) This equation records the inner product of column 1 with itself: It is Theo¬ 
rem 8.135(h) when k — l — 1. 

(ii) This is the special case of Theorem 8.135(h) with 1=1, for /,-(l) = n ,. 

(iii) This is the special case of Theorem 8.135(i) in which j = 1. 

(iv) This is the special case of Theorem 8.135(i) in which j — i. • 

We can now give another proof of Burnside’s lemma. Theorem 2.113, which counts the 
number of orbits of a G-set. 
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Theorem 8.137 (Burnside’s Lemma). Let G be a finite group and let X be a finite 
G-set. If N is the number of orbits of X, then 

N = ] ^E Fix ^’ 

' ' geG 

where Fix(g) is the number of x e X with gx = x. 

Proof. Let V be the complex vector space having X as a basis. As in Example 8.117, 
the G-set X gives a representation o: G GL(V) by er (g)(x) — gx for all g e G and 
x e X\ moreover, if Xa is the character afforded by a, then Example 8.123(v) shows that 
X<j(g) = Fix(g). 

Let Oi,, On be the orbits of X. We begin by showing that N — dim(V G ), where 
V G is the space of fixed points: 

V G = {v e V : gv = v for all g e G}. 

For each i. define .v, to be the sum of all the x in 0,; it suffices to prove that these elements 
form a basis of V G . It is plain that s\,... ,sn is a linearly independent list in V G , and 
it remains to prove that they span V G . If u e V G , then u = fc;x c x x, so that gu — 
c x(8 x )- Since gu — u, however, c x = c gx . Thus, given x e X with x e O /, each 
coefficient of gx, where g e G, is equal to c x : that is, all the x lying in the orbit 0/ have 
the same coefficient, say, cj , and so u — ff j CjSj. Therefore, 

N = dim(V G ). 

Now define a linear transformation T: V —»• V by 

t — — y cr(g). 

\G\ 

geG 

It is routine to check that T is a CG-map, that T\ (V G ) = identity, and that im 7’ = V G . 
Since CG is semisimple, V — V G © W for some submodule W. We claim that T\W — 0. 
If w e W. then a (g)(w) e W for all g e G, because W is a submodule, and so T(w) e W. 
On the other hand, T ( w ) e im T — V G , and so T ( w) e V G IT W — {0}, as claimed. 

If uq,..., Wi is a basis of W. then s\,..., sn, itq,..., is a basis of V — V G © W. 
Note that T fixes each .v, and annihilates each Wj. Since trace preserves sums, 

tr(r) = j^| Zl tr(cr( ^ )) 

geG 

1 \ - 

= ]G\2^Xa(g) 
geG 

= — VFix(g). 

\G\ 
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It follows that 

tr(T) = dim(V G ), 


for the matrix of T with respect to the chosen basis is the direct sum of an identity block 
and a zero block, and so tr(T) is the size of the identity block, namely, dim(V G ) = N. 
Therefore, 


N = |^E Fix ^- 




Character tables can be used to detect normal subgroups. 

Definition. If Xz is the character afforded by a representation r : G —> GL(V ), then 

ker Xz — ker r. 


Proposition 8.138. Let 9 = Xz be the character of a finite group G afforded by a repre¬ 
sentation x : G -* GL(V). 

(i) For each g e G, we have 

|0fe)| < 0(1). 

(ii) 

kerf) = {geG:0 (g) = 0(1)}. 

(iii) If 9 — y . , ni jXj, where mj are positive integers, then 

kerf) = n ker Xj- 


(iv) If N is a normal subgroup of G, then there are irreducible characters Xi\> ■ ■ ■> Xi s 
with N = n,-ii kerxi r 


Proof, (i) By Lagrange’s theorem, gl G l = 1 for every g e G; it follows that the eigenval¬ 
ues £i,..., Sd of r(g), where d = 9(1), are |G|th roots of unity, and so |ey | = 1 for all j. 
By the triangle inequality in C, 


|0te)| = | J2 £ j 

j=1 


< d = 0(1). 


(ii) If g e ker 9 — kerr, then r(g) = I, the identity matrix, and |0(g)| = tr(7) = 0(1). 
Conversely, suppose that 9(g) = 0(1) = d\ that is, | J2j=i s j I = d. By Proposition 1.42, 
all the eigenvalues ej are equal, say, Sj = co for all j . Therefore, x(g) — col , by Corol¬ 
lary 8.120(iii), and so 


9(g) = 9(l)oo. 
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But 9(g) = 0(1), by hypothesis, and so co = 1; that is, rig) — I and g e ker r. 

(iii) For all g e G, we have 

0 (g) = ^2 m jXj(g); 
j 

in particular, 

0 ( 1 ) = ^rrijXji 1 ). 

j 

If g € ker0, then 9(g) = 0(1). Suppose that Xj'(g) Xj'( 1) for some j '. Since Xj'(g) 
is a sum of roots of unity. Proposition 1.42 applies to force \Xj'(g)\ < Xj'i 1), and so 
9(g) — JY mjXj(g) 7 ^ 0(1)- Therefore, g e pF kerxj- For the reverse inclusion, if 
g e ker Xj, then xj(g ) = Xj (1), and so 


0 (g) = J2 m JXj(g) = ^2 m jXjW = 0 ( 1 ); 
j j 


hence, g e ker 0 . 

(iv) It suffices to find a representation of G whose kernel is At. By part (iii) and Exam¬ 
ple 8.125(h), the regular representation p of G/N is faithful (i.e., is an injection), and so 
its kernel is {1}. If tt: G —> G/N is the natural map, then pit is a representation of G 
having kernel N. If 0 is the character afforded by pn, then 0 = / m jXj- where the mj 

are positive integers, by Lemma 8.126, and so part (iii) applies. • 


Example 8.139. 

We will construct the character table of S4 in Example 8.148. We can see there that 
ker x _2 — A 4 and kerx 3 = V are the only two normal subgroups of S4 (other than {1} 
and S4). Moreover, we can see that Y < A 4. 

In Example 8.140, we can see that ker X 2 = {1} U z (; U y G (where z G denotes the 
conjugacy class of z in G) and kerx 3 = {1} U z G U x G . Another normal subgroup occurs 
as kerx 2 FI kerx3 = {1} U z G ■ •* 

A normal subgroup described by characters is given as a union of conjugacy classes; 
this viewpoint can give another proof of the simplicity of A5. In Exercise 2.89(iv) on 
page 113, we saw that A5 has five conjugacy classes, of sizes 1, 12, 12, 15, and 20. Since 
every subgroup contains the identity element, the order of a normal subgroup of A 5 is the 
sum of some of these numbers, including 1. But it is easy to see that 1 and 60 are the only 
such sums that are divisors of 60, and so the only normal subgroups are {1} and A5 itself. 

There is a way to “lift” a representation of a quotient group to a representation of the 
group. 

Definition. Let H <\G and let cr: G/H —> GL(V ) be a representation. If 7 r: G —> G / H 
is the natural map, then the representation an: G -> GL(V) is called a lifting of a. 
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Scalar multiplication of G on a CG-module V is given, for v e V, by 

gv = ( gH)v. 

Thus, every CG-submodule of V is also a C(G/H)- submodule; hence, if V is a simple 
C(G///)-module, then it is also a simple CG-module. It follows that if a: G/H -> 
GL( V) is an irreducible representation of G/H. then its lifting an is also an irreducible 
representation of G. 

Example 8.140. 

We know that Z)g and Q are nonisomorphic nonabelian groups of order 8; we now show 
that they have the same character tables. 

If G is a nonabelian group of order 8 , then its center has order 2, say, Z(G) = (z). 
Now G/Z(G) is not cyclic, by Exercise 2.69 on page 95, and so G/Z(G) = Y. Therefore, 
if a: V —*■ C is an irreducible representation of V, then its lifting an is an irreducible 
representation of G. This gives 4 (necessarily irreducible) linear characters of G, each 
of which takes value 1 on z. As G is not abelian, there must be an irreducible character 
X 5 of degree n 5 > 1 (if all n, = 1, then CG is commutative and G is abelian). Since 
JT = 8, we see that n 5 = 2. Thus, there are five irreducible representations and, 
hence, five conjugacy classes; choose representatives gi to be 1, z, x, y, w. Table 8.4 is the 
character table. 


gi 

1 

z 

X 

y 

W 

h. 

1 

1 

2 

2 

2 

XI 

1 

1 

1 

1 

1 

X2 

1 

1 

-1 

1 

-1 

X3 

1 

1 

1 

-1 

-1 

X4 

1 

1 

-1 

-1 

1 

X5 

2 

-2 

0 

0 

0 


Table 8.4. Character Table of D% and of Q 


The values for xs are computed from the orthogonality relations of the columns. For 
example, if the last row of the character table is 

2 abed, 

then the inner product of columns 1 and 2 gives the equation 4 + 2a = 0, so that a — —2. 
The reader may verify that 0 = b — c = d. < 

The orthogonality relations help to complete a character table but, obviously, it would 
also be useful to have a supply of characters. One important class of characters consists of 
those afforded by induced representations; that is, representations of a group G determined 
by representations of a subgroup H of G. 
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The original construction of induced representations, due to F. G. Frobenius, is rather 
complicated. Tensor products make this construction more natural. The ring CG is a 
(CG, C//)-bimodule (for C H is a subring of CG), so that if V is a left C//-module, then 
the tensor product CG 0cH V is defined; Lemma 8.80 says that this tensor product is, in 
fact, a left CG-module. 

Definition. Let H be a subgroup of a group G. If V is a left C//-module, then the 
induced module is the left CG-module 

VI G = CG 0ch V. 

The corresponding representation p] G : G —> V G is called the induced representation. 
The character of G afforded by p] G is called the induced character , and it is denoted by 

Xp\ G - 

Let us recognize at the outset that the character of an induced representation need not 
restrict to the original representation of the subgroup. For example, we have seen that there 
is an irreducible character x of A 3 = I 3 having complex values, whereas every irreducible 
character of S 3 has (real) integer values. A related observation is that two elements may be 
conjugate in a group but not conjugate in a subgroup (for example, nontrivial elements in 
A 3 are conjugate in S 3 , for they have the same cycle structure, but they are not conjugate 
in the abelian group A 3 ). 

The next lemma will help us compute the character afforded by an induced representa¬ 
tion. 

Lemma 8.141. 

(i) If H < G, then CG is a free right CH-module on [G : H] generators. 

(ii) If a left CH-module V has a (vector space ) basis e\,..., e m , then a ( vector space) 
basis of the induced module Vj G = CG 0ch V is the family of all t, 0 ej, where 
t\,... ,tn is a transversal of H in G. 

Proof, (i) Since t\. .... t n is a transversal of H in G (of course, n — [G : //]), we see 
that G is the disjoint union 

G = \JtiH ; 

i 

thus, for every g e G, there is a unique i and a unique h e H with g — tjh. We claim that 
/|., is a basis of CG viewed as a right C//-module. 

If u e CG. then u — „ a g g. where a„ e C. Rewrite each term 

a gg — cigtjh = tjagh 

(scalars in C commute with everything), collect terms involving the same tp and obtain 
u = tj i]j, where rp e CH. 
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To prove uniqueness of this expression, suppose that 0 = f, ?;,, where rp e CH. 

Now rn — cijhh, where a,h € C. Substituting, 

0 = Y^aiktih. 

i,h 

But tjh — tjh' if and only if i = j and h — h', so that 0 = JT h anPih — J ZgeG a ih8 > 
where g = tih. Since the elements of G are a basis of CG (viewed as a vector space over 
C), we have ap, = 0 for all i.h, and so rji — 0 for all i. 

(ii) By Theorem 8.87, 

CG ®c h V = *iCH 0c h V. 

i 

It follows that every u e CG <S>ch V has a unique expression as a C-linear combination of 
ti < 8 > e. j , and so these elements comprise a basis. • 

We introduce the following notation. If H < G and /://-> C is a function, then 
X : G -* C is given by 

, I 0 if giH 
X(g) = \ , . .. 

[x(S') if geH. 

Theorem 8.142. If y n is the character afforded by a representation a : H —> GL( V) of 
a subgroup H of a group G, then the induced character Xa] G is given by 

Xa] G (g) = X! Xa(a~ l ga). 

1 1 aeG 

Proof. Let t\,.. ,t n be a transversal of H in G, so that G is the disjoint union G = 
U ( - tj H, and let e\, ..., e m be a (vector space) basis of V. By Lemma 8.141(h), a basis 
for the vector space V G = CG 0c h V consists of all ti 0 e j. If g e G, we compute the 
matrix of left multiplication by g relative to this basis. Note that 


where e H. and so 


gU — tk(i)hi , 


g(ti ®ej) = (gti ) 0 ej 

= tk(i)hi 0ej 

= 4(i) ® cr (hi)ej 

(the last equation holds because we can slide any element of H across the tensor sign). 
Now g(tj 0 ej) is written as a C-linear combination of all the basis elements of Vj G , for 
the coefficients t p 0 ej for p k(i) are all 0. Hence, cr] G (g) gives the nm x nm matrix 
whose m columns labeled by tj 0 ej, for fixed i, are all zero except for an m x m block 
equal to 

[ a pq{hi)] — [a pq (t p a j g t i )] ■ 
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Thus, the big matrix is partitioned into m x m blocks, most of which are 0, and a nonzero 
block is on the diagonal of the big matrix if and only if kii) = i ; that is, 

^k(i)8b — tj gU — hi £ H. 

The induced character is the trace of the big matrix, which is the sum of the traces of these 
blocks on the diagonal. Therefore, 

x<ri G (g)= 

i 

(remember that is 0 outside of H ). We now rewrite the summands (to get a formula that 
does not depend on the choice of the transversal): If t~ l gtj e //, then (t(h)~ l g(tjh) = 
gti)h in H , so that, for fixed i, 

X! X<t {(.tih)~ 1 g(t i h)) = \H\x a (tr l gtj), 

heH 

because is a class function on H. Therefore, 

Xo] G {g) = Y^Xaif^gti) 
i 

= 7^71^2 Xa((tih)~ l g(tih)) 

|H| a 

= 8 a) ■ • 

1 1 aeG 

Remark. We have been considering induced characters, but it is easy to generalize the 
discussion to induced class functions. If H < G, then a class function 9 on H has a unique 
expression as a C-linear combination of irreducible characters of H, say, 0 — a /i , and 
so we can define 

*1 G = I>iXi1 G . 

It is plain that 9] G is a class function on G, and that the formula in Theorem 8.142 extends 
to induced class functions. -4 

If, for h e //, the matrix of a(li) (with respect to the basis e \..... e m of V) is B(h), 
then define m x m matrices B(g), for all g e G, by 

0 if g i H\ 

Big ) if geH. 


Big) — 
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The proof of Theorem 8.142 allows us to picture the matrix of the induced representation 
in block form 


<y\ G (g) = 


B{t l Vi) B{t l l gt 2 ) 

B{tf l gh) B{q x gt2) 


B{t l l gt n ) 

B (L gtn) 


B(t n l gh) B(t n l gt 2 ) 


-1 , 


B(t l gt „) 


Corollary 8.143. Let H be a subgroup of a finite group G and let x be a character on 
H. 

(i) x1 G (D = [G:H]x(1). 

(ii) If H < G, then xl G (g) — 0 for all g £ H. 

Proof, (i) For all a e G, we have a -1 la = 1, so that there are |G| terms in the sum 
Xl G d) = | 77 | EflsG X(a~ { ga) that are equal to x(l); hence, 

X1 G 0)= ^xa) = [G: 

\n\ 


(ii) If H <\ G. then g f H implies a l ga f H for all a e G. Therefore, y ( a 1 ga) — 0 
for all a e G. and so x] G (g) — 0- • 


Example 8.144. 

Let H < G be a subgroup of index n, let X — {t[H, ..., t n H} be the family of left cosets 
of H , and let (p: G -> Sx be the (permutation) representation of G on the cosets of H . As 
in Example 8.123(v), we may regard <p\ G -> GL( V ), where V is the vector space over C 
having basis X ; that is, <p is a representation in the sense of this section. 

We claim that if x<p is the character afforded by <p, then x<p = f 1 G , where e is the trivial 
character on H . On the one hand. Example 8.123(v) shows that 


X<p(g) = Fix(<p(g)) 


for every g e G. 
notation) 


On the other hand, suppose (pig) is the permutation (in two-rowed 


(Pig) = 


(tiH ... t„H\ 
\gt\H ... gt n H J 


Now gtj H = tjH if and only if t j l gtj e H. Thus, e(r l gt,) f 0 if and only if gt, I! = 
tiH , and so 


e1 G (g) = FixOKg)). < 
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Even though a character X of a subgroup H is irreducible, its induced character need not 
be irreducible. For example, let G = S 3 and H be the cyclic subgroup generated by (1 2). 
The linear representation o — sgn: // —> C is irreducible, and it affords the character y n 
with 


Xa(l)=l and Xcr ((1 2)) = 1. 

Using the formula for the induced character, we find that 

Xaf 3 (l) = 3, Xaf 3 ((12)) = -1, and x-rl^Cd 2 3)) = 0. 

Corollary 8.130 shows that x CT j 53 is not irreducible, for (x^l^ 3 . Xerl^ 3 ) = 2. It is easy to 
see that x<t1 Si = X2 + X3- the latter being the nontrivial irreducible characters of S 3 . 

We must mention a result of R. Brauer. Call a subgroup £ of a finite group G elemen¬ 
tary if E = / x P, where Z is cyclic and P is a p-group for some prime p. 


Theorem (Brauer). Every complex character 6 on a finite group G has the form 

0 = £m^-1 G , 

i 


where m; € Z and the pi are linear characters on elementary subgroups ofG. 

Proof. See Curtis-Reiner, Representation Theory of Finite Groups and Associative Alge¬ 
bras, page 283. • 


Definition. If H is a subgroup of a group G, then every representation o : G —> GL(V) 
gives, by restriction, a representation a\H : H -» GL( V). (In terms of modules, every left 
CG-module V can be viewed as a left C//-module.) We call o \ H the restriction of a, and 
we denote it by crj h- The character of H afforded by n \ n is denoted by Xo\h■ 

The next result displays an interesting relation between characters on a group and char¬ 
acters on a subgroup. (Formally, it resembles the adjoint isomorphism.) 


Theorem 8.145 (Frobenius Reciprocity). Let H be a subgroup of a group G, let y be 
a class function on G, and let 9 be a class function on H. Then 

m G ,x)G = (0,xl h)h. 


where ( , )q denotes the inner product on cf(G) and ( , )n denotes the inner product 
on cf (H). 
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Proof. 


(0] G <x)g= — J2 9 ^ G ( S)x(8) 

|Lr| geG 

1 1 geG ' ' aeG 

1 1 • -i -zi- 

= T^rr^T 2^ e( - a s a )x(a ‘ ?a) ’ 


a.geG 


the last equation occurring because x is a class function. For fixed a e G , as g ranges over 
G, then so does a~ l ga. Therefore, writing x — a~ l ga, the equations continue 


1 

1 

|G| 

\H\ 

1 

1 

G 

\H\ 

1 

1 

I 

|G| 

\H\ 

1 

\H\ 

E 


a,xeG 


£(£ 0(*)x(*)) 


aeG xeG 


xeG 


= <fi,x\ H) 


H, 


the next to last equation holding because 9(x) vanishes off the subgroup H. • 

The following elementary remark facilitates the computation of induced class functions. 


Lemma 8.146. Let H be a subgroup of a finite group G, and let / be a class function on 
H. Then 

xl G (g)= tjt; J2\ c c(g;)\x(gf ' ggi)■ 

Iff I i 

Proof. Let |Cc(gi)l = If % ] giUQ — g, we claim that there are exactly m,- elements 
aeG with a~ l g,a — g. There are at least in, elements in G conjugating gi to g\ 
namely, all aao for a e Cq (gj ). There are at most m,- elements, for if b ~ 1 g,b — g, then 
b~ l gib — a^ 1 gja o, and so a^b e Cc(gj). The result now follows by collecting terms 
involving gi s in the formula for x 1 G (g) ■ • 


Example 8.147. 

Table 8.5 on page 630 is the character table of A 4 , where co — e 2jr, /3 is a primitive cube 
root of unity. 
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gi 

hi 

( 1 ) 

1 

(12 3) 

4 

(13 2 ) 

4 

(1 2)(3 4) 

3 

Xi 

1 

1 

1 

1 

X2 

1 

CO 

co 2 

1 

X3 

1 

co 2 

CO 

1 

X4 

3 

0 

0 

-1 


Table 8.5. Character Table of A 4 


The group A4 consists of the identity, eight 3-cycles, and three products of disjoint 
transpositions. In S 4 , all the 3-cycles are conjugate; if g — (12 3), then [54 ; Cs 4 (g)] = 8. 
It follows that | Cs 4 (g) | = 3, and so C^ 4 (g) = (g). Therefore, in A4, the number of 
conjugates of g is [A 4 : C A4 (g )] = 4 [we know that C A4 (g) = A 4 n Cs 4 (g) = (g)]. The 
reader may show that g and g~~ 1 are not conjugate, and so we have verified the first two 
rows of the character table. 

The rows for X 2 and /j are liftings of linear characters of A 4 /V = I3. Note that if h = 
(1 2)(3 4), then X 2 O 1 ) = X 2 (1) = 1. because Y is the kernel of the lifted representation; 
similarly,/ 3 (/z) = 1. Nowx 4 (l) = 3, because 3 + (n 4 ) 2 = 12. The bottom row arises from 
orthogonality of the columns. (We can check, using Corollary 8.130, that the character of 
degree 3 is irreducible.) < 


Example 8.148. 

Table 8.6 is the character table of S 4 . 


gi 

hi 

( 1 ) 

1 

( 12 ) 

6 

(12 3) 

8 

(1 2 3 4) 

6 

(1 2)(3 4) 

3 

Xi 

1 

1 

1 

1 

1 

X2 

1 

-1 

1 

-1 

1 

X3 

2 

0 

-1 

0 

2 

X4 

3 

1 

0 

-1 

-1 

X5 

3 

-1 

0 

1 

-1 


Table 8 . 6 . Character Table of ,S ' 4 


We know, for all n, that two permutations in S n are conjugate if and only if they have 
the same cycle structure; the sizes of the conjugacy classes in ,S ' 4 were computed in Exam¬ 
ple 2.5(i). 

The rows for X 2 and X 3 are liftings of irreducible characters of S 4 /V = S 3 . The en¬ 
tries in the fourth column of these rows arise from (1 2)V =(1 2 3 4)V; the entries 
in the last column of these rows arise from V being the kernel (in either case), so that 
X;'((l 2)(3 4)) = Xy (l) for j — 2, 3. 

We complete the first column using 24 = 1 + 1 + 4 + thus, n 4 = 3 = 115 . 

Let us see whether X 4 is an induced character; if it is, then Corollary 8.143(i) shows that 
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it arises from a linear character of a subgroup H of index 3. Such a subgroup has order 8, 
and so it is a Sylow 2-subgroup; that is, H = I)a- Let us choose one such subgroup: Let 

H = (V, (1 3)> = V U {(1 3), (2 4), (1 2 3 4), (1 4 3 2)}. 

The conjugacy classes are 


Ci = {1}; 

C 2 = {(1 3)(2 4)}; 

C 3 = {(1 2)(3 4), (1 4)(2 3)}; 

C 4 = {(1 3), (2 4)}; 

C 5 = {(1 2 3 4),(1 4 3 2)}. 

Let 6 be the character on H defined by 

Ci C 2 C 3 C4 C5 

11-11 -1. 

Define x 4 — 0] 54 . Using the formula for induced characters, assisted by Lemma 8.146, 
we obtain the fourth row of the character table. However, before going on to row 5, we 
observe that Corollary 8.130 shows that /4 is irreducible, for (x 4 , / 4 ) = 1. Finally, the 
orthogonality relations allows us to compute row 5. ◄ 

At this point in the story, we must introduce algebraic integers. Since G is a finite group, 
Lagrange’s theorem gives g = 1 for all g e G. It follows that if a: G —>• GL(V ) is 
a representation, then < 7 (,g)! G = I for all g; hence, all the eigenvalues of n(g ) are |G|th 
roots of unity, and so all the eigenvalues are algebraic integers. By Proposition 7.24, the 
trace of rr(g). being the sum of the eigenvalues, is also an algebraic integer. 

We can now prove the following interesting result. 

Theorem 8.149. The degrees n ( - of the irreducible characters of a finite group G are 
divisors of | G |. 

Proof. By Corollary 3.44, the rational number a — |G|/n; is an integer if it is also an 
algebraic integer. Now Corollary 8.10(ii) says that a is an algebraic integer if there is 
a faithful Z[a]-module M that is a finitely generated abelian group, where Z[a] is the 
smallest subring of C containing a. 

By Proposition 8.128, we have 
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Hence, ac, = ^geG Xi (g 1 )g. But e, is an idempotent: ef = e,, and so 

ae i = X! X<(g _1 )ge,-. 
geG 

Define M to be the abelian subgroup of CG generated by all elements of the form £ge,-, 
where £ is a |G|th root of unity and g e G; of course, M is a finitely generated abelian 
group. 

To see that M is a Z[«]-module, it suffices to show that aM C M. But 

a<;ge, = t;gaei 

= Sg ^2 Xi(h~ l )het 
heG 

= X! Xi(h~ l )^ghei. 
heG 

This last element lies in M, however, because Xi(h~) is a sum of |G|th roots of unity. 

Finally, if ft e C and u e CG, then flu — 0 if and only if f — 0 or u — 0. Since Z[a] C 
C and M C CG, however, it follows that M is a faithful Z[a]-module, as desired. • 

We will present two important applications of character theory in the next section; for 
other applications, as well as a more serious study of representations, the interested reader 
should look at the books of Curtis-Reiner, Feit, Huppert, and Isaacs. Representation the¬ 
ory was an essential ingredient of the proof of the classification of the finite simple groups 
in the 1980s: There are several infinite families and 26 sporadic groups belonging to no in¬ 
finite family (see the chapter by R. Carter in the book edited by Kostrikin and Shafarevich, 
as well as Gorenstein-Lyons-Solomon, The Classification of the Finite Simple Groups ). 
The ATLAS, by Conway et al, contains the character tables of every simple group of order 
under 10 25 as well as the character tables of all the sporadic groups. The largest sporadic 
simple group is called the Monster ; it has order 

2 46 • 3 20 • 5 9 • 7 6 • ll 2 • 13 3 • 17- 19-23 -29 ■ 31 -41 -47 -59 -71. 

Exercises 

8.55 Prove that if 6 is a generalized character of a finite group G, then there are characters / and 

with 9 = / — fi. 

8.56 (i) Prove that if z is a complex root of unity, then z~ l =z. 

(ii) Prove that if G is a finite group and ct : G —> GL(T) is a representation, then 

= Xcr (g) 

for all geG. 

Hint. Use the fact that every eigenvalue of o ( g ) is a root of unity, as well as the fact 
that if A is a nonsingular matrix over a field k and if «j, ..., u n are the eigenvalues of A 
(with multiplicities), then the eigenvalues of A~ l are uj , ..., uf , that is, uf, ..., ITT". 
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8.57 If o': G —> GL(;z, C) is a representation, its contragredient a* : G —»■ GL(n, C) is the func¬ 
tion given by 

o*(g) = ff(g _1 ) f , 

where 1 denotes transpose. 

(i) Prove that the contragredient of a representation o' is a representation that is irreducible 
when o is irreducible. 

(ii) Prove that the character Xa* afforded by the contragredient a* is 

Xa*(g ) = Xa(g), 

where Xa ( g ) is the complex conjugate. Conclude that if x is a character of G, then ~x is 
also a character. 

8.58 Construct an irreducible representation of S 3 of degree 2. 

8.59 (i) If g e G, where G is a finite group, prove that g is conjugate to g _1 if and only if / (g) 

is real for every character x of G. 

(ii) Prove that every character of S„ is real valued. (It is a theorem of F. G. Frobenius that 
every character of S n is integer valued.) 

8.60 (i) If G is a Unite abelian group, define its character group G* by 

G* = Hom(G,C x ), 

where C x is the multiplicative group of nonzero complex numbers. Prove that G* = G. 
Hint. Use the fundamental theorem of finite abelian groups. 

(ii) Prove that Hom(G, C x ) = Hom(G, Q/Z) when G is a finite abelian group. 

(iii) Prove that every irreducible character of a finite abelian group is linear. 

8.61 Prove that the only linear character of a simple group is the trivial character. Conclude that if 
X; is not the trivial character, then tij = x/(l) > 1 - 

8.62 Let 8 = Xa be the character afforded by a representation o' of a finite group G. 

(i) If g e G, prove that |0(g)[ = 0(1) if and only if o'(g) is a scalar matrix. 

Hint. Use Proposition 1.42 on page 23. 

(ii) If 9 is an irreducible character, prove that 

Z(G/ker0) = {geG:|0(g)|=0(l)}. 

8.63 If G is a finite group, prove that the number of its (necessarily irreducible) linear representa¬ 
tions is [G : G']. 

8.64 Let G be a finite group. 

(i) If g e G, show that |C G (g)| = IXj'(g)| 2 - Conclude that the character table of G 

gives |C G (g)|. 

(ii) Show how to use the character table of G to see whether G is abelian. 

(iii) Show how to use the character table of G to find the lattice of normal subgroups of G 
and their orders. 

(iv) If G is a finite group, show how to use its character table to find the commutator sub¬ 
group G'. 

Hint. If K <1 G, then the character table of G/K is a submatrix of the character table 
of G, and so we can find the abelian quotient of G having largest order. 
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(v) Show how to use the character table of a finite group G to determine whether G is 
solvable. 

8.65 (i) Show how to use the character table of G to find | Z(G) \. 

(ii) Show how to use the character table of a finite group G to determine whether G is 
nilpotent. 

8.66 Recall that the group Q of quaternions has the presentation 

Q = (A, B\A 4 = 1, A 2 = B 2 , BAB~ l = A -1 ). 


(i) Show that there is a representation o : Q —>■ GL(2, C) with 



"/ 0" 


' 0 f 

A 

and B 


0 —i 


-1 0 


(ii) Prove that a is an irreducible representation. 

8.67 (i) If a : G —*■ GL(V) and r : G —*■ GL(W) are representations, prove that 

ff® r: G-» GL(V <8> WO, 

defined by 

(o' 0 r)(g) = o(g) 8) r(g) 

is a representation. 

(ii) Prove that the character afforded by a ® r is the pointwise product: 

XaXr- g !-> tr(ff(g))tr(T(g)). 

(iii) Prove that cf(G) is a commutative ring (usually called the Burnside ring of G). 


8.6 Theorems of Burnside and of Frobenius 


Character theory will be used in this section to prove two important results in group the¬ 
ory: Burnside’s p m q" theorem and a theorem of Frobenius. We begin with the following 
variation of Schur’s lemma. 

Proposition 8.150. If a : G —> GL( V ) is an irreducible representation and if a linear 
transformation tp : V —>■ V satisfies 


(pa(g) = a{g)<p 


for all g € G, then (p is a scalar transformation: there exists a e C with (p — a\ y. 
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Proof. The vector space V is a CG-module with scalar multiplication gv — o(g)(v) for 
all v e V. and any linear transformation 0 satisfying the equation 0o(g) = cr(g)6 for all 
g e G is a CG-map V n -> V a . Since a is irreducible, the CG-module V n is simple; by 
Schur’s lemma [Theorem 8.52(ii)], we have End(V / ' T ) a division ring, and so every nonzero 
element in it is nonsingular. Now <p — a ly e End(V or ) for every a e C; in particular, this 
is so when a is an eigenvalue of (p (which lies in C because C is algebraically closed). The 
definition of eigenvalue says that q> — a 1 y is singular, and so it must be 0; that is, <p = a 1 v, 
as desired. • 

As in Proposition 8.119(ii), we may regard the irrreducible representation : G -> 
GL (Li), given by left multiplication on the minimal left ideal as a C-algebra map 
a,- : CG — > End(L, ) (after all, im a,- C End(L,)). Hence, the restriction to the center of 
CG is also an algebra map: 

li: Z(CG) —> End(L,) = Mat„,(C). 

Thus, for each z e Z(CG), we see that a; (z) is an n,- x n, complex matrix. By Proposi¬ 
tion 8.150, each kj(z) is a scalar matrix for every z e Z(CG): 


G (z) = a>j(z)F 


where a>j(z) e C. Moreover, the function a>i: Z(CG) —>• C is a C-algebra map because 
X( is. 

Recall, from Lemma 8.68, that a basis for Z(CG) consists of the class sums 

Zi = 

geQ 

where the conjugacy classes of G are Ci,..., C r . 


Proposition 8.151. Let zi, ■ ■ ■, Z r be the class sums of a finite group G. 

(i) For each i, j, we have 


Oi(Zj) = 

where gj e Cj. 

(ii) There are nonnegative integers a,j v with 


hjXi(gj) 


ZiZj 


j — X/ a ‘ 


i jvZv 


(iii) The complex numbers cot ( zj ) are algebraic integers. 
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Proof, (i) Computing the trace of A.,- ( zj ) = &>,- (zj)I gives 

niOJi(zj) = XiiZj ) = X! = hj Xi (gj ) i 

geCj 

for Xi is constant on the conjugacy class Cj. Therefore, (Oiizj ) = hjXiigj)/ni- 

(ii) Choose g v e C v . The definition of multiplication in the group algebra shows that the 
coefficient of g v in ZiZj is 

|{(g;. gj ) e Ci x Cj : gigj = g v }|, 

the cardinality of a finite set, and hence it is a nonnegative integer. As all the coefficients 
of are equal [for we are in Z(CG)], it follows that this number is Ujj v . 

(iii) Let M be the (additive) subgroup of C generated by all a>j(zj), for j — 1 Since 

&),• is an algebra map, 

a>i(zj)a)i(ze) — y^,aje v coi(z v ), 

V 

so that M is a ring that is finitely generated as an abelian group (because ajj v e Z). Hence, 
for each j, M is a Z[ft),-(z;)]-module that is a finitely generated abelian group. If M is 
faithful, then Corollary 8.10(ii) will give a>i(zj) an algebraic integer. But M C C, so that 
the product of nonzero elements is nonzero, and this implies that M is a faithful Z[&>,- (z j )]- 
module, as desired. • 

We are almost ready to complete the proof of Burnside’s theorem. 

Proposition 8.152. If(jii,hj) — 1 for some i , /, then either \ x, (g ,) = n, or xj (gj ) = 0. 

Proof. By hypothesis, there are integers ,v and t in Z with sn,- + thj — 1, so that, for 
gj e Cj, we have 

xjM= sx ,( Si)+ <hiiM. 

ni rij 

Hence, Proposition 8.151 (iii) gives Xi(gj)/ n i an algebraic integer, and so |/,(g;)| < 
by Proposition 8.138(i); thus, it suffices to show that if \Xi(gj)/ n i\ < 1- then Xi(gj) — 0. 

Let m(pc) e Z[x\ be the minimum polynomial of a = Xi(gj)/ n i• that is, m(pc) is the 
monic polynomial in Z[x] of least degree having a as a root. We proved, in Corollary 6.29, 
that m(x) is irreducible in Q[x], If a' is a root of m (x), then Proposition 4.13 shows that 
ol’ = o(a ) for some o e Gal(£/Q), where E/Q is the splitting field of m(x)(x' [G '' — 1). 
But 

ot — — (ei + ••• + £«,), 

Hi 

where the e’s are |G|th roots of unity, and so a’ — a (a) is also such a sum. It follows 
that \ct'\ < 1 [as in the proof of Proposition 8.138(i)]. Therefore, if Nipt) is the norm 
of a (which is, by definition, the absolute value of the product of all the roots of mix)), 
then Nia) < 1 (for we are assuming that |a| < 1). But Nia) is the absolute value of the 
constant term of mix), which is an integer. Therefore, Nia) = 0, hence a — 0, and so 
Xiigj) — 0, as claimed. • 
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At last, we can prove the hypothesis of Proposition 8.116, stated at the beginning of the 
previous section. 

Theorem 8.153. If G is a nonabelian finite simple group, then {1} is the only conjugacy 
class whose size is a prime power. Therefore, Burnside’s theorem is true: every group of 
order p m q n , where p and q are primes, is solvable. 

Proof. Assume, on the contrary, that hj = p* > 1 for some j. By Exercise 8.62(h) on 
page 633, for all i, we have 


Z(G/kerxi) = {g e G : |x ; (g)l = «<}• 


Since G is simple, ker/, = {1} for all i, and so Z(G/ker/,) = Z(G) — {1}. By 
Proposition 8.152, if («, , h ;) — 1, then either | Xi(gj )I = n,- or Xiigj ) — 0. Of course, 
X i (gj ) = 1 for all j, where xi is the trivial character. If Xi is not the trivial character, then 
we have just seen that the first possibility cannot occur, and so Xiigj) — 0- On the other 
hand, if (n,-, hj) ^ 1, then p | n ( - (for hj — p e ). Thus, for every i ^ 1, either Xiigj ) = 0 
or p | n*. 

Consider the orthogonality relation. Corollary 8.136(ii): 


J2 n iXi(gj) = °- 

; = 1 

Now«i = 1 = xi (gj)- while each of the other terms is either 0 or of the form pa,-, where 
otj is an algebraic integer. It follows that 


0=1 + pf), 


where f> is an algebraic integer. This implies that the rational number — 1 /p is an algebraic 
integer, hence lies in Z, and we have the contradiction that — 1 /p is an integer. • 

Another early application of characters is a theorem of F. G. Frobenius. We begin with 
a discussion of doubly transitive permutation groups. Let G be a finite group and X a finite 
G-set. Recall that if x e X , then its orbit is 0(x ) = {gx : g e G} and its stabilizer is 
G.v — {g e G : gx — x}. Theorem 2.98 shows that |G(x)||G^| = |G|. A G-set X is 
transitive if it has only one orbit: If x, y e X , then there exists g e G with y — gx: in this 
case, 0(x) = X. 

If A is a G-set, then there is a homomorphism a: G —> .S+, namely, g a g , where 
otg (x ) = gx. We say that X is a faithful G-set if a is an injection; that is, if gx — x 
for all x e X, then g — 1. In this case, we may regard G as a subgroup of Sx acting as 
permutations of X. 

Cayley’s theorem (Theorem 2.87) shows that every group G can be regarded as a faithful 
transitive G-set. 
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Definition. A G-set A is doubly transitive if, for every pair of 2-tuples (x\ , X 2 ) and 
(y] , >’ 2 ) in X x X with x\ ^ X 2 and vi ^ y 2 , there exists g e G with vi = gx 1 and 
J 2 = gx 2 . 11 

We often abuse language and call a group G doubly Uansitive if there exists a doubly 
Uansitive G-set. 

Note that every doubly Uansitive G-set X is transitive: If x ^ y, then (x, y) and (y. x) 
are 2-tuples as in the definition, and so there is g e G with y = gx (and x = gy). 


Example 8.154. 

(i) If n > 2, the symmetric group S„ is doubly Uansitive; that is, A = {1,..., n} is a 
doubly transitive Sx- set. 

(ii) The alternating group A n is doubly transitive if n > 4. 

(iii) Let V be a finite-dimensional vector space over F 2 , and let X — V — {0}. Then X is 
a doubly transitive GL(V)-set, for every pair of distinct nonzero vectors xi, X 2 in V must 
be linearly independent (see Exercise 3.69 on page 170). Since every linearly independent 
list can be extended to a basis, there is a basis x\, xi ,..., x n of V. Similarly, if yi, y 2 is 
another pair of distinct nonzero vectors, there is a basis yi, y 2 . ..., y n . But GL(V) acts 
Uansitively on the set of all bases of V, by Exercise 3.78 on page 181. Therefore, there is 
g e GL(V) with y, = gx, for all i, and so A is a doubly Uansitive GL(V)-set. 4 


Proposition 8.155. A G-set X is doubly transitive if and only if for each x e A, the 
G x -set A — {x} is transitive. 

Proof Let A be a doubly Uansitive G-set. If y, z € A — {x}, then (y, x) and (z, x) are 
2-tuples of distinct elements of A, and so there is g e G with z = gy and x = gx. The 
latter equation shows that g e G x , and so A — {x} is a transitive G v -set. 

To prove the converse, let (xj,, X 2 ) and (yi, y 2 ) be 2-tuples of distinct elements of A. 
We must find g e G with yi = gx 1 and y 2 = gy 2 - Let us denote (gxi, gX 2 ) by g{x\, xf). 
There is h e G Xl with h(x 1 , X 2 ) = (yi, X 2 ): if x\ — yi, we may take h = lx; if xi ^ yi, 
we use the hypothesis that A — {X 2 } is a Uansitive G. V7 -set. Similarly, there is h' e G Vl 
with h'(y 1 , X 2 ) = (yi, y 2 )- Therefore, h'h(x 1 , X 2 ) = (yi, y 2 ), and A is a doubly transitive 
G-set. • 


Example 8.156. 

Let k be a field, let /(x) e k [x | have no repeated roots, let E/k be a splitting field, and let 
G = Gal (E/k) be the Galois group of fix). If A = {oq,..., a,,} is the set of all the roots 


"More generally, we call a G-set X ^-transitive, where 1 < k < |X|, if, for every pair of A-tuples (.vi,..., xp) 
and (yi,..., yfc) in X x • • • x X having distinct coordinates, there exists g e G with yi = gxj for all i < k. It 
can be proved that if k > 5 , then the only faithful £-transitive groups are the symmetric groups and the alternating 
groups. The five Mathieu groups are interesting sporadic simple groups that are also highly transitive: M22 is 
3 -transitive, M\\ and M23 are 4 -transitive, and M\2 and M24 are 5 -transitive. 
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of fix), then X is a G-set (Theorem 4.3) that is transitive if and only if fix) is irreducible 
(Proposition 4.13). Now fix) factors in k(oii)[x]: 

fix) = ix - <x\)f\(x). 

The reader may show that G i = Gal(£’/fc(ai)) < G is the stabilizer G U] and that X — {aj} 
is a Gi-set. Thus, Proposition 8.155 shows that X is a doubly transitive G-set if and only 
if both fix) and fiix) are irreducible (over k[x] and fc(ai)[x], respectively). ◄ 

Recall Example 2.92(h): If H is a subgroup of a group G and X = G/H is the family 
of all left cosets of H in G, then G acts on G/H by g: aH i-»- gaH. The G-set X is 
transitive, and the stabilizer of aH e G/H is aHa~ x \ that is, gaH — aH if and only if 
a~ l ga e H if and only if g e aHa~ l . 

Proposition 8.157. If X is a doubly transitive G-set, then 

\G\ = nin - l)\G x , y \, 

where n — |X| and G Xi y = {g e G : gx — x and gy = y}. Moreover, if X is a faithful 
G-set, then |G. Vi y| is a divisor of in — 2)!. 

Proof. First, Theorem 2.98 gives |G| = n|G x |, because X is a transitive G-set. Now 
X — {x} is a transitive G^-set, by Proposition 8.155, and so 

\G x \ = \X-{x}\\iG x ) y \ = in-\)\G x , y \, 

because iG x ) y — G xy . The last remark follows, in this case, from G x y being a subgroup 
of Sx~ {.v,v} = Sn-2- • 

It is now easy to give examples of groups that are not doubly transitive, for the orders 
of doubly transitive groups are constrained. 

Definition. A transitive G-set X is called regular if only the identity element of G fixes 
any element of X ; that is, G x = {1} for all x e X. 

For example, Cayley’s theorem shows that every group G is isomorphic to a regular 
subgroup of Sq ■ The notion of regularity extends to doubly transitive groups. 

Definition. A doubly transitive G-set X is sharply doubly transitive if only the identity 
of G fixes two elements of X; that is, G x y = {1} for all distinct pairs x, y e X. 

Proposition 8.158. The following conditions are equivalent for a faithful doubly transi¬ 
tive G-set X with \X\ = n. 

(i) X is sharply doubly transitive. 

(ii) If ix i, x' 2 ) and iy 1 , 3 ^ 2 ) are 2-tuples in X x X with x\ X 2 and y\ y 2 , then there 
is a unique g e G with y 1 = gx 1 and y 2 — gy 2 - 
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(iii) |G| = n(n — 1). 

(iv) G x ,y = {1 } for all distinct x, y e X. 

(v) For every x e X, the G x -set X — {x} is regular. 

Proof. All the implications are routine. • 


Example 8.159. 

(i) S 3 and A 4 are sharply doubly transitive groups. 

(ii) The affine group Aff(l, R) was defined in Exercise 2.46 on page 80; it consists of all 
the functions /; R -* R of the form f(x ) = ax + b with a 0 under composition, and 
it is isomorphic to the subgroup of GL(2, R) consisting of all matrices of the form [2 *]. 
It is plain that we can define Aff(l, k) for any field k in a similar way. In particular, if k 
is the finite field F ? , then the affine group Aff(l, F ? ) is finite, and of order q{q — 1). The 
reader may check that is a sharply doubly transitive Aff(l, F ? )-set. ◄ 

Notation. If G is a group, then G # — {g e G : g 1}. 

By Cayley’s theorem, every group is regular. We now consider transitive groups G such 
that each g e G # has at most one fixed point. In case every g e G # has no fixed points, 
then we say that the action of G is fixed point free. J. G. Thompson proved that if a finite 
group H has a fixed point free automorphism a of prime order (that is, the action of the 
group G — {a) on H # is fixed point free), then H is nilpotent (see Robinson, A Course in 
the Theory of Groups, pages 306-307). Thus, let us consider such actions in which there 
is some g e G # that has a fixed point; that is, the action of G is not regular. 

Definition. A finite group G is a Frobenius group if there exists a transitive G-set X 
such that 

(i) every g e G # has at most one fixed point; 

(ii) there is some g e G # that does have a fixed point. 

If x e X, we call G x a Frobenius complement of G. 

Note that condition (i) implies that the G-set X in the definition is necessarily faithful. 
Let us rephrase the two conditions: (i) that every g e G # has at most one fixed point says 
that G K y = {1}; (ii) that there is some g e G # that does have a fixed point says that 
GvtM 11 - 

Example 8.160. 

(i) The symmetric group .S 3 is a Frobenius group: X = {1,2, 3} is a faithful transitive 
53 -set; no a e (S 3 f fixes two elements; each transposition (i j ) fixes one element. 
The cyclic subgroups ((i j)) are Frobenius complements (so Frobenius complements 
need not be unique). A permutation ft e .S 3 has no fixed points if and only if ft is a 
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3-cycle. We are going to prove that, in every Frobenius group, 1 together with all those 
elements having no fixed points comprise a normal subgroup. 

(ii) The example of S 3 in part (i) can be generalized. Let X be a G-set, with at least three 
elements, which is a sharply doubly transitive G-set. Then X is transitive, G xy = {1}, and 
G.v ^4 {1} (for if x, y, z € X are distinct, there exists g e G with x = gx and z = gy). 
Therefore, every sharply doubly transitive group G is a Frobenius group. ◄ 

Proposition 8.161. A finite group G is a Frobenius group if and only if it contains a 
proper nontrivial subgroup H such that H IT gHg~ [ — {\} for all g f H. 

Proof. Let X be a G-set as in the definition of Frobenius group. Choose x e X, and 
define H — G x . Now H is a proper subgroup of G, for transitivity does not permit gx = x 
for all g € G. To see that H is nontrivial, choose g e G # having a fixed point; say, gy — y. 
If v = x , then g e G x = //. If y f x, then transitivity provides h e G with hy — x, and 
Exercise 2.99 on page 114 gives H = G x — hG y h~ x ^ {1}. If g $. H, then gx f x. Now 
g(G.v)g _1 = G gx . Hence, if h e H fl gHg~ l = G x fl G gx , then h fixes x and gx\ that is, 
h e G x , y = { 1 }. 

For the converse, we take X to be the G-set G/H of all left cosets of H in G, where 
g: aH gaH for all g e G. We remarked earlier that X is a transitive G-set and that 
the stabilizer of aH e G/H is the subgroup aHa~ l of G. Since H {1}, we see that 
G a ti 7 £ {1}- Finally, if aH bH, then 

GaH.bH — G a H Fl GbH = aHa~ l fl bHb~ l — a(H IT a~ l bHb~ l a)a~ l = {1}, 

because a~ l b </ H. Therefore, G is a Frobenius group. • 

The significance of this last proposition is that it translates the definition of Frobenius 
group from the language of G-sets into the language of abstract groups. 

Definition. If X is a G-set, define its Frobenius kernel to be the subset 
lV = {l}U{geG:g has no fixed points}. 

When X is transitive, we can describe N in terms of a stabilizer G x . If a </ /V # , 
then there is some y e X with ay = y. Since G acts transitively, there is g e G with 
gx = y, and a e G y — gG x g~ l . Hence, a e U s sg 8G x g~ l . For the reverse inclusion, if 
a e U^sG sG x g~ l , then a e gG x g _1 = G gx for some g e G, and so a has a fixed point; 
that is, a f N. We have proved that 

(V = {1} U (G — ([J gG x g~ 1 )). 

g sG 

Exercise 5.32 on page 278 shows that if G x is a proper subgroup of G, then G f 
U ?s g gG x g~ l , and so N ^ { 1 } in this case. 
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Proposition 8.162. If G is a Frobenius group with Frobenius complement H and Frobe- 
nius kernel N, then | N — [G : H], 

Proof. By Proposition 8.161, there is a disjoint union 

G = {l}u(U l ?H # g-')uiV # . 

geG 

Note that Ng(H) = H■ If g f H , then H fl gHg~ { — {1}, and so gHg~ l H. Hence, 
the number of conjugates of H is [G : N(; ill)] = [G : H] (Proposition 2.101). Therefore, 

I U geG gH # g~ l \ = [G : H](\H\ - 1), and so 

\N\ = |1V # | + 1 = |G| - ([G : H](\H\ - 1)) = [G : H], . 

The Frobenius kernel may not be a subgroup of G. It is very easy to check that if g e N. 
then g _1 e N and aga~ l e N for every a e G; the difficulty is in proving that N is closed 
under multiplication. For example, if V — k" is the vector space of all n x 1 column vectors 
over a field k, then V /# , the set of nonzero vectors in V, is a faithful transitive GL( V )-set. 
Now A e GL(V) has a fixed point if and only if there is some v e V # with Av — v: that 
is, A has a fixed point if and only if 1 is an eigenvalue of A. Thus, the Frobenius kernel 
now consists of the identity matrix together with all linear transformations which do not 
have 1 as an eigenvalue. Let \k\ > 4, and let a be a nonzero element of k with a 2 f 1. 
Then A — [ q ® ] and B = “ Q 1 ^ j lie in N, but their product AB = q ^2 j does not lie 
in N. However, if G is a Frobenius group, then N is a subgroup; the only known proof of 
this fact uses characters. 

We have already remarked that if i/r is a character on a subgroup H of a group G, then 
the restriction (x[r ] 0 ) h need not equal \[r. The next proof shows that irreducible characters 
of a Frobenius complement do extend to irreducible characters of G. 

Lemma 8.163. Let G be a Frobenius group with Frobenius complement H and Frobenius 
kernel N. For every irreducible character i/f on H other than the trivial character \f /\, 
define the generalized character 

(p — if — dfi 1 , 

where d = ^(l)- Then — (p] G + dx 1 is an irreducible character on G, and i/Pj) = 1 \r\ 
that is, xjr*{h) = (h) for all h e H. 

Proof. Note first that q>( 1) = 0. We claim that the induced generalized character <p\ G 
satisfies the equation 

(<P] G )h = <P- 

If fl = 1, ..., f„ is a transversal of H in G, then for g e G, the matrix of tp] G (g) on 
page 627 has the blocks B(t~ l gt ,) on its diagonal, where B(tf l gti) = 0 if t~ l gtj f. H 
(this is just the matrix version of Theorem 8.142). If h e H , then tf 1 h t, f H for all i f 1, 
and so B ( tf 1 h f,■) = 0. Therefore, there is only one nonzero diagonal block, and 

trM G (/i)) = tr(£(/ 0 ); 
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that is, 

<p] G (h) = <p(h). 

We have just seen that <p] G is a generalized character on G such that {<p] G ) H — cp. By 
Frobenius reciprocity (Theorem 8.145), 

(<p] G , <pf) G = (9>, ((P] G )h)h = (<P, <P)h- 


But <p — xjt — df\ , so that orthogonality of i jr and f 1 gives 

(<P,<P)h - 1 +d 2 . 


Similarly, 


C <P] G , Xi )g = (<P, )h = ~d , 


where / \ is the trivial character on G. Define 


xjr* = cp] G + dxi- 


Now xj/* is a generalized character on G, and 

V)g = (V\ G , (p] G )G+2d((p} G , xi )G+d 2 
= l+d 2 -2d 2 + d 2 = 1. 


We have 


(l/ r *)ff = (‘P] G )h +d(xi)H — (p + dxj/i = -dfi) +dxj/\ = f. 

Since i/f*(l) = xjs( 1) > 0, Corollary 8.130 says that \jf* is an irreducible character on G. • 

Theorem 8.164 (Frobenius). Let G be a Frobenius group with Frobenius complement 
H and Frobenius kernel N. Then N is a normal subgroup of G, N D H — {1}, and 
NH = G. 


Remark. A group G having a subgroup Q and a normal subgroup K such that 
K (T Q — {1} and K Q — G is called a semidirect product. We will discuss such groups in 
Chapter 10. ◄ 

Proof. For every irreducible character i// on H other than the trivial character f \, define 
the generalized character tp — f—dfi. where d — x]/(\). By the lemma, xjr* — cp] G +c//i 
is an irreducible character on G. Define 

N* = Q kerf*. 


Of course, N* is a normal subgroup of G. 



644 


Algebras Ch. 8 


By Lemma 8.163, f*(h) = f(h) for all h e //; in particular, if h — 1, we have 

= )=d. (5) 

If g e A' # , then for all a e G, we have g aHa~ l (for g has no fixed points), and so 
ip(aga~ x ) — 0. The induced character formula. Theorem 8.142, now gives <p1 G (g) — 0. 
Hence, if g e A /# , then Eq. (5) gives 


f*(g) = <P] °(g) + dxi(g) = d. 


We conclude that if g e N, then 




that is, g e kcr ij/ *. Therefore, 


N c N*. 

The reverse inclusion will arise from a counting argument. 

Let h € H IT N*. Since h e H, Lemma 8.163 gives i Js*(h) = i/(h). On the other 
hand, since h e N*, we have = Vf*(l) = d. Therefore, \jr(h) — \jr*(h) = 

d = ir(l), so that h e ker \[r for every irreducible character i// on H. Consider the 
regular character, afforded by the regular representation p on H: xp — Hi n ii f i- Now 
X p (h) — Hi n i'P'i(h) 7 ^ 0, so that Example 8.125(ii) gives h = 1. Thus, 

H HN* = {1}. 


Next, |G| = \H\[G : H] = |//| |iV|, by Proposition 8.162. Note that HN* is a subgroup 
of G, because N* <1 G. Now \HN*\\H fl N*\ — |//||A^*|, by the second isomorphism 
theorem; since HON* — {1}, we have |//||A^| = |G| > = |//||A^*|. Hence, 

\N\ > \N*\. But \N\ < |N*|, because N c N*, and so N = N*. Therefore, IV < G, 
H (1 N = {1}, and HN — G. • 

Much more can be said about the structure of Frobenius groups. Every Sylow sub¬ 
group of a Frobenius complement is either cyclic or generalized quaternion (see Huppert, 
Endliche Gruppen I, page 502), and it is a consequence of J. G. Thompson’s theorem on 
fixed-point-free automorphisms that every Frobenius kernel is nilpotent; that is, N is the 
direct product of its Sylow subgroups. The reader is referred to Curtis-Reiner, Representa¬ 
tion Theory of Finite Groups and Associative Algebras , pages 242-246, or Feit, Characters 
of Finite Groups, pages 133-139. 
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Exercises 

8.68 Prove that the affine group Aff(l, F ? ) in Example 8.159(h) is sharply doubly transitive. 

8.69 If H < G and the family of left cosets G/H is a G-set via the representation on cosets, prove 

that G/H is a faithful G-set if and only if f~) aeG = {1). Give an example in which 

G/H is not a faithful G-set. 

8.70 Prove that every Sylow subgroup of SL(2, F 5 ) is either cyclic or quaternion. 

8.71 A subset A of a group G is a T.I. set (or a trivial intersection set) if A C Nq(A) and 
AflgAg -1 C (1) for all g (/ N G (A). 

(i) Prove that a Frobenius complement H in a Frobenius group G is a T. I. set. 

(ii) Let A be a T. I. set in a finite group G, and let N — Nq(A). If a be a class function 

vanishing on N — A and f is a class function on N vanishing on (U^eG^^ ^ ) — 4. 

prove, for all g e N # , that a] G (g) = a(g ) and f] G (g) = /3(g). 

Hint. See the proofs of Lemma 8.164 and Theorem 8.163. 

(iii) If a(l) = 0, prove that (a, f))^ = (a\ G , f] G )o- 

(iv) Let H be a self-normalizing subgroup of a finite group G; that is, H = Nq(H). If 
H is a T. I. set, prove that there is a normal subgroup K of G with K IT H = {1} and 
KH = G. 

Hint. See Feit, Characters of Finite Groups, page 124. 

8.72 Prove that there are no nonabelian simple groups of order n, where 60 < n < 100. 

Hint. By Burnside’s theorem, the only candidates for n in the given range are 66 , 70, 78, 84, 
and 90, and 90 was eliminated in Exercise 5.29(ii) on page 278. 

8.73 Prove that there are no nonabelian simple groups of order n, where 101 < n < 168. We 
remark that PSL(2, F 7 ) is a simple group of order 168, and it is the unique such group, to 
isomorphism. With Proposition 5.41, Corollary 5.68, and Exercise 8.72, we see that A 5 is the 
only nonabelian simple group of order strictly less than 168. 

Hint. By Burnside’s theorem, the only candidates for n in the given range are 102. 105, 110, 
120, 126, 130, 132, 138, 140, 150, 154, 154, 156, and 165. Use Exercise 2.98 on page 114 
and Exercises 5.30 and 5.31 on page 278. 
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This chapter begins with the study of modules over PIDs, including characterizations of 
their projective, injective, and flat modules. Our emphasis, however, is on finitely gener¬ 
ated modules, because the generalization of the Fundamental Theorem of Finite Abelian 
Groups, when applied to k[x]-modules, yields the rational and Jordan canonical forms for 
matrices. The Smith normal form is also discussed, for it can be used to compute the invari¬ 
ants of a matrix. We then consider bilinear and quadratic forms on a vector space, which 
lead to symplectic and orthogonal groups. Multilinear algebra is the next step, leading to 
tensor algebras, exterior algebras, and determinants. We end with an introduction to Lie 
algebras, which can be viewed as a way of dealing with a family of linear transformations 
instead of with individual ones. 


9.1 Modules over PIDs 

The structure theorems for finite abelian groups will now be generalized to modules over 
PIDs. As we have just said, this is not mere generalization for its own sake, for the module 
version will yield canonical forms for matrices. Not only do the theorems generalize, but 
the proofs of the theorems generalize as well, as we shall see. 

Definition. Let M be an /?-module. If m e M, then its order ideal (or annihilator) is 

ann(m) — {r e R : rm — 0}. 

We say that m has finite order (or is a torsion 1 element ) if ann(m) ^ {0}; otherwise, m has 

infinite order. 

When a commutative ring R is regarded as a module over itself, its identity 1 has infinite 
order, for ann(l) = { 0 }. 

1 The etymology of the word torsion is given on page 267. 
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Order ideals generalize the group-theoretic notion of the order of an element. Recall that 
if G is an additive abelian group, then an element g e G has finite order if ng = 0 for some 
positive integer n, while g has order d del is the smallest positive integer with dg — 0. On 
the other hand, ann(g) is an ideal in Z and, as any nonzero ideal in Z, it is generated by 
the smallest positive integer in it. Thus, the order ideal ann(g) = (d), the principal ideal 
generated by the order d of g. In Proposition 7.12, we proved that if M = (m) is a cyclic 
/f-module, where R is any commutative ring, then M = R/1. The ideal I in this corollary 
is ker</>, where <p: R —> M is the map r i-»- rm, so that I = ann(m), and 

(m> = R/ann(m). 

Definition. If M is an R-module, where R is a domain, then its torsion submodule 2 tM 
is defined by 

tM — {m e M : m has finite order}. 

Proposition 9.1. If R is a domain and M is an R-module, then tM is a submodule of M. 

Proof. If m, m' e tM, then there are nonzero elements r, r' e R with rm = 0 and r'm' = 
0. Clearly, rr'(m + in') — 0. Since R is a domain, rr' 0, and so ann(m + m') {0}; 

therefore, m + in' e tM. 

If s e R, then sm e tM, for r e ann(sm) because rsm = 0. • 

This proposition can be false if R is not a domain. For example, let R = \.\n M — Hf,, 
both [3] and [4] have finite order, for [2] e ann([3]) and [3] e ann([4]). On the other hand, 
[3] + [4] = [1], and [1] has infinite order in M, for ann([l]) = {0}. 

For the remainder of this section, R will be a domain (indeed, it will soon be restricted 
even further). 

Definition. If R is a domain and M is an R-module, then M is torsion if tM — M, while 
M is torsion-free if tM — {0}. 


Proposition 9.2. Let M and M' be R-modules, where R is a domain. 

(i) M/tM is torsion-free. 

(ii) If M = M', then tM = tM' and M/tM = M'/tM'. 

Proof, (i) Assume that m + tM 0 in M/tM\ that is, in has infinite order. If m + tM has 
finite order, then there is some r e R with r f 0 such that 0 = r(in + tM) = rm + tM ; 
that is, rm e tM. Thus, there is s e R with s / 0 and with 0 = s(rm) — (sr)in. But 
sr f 0, since R is a domain, and so ann(m) f {0}; this contradicts m having infinite order. 


’There is a generalization of the torsion submodule, called the singular submodule. which is defined for left 
R-modules over any not necessarily commutative ring. See Dauns, Modules and Rings, pages 231-238. 
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(ii) If tp: M —*■ M' is an isomorphism, then cp{tM ) C tM'. for if rm = 0 with r ^ 0, then 
r<p(m ) = <p(rm) = 0 (this is tme for any //-homomorphism); hence, <p\tM: tM -> tM' is 
an isomorphism (with inverse <p _1 | tM'). For the second statement, the map ip: M/tM —» 
M'/tM'. defined by ip: m + tM i-»- < p{m) + tM'. is easily seen to be an isomorphism. • 

Here is a fancy proof of Proposition 9.2. There is a functor t: r Mod —> ^Mod defined 
on modules by M h* tM and on morphisms by <p i-> <p\tM. That tM = tM' follows from 
the fact that every functor preserves equivalences. 

A non-noetherian commutative ring R. by its very definition, has an ideal that is not 
finitely generated. Now R, viewed as a module over itself, is finitely generated; indeed, it 
is cyclic (with generator 1). Thus, it is possible that a submodule of a finitely generated 
module need not, itself, be finitely generated. This cannot happen when R is a PID; in fact, 
we have proved, in Proposition 7.23(h), that if R is a PID, then every submodule S of a 
finitely generated //-module is itself finitely generated; indeed, if M can be generated by n 
elements, then S can be generated by n or fewer elements. 

Theorem 9.3. If R is a PID, then every finitely generated torsion-free R-module M is 
free. 

Proof. We prove the theorem by induction on n, where M — (tq..... v„). 

If n — 1, then M is cyclic; hence, M — (iq) = R/ ann(tq). Since M is torsion-free, 
ann(tq) = {0}, so that M 'A //, and hence M is free. 

For the inductive step, let M — {v\, ..., i>„ + ]) and define 

S — [m e M : there is r e R. r ^ 0, with rm e (u n +i)}; 

it is easy to check that S is a submodule of M. Now M/S is torsion-free: If x e M, x f S , 
and r(x + S) — 0, then rx e S: hence, there is r' e R with r' 0 and rr'x e (iVi+ 1 ). Since 
rr' f 0, we have x e S, a contradiction. Plainly, M/S can be generated by n elements, 
namely, iq + S,.... v n + S. and so M/S is free, by the inductive hypothesis. Since free 
modules are projective. Proposition 7.54 gives 

M = S © (M/S). 

Thus, the proof will be completed once we prove that S = R. 

If x e S. then there is some nonzero r e R with rx e (v„+i); that is, there is a e R with 
rx — av n +\. Define (p: S -> Q — Frac(//), the fraction field of R. by (p: x i->- a/r. It is 
a straightforward calculation, left to the reader, that cp is a (well-defined) injective //-map. 
If D = im <p, then D is a finitely generated submodule of Q. 

The proof will be complete if we can prove that every finitely generated submodule D 
of Q is cyclic. Now 

D = (b\/c\,..., b m /c m ), 

where b ,, a e R. Let c — P|, c i, and define / : I) R by f: d \-r cd for all d e D (it 
is plain that / has values in //, for multiplication by c clears all denominators). Since D is 
torsion-free, / is an injective //-map, and so D is isomorphic to a submodule of //; that is, 
D is isomorphic to an ideal of R. Since R is a PID, every nonzero ideal in R is isomorphic 
to R: hence, S = ixrup = D = R. • 
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Corollary 9.4. If R is a PID, then every submodule S of a finitely generated free R- 
module F is itself free, and rank(S) < rank(C). In particular, every finitely generated 
projective R-module P is free. 

Proof. By Proposition 7.23(ii), the submodule S can be generated by n or fewer elements, 
where n = rank! F). Now F is torsion-free, and hence S is torsion-free. Theorem 9.3 now 
applies to give S free. 

The second statement follows from Theorem 7.56: the characterization of projective 
modules as direct summands of free modules. Since P is finitely generated, there is a 
finitely generated free module F and a surjection q: F —> P ; since P is projective, there 
is a map j: P -> F with qj — lp. Thus, j restricts to an isomorphism of P with a 
submodule of F, which is free, by the first part of the proof. • 

Remark. Both statements in the corollary are true without the finiteness hypothesis, and 
we shall soon prove them. < 

Corollary 9.5. 

(i) If R is a PID, then every finitely generated R-module M is a direct sum 

M = tM © F, 

where F is a finitely generated free R-module. 

(ii) If M and M' are finitely generated R-modules, where R is a PID, then M = M' if 
and only if tM = tM' and rank (M/tM) — rank(M' / tM’). 


Proof, (i) The quotient module M/tM is finitely generated, because M is finitely gen¬ 
erated, and it is torsion-free, by Proposition 9.2(i). Therefore, M/tM is free, by Theo¬ 
rem 9.3, and hence M/tM is projective. Finally, M = tM © (M/tM), by Corollary 7.55 
on page 476. 

(ii) By Proposition 9.2(h), if M = M', then tM = tM' and M/tM = M'/tM'. Since 
M/tM is finitely generated torsion-free, it is a free module, as is M'/tM' , and these are 
isomorphic if they have the same rank. 

Conversely, since M = tM © (M/tM) and M' = tM' © (M'/tM'), Proposition 7.30 
assembles the isomorphisms on each summand into an isomorphism M -> M'. • 


Remark. This corollary requires the finitely generated hypothesis. There exist abelian 
groups G whose torsion subgroup tG is not a direct summand of G [see Exercise 9.1 (iii) 
on page 663]. ◄ 


We can now characterize flat modules over a PID. 
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Corollary 9.6. If R is a PID, then an R-module M is flat if and only if it is torsion-free. 

Proof. By Theorem 9.3, every finitely generated torsion-free R -module is free, and so it 
is flat, by Lemma 8.98. By Lemma 8.97, M itself is flat. 

Conversely, if M is not torsion-free, then it contains a nonzero element m of finite 
order, say, (r). If i: R —> TractW) is the inclusion, then in <Ei 1 e kerfl m <S> i), for in 
M <S)r Tract/t), we have 

r 1 

m <g> 1 = m <g> - = rm ® - = 0. 

r r 

On the other hand, m ® 1 fl 0 in M <S)r R. for the map m 8 1 !-> m is an isomorphism 
M <S>« R —> M, by Proposition 8 . 86 . Therefore, M is not flat. • 

Before continuing the saga of finitely generated modules, we pause to prove an impor¬ 
tant result: the generalization of Corollary 9.4, in which we no longer assume that free 
modules F are finitely generated. We begin with a second proof of the finitely generated 
case that will then be generalized. 

Proposition 9.7. If R is a PID, then every submodule H of a finitely generated free 
R-module F is itself free, and rank(H) < ranl<( F). 

Proof. The proof is by induction on n — rank(T). If n = 1, then F = R. Thus, H is 
isomorphic to an ideal in R\ but all ideals are principal, and hence are isomorphic to {0} or 
R. Therefore, H is a free module of rank < 1. 

Let us now prove the inductive step. If {x\ ,..., x n +\] is a basis of F, define F' = 
(xi,..., x n ), and let H' = H fl F'. By induction, H’ is a free module of rank < n. Now 

H/H' = H/(H fi F') = (H + F')/F' c F/F' = R. 

By the base step, either H/H' = {0} or H/H' = R. In the first case, H = H', and we 
are done. In the second case. Corollary 7.55 gives H — H' © (h) for some h e H, where 
(h) = R , and so H is free abelian of rank < n + 1. • 

We now remove the finiteness hypothesis. 

Theorem 9.8. If R is a PID, then every submodule H of a free R-module F is itself free, 
and rankf//) < rank! F). In particular, every projective R-module H is free. 

Proof. We are going to use the statement, equivalent to the axiom of choice and to Zorn’s 
lemma (see the Appendix), that every set can be well-ordered. In particular, we may as¬ 
sume that {xk : k e AT} is a basis of F having a well-ordered index set K. 

Tor each k e K, define 

F k = (Xj ■ j <k) and F k = ( Xj : j < k) = F[ © {x k ); 
note that F = F k . Define 

H k = H n F' k 


and H k = H fl F k . 
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Now H k — H fl F' k — H k fl F' k , so that 

Hk/Hfr = H k /(H k n Fl) 

= (H k + F' k )/F' k C F k /F' k = R. 


By Corollary 7.55, either H k — H' k or Il k = fl k © {h k ), where h k e H k C // and 
(. h k ) = R. We claim that H is a free /?-module with basis the set of all h k . It will then 
follow that rank(//) < rank(F). 

Since F = (J F k , each f e F lies in some F k ; since K is well-ordered, there is a 
smallest index k e K with / e F k , and we denote this smallest index by /x(/). In 
particular, if h e //, then 


guilt) — smallest index k with h e F k . 

Note that if h e H' k C F' k , then /i(h ) -< k. Let H* be the submodule of H generated by all 
the h k . 

Suppose that H* is a proper submodule of H. Let j be the smallest index in 

{fi(h) : h e Handh i //*}, 

and choose h' e H to be such an element having index j ; that is. It Fl* and //(/;') = j. 
Now h 1 e H C 1 Fj, because /xl/L) = j , and so 

h' = a + rh j, where a e H', and r e R. 

Thus, a — h' — rh/ e //' and a H*\ otherwise W e H* (because hj e H*). Since 
/i(a) < j, we have contradicted j being the smallest index of an element of H not in H*. 
We conclude that H* = H\ that is, every h e H is a linear combination of h k s. 

It remains to prove that an expression of any h e H as a linear combination of h k s is 
unique. By subtracting two such expressions, it suffices to prove that if 


0 = nh kl + r 2 h kl H-f r n h kn . 


then all the coefficients r ,• = 0. Arrange the terms so that k\ -< k 2 < ■ ■ ■ -< k n . If r n ^ 0, 
then r n h kn e (h k „) fl H' k — {0}, a contradiction. Therefore, all r,- = 0, and so H is a free 
module with basis {h k : k e K}. • 

We return to the discussion of finitely generated modules. In light of Proposition 9.2(ii), 
the problem of classifying finitely generated /^-modules, when R is a PID, is reduced to 
classifying finitely generated torsion modules. Let us say at once that these modules are 
precisely the generalization of finite abelian groups. 

Proposition 9.9. An abelian group G is finite if and only if it is a finitely generated torsion 
Jj-module. 
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Proof. If G is finite, it is obviously finitely generated; moreover, Lagrange’s theorem says 
that G is torsion. 

Conversely, suppose that G — {x\, ..., x„} and there are nonzero integers dj with 
djXj = 0 for all i. It follows that each g e G can be written 

g = mix i H-1- m n x n . 

where 0 < m; < dj for all i. Therefore, |G| < f] dj, and so G is finite. • 

Definition. Let R be a PID and M be an P-module. If P = (p) is a nonzero prime ideal 
in R, then M is (p)-primary if, for each m e M, there is n > 1 with p n m = 0. 

If M is any P-module, then its (p)-primary component is 

Mp — {m e M : p"m = 0 for some n > 1}. 

If we do not want to specify the prime P, we may write that a module is primary (instead 
of P-primary). It is clear that primary components are submodules. 

All of the coming theorems in this section were first proved for abelian groups and, 
later, generalized to modules over PIDs. The translation from abelian groups to modules 
is straightforward, but let us see this explicitly by generalizing the primary decomposition 
to modules over PIDs by adapting the proof given in Chapter 5 for abelian groups. For the 
reader’s convenience, we reproduce this proof with the finiteness hypothesis eliminated. 

Theorem 9.10 (Primary Decomposition). 

(i) Every torsion abelian group G is a direct sum of its p-primary components: 

p 

(ii) Every torsion R-module M, where R is a PID, is a direct sum of its 
P-primary components: 

M = £m p . 
p 

Proof, (i) Let x e G be nonzero, and let its order be d. By the fundamental theorem of 
arithmetic, there are distinct primes pi, ..., p n and positive exponents e\,... ,e n with 

d = p e l i ---p e n ". 

Define r, = d / p e f , so that //' r, = d. It follows that r,x e G Pi for each i. But the gcd of 
ri, ..., r„ is 1, and so there are integers si, ... ,s„ with 1 = JL s,r,. Therefore, 
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For each prime p, write H p — G ? j. By Exercise 7.79 on page 519, it suffices to 

prove that if 

x e G p fi H p , 

then x — 0. Since x e G p , we have p e x — 0 for some l > 0; since x e II n , we have 
ux = 0, where u — q j 1 • • • q„, <y, =/= p, and fi > 1 for all i. But p l and u are relatively 
prime, so there exist integers s and t with 1 = sp 1 + tu. Therefore, 

x — (sp 1 + tu)x = sp e x + tux = 0. 


(ii) We now translate the proof just given into the language of modules. If m e M is 
nonzero, its order ideal ann(m) = (d), for some d e R. By unique factorization, there are 
irreducible elements p\, no two of which are associates, and positive exponents 

e\,... ,e n with 

d = p e 1 l -- P fr- 

By Proposition 6.17, Pj — ( p ,) is a prime ideal for each i. Define r, = d / p c -', so that 
pfrj — d. It follows that rpn e Mp t for each i. But the gcd of the elements rj,..., r„ 
is 1, and so there are elements si,..., s n e R with 1 = Sj r,. Therefore, 

m = ^ Sjrjin e l [Jm p 
i \ P 


For each prime P, write Hp — G qj . By Exercise 7.79 on page 519, it suffices 

to prove that if 

m € Mp (T Hp, 

then m = 0. Since m e Mp where P — (p), we have pfu = 0 for some t > 0; since 
m e Hp, we have urn = 0, where u — q{' ■ ■ ■ q„, Q, — (q,), and f > 1. But p e and u 
are relatively prime, so there exist s,teR with 1 = sp 1 + tu. Therefore, 

m — (sp ( + tu)m — sp ( m + turn — 0 . • 


Proposition 9.11. Two torsion modules M and M 1 over a PID are isomorphic if and only 
if Mp = M’p for every nonzero prime ideal P. 

Proof. If /: M — > M’ is an W-map, then f(Mp) C M' p for every prime ideal P — (/;), 
for if p ( m — 0, then 0 = f(p e m) = p l f(m). If / is an isomorphism, then f~ l : M' —> 
M is also an isomorphism. It follows that each restriction f\Mp: Mp —»• M' p is an 
isomorphism, with inverse f~ x \M’ p . Conversely, if there are isomorphisms fp : Mp —» 
M'p for all P, then there is an isomorphism rp\ 'ffp Mp —> Jfp M' p given by m p 
J2pfp( m P)- • 



654 


Advanced Linear Algebra Ch. 9 


We remark that there is a fancy proof here, just as there is for Proposition 9.2. Define 
a “P-torsion functor” tp: p Mod —» /{Mod on modules by M h> (tM)p and on mor- 
phisms by <p i-> (p\{tM)p. That (tM)p = (tM')p follows from the fact that every functor 
preserves equivalences. 

For the remainder of this section, we shall merely give definitions and statements of 
results; the reader should have no difficulty in adapting proofs of theorems about abelian 
groups to proofs of theorems about modules over PIDs. 

Theorem 9.12 (Basis Theorem). If R is a PID, then every finitely generated module 
M is a direct sum of cyclic modules in which each cyclic summand is either primary or is 
isomorphic to R. 

Proof By Corollary 9.5, M = tM © F, where F is finitely generated free; see Theo¬ 
rem 5.18 for the abelian group version of the basis theorem. • 

Remark. The reader may be amused by a sophisticated proof of the basis theorem. By 
Corollary 9.5 and Theorem 9.10(ii), we may assume that M is P-primary for some prime 
ideal P — (p). 

There is a positive integer e with p e M = {0}: if M = (mi, ..., m n ), then p e ‘mi = 0 
for some ei, and we choose e to be the largest of the e; (we may assume that e — e n ). 
By Exercise 7.4 on page 440, if / = ( p e ), then M / JM is an R /7-module; indeed, since 
JM — {0}, we have M itself is an P/T-module. Now (m„) = R/(p e ) = R/J is an 
injective P/T-module, by Proposition 7.76, and Proposition 7.64 says that the submodule 
S = (m n ) is a direct summand: 

M — (m„) © T, 

where T is an R j J -submodule of M; a fortiori, T is an P-submodule of M [if r e R 
and t e T, then (r + J)t makes sense; define rt = (r + J)t], As T can be generated 
by fewer than n elements, we may assume, by induction, that it is a direct sum of cyclic 
submodules. ■* 

Corollary 9.13. Every finitely generated abelian group is a direct sum of cyclic groups, 
each of prime power order or infinite. 

When are two finitely generated modules M and M' over a PID isomorphic? 

Before stating the next lemma, recall that M/pM is a vector space over R/(p), and we 
define 

d(M) — dim (M/pM). 

In particular, d(pM) = dim (pM / p 2 M) and, more generally, 

d(p"M) = dim (p n M/p" +l M). 

Definition. If M is a finitely generated (/p)-prim ary P-module, where R is a PID and 
P — (p) is a prime ideal, then 

U P (n, M) = d(p"M) - d(p n+l M). 
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Theorem 9.14. If R is a PID and P — (p) is a prime ideal in R, then any two decompo¬ 
sitions of a finitely generated P -primary R-module M into direct sums of cyclic modules 
have the same number of cyclic summands of each type. More precisely, for each n > 0, 
the number of cyclic summands having order ideal ( p " +l ) is Up(n , M). 

Proof. See Theorem 5.23. • 


Corollary 9.15. IfM and M' are P-primary R-modules, where R is a PID, then M = M' 
if and only ifUpfii, M) = Up(n, M')for all n > 0. 

Proof. See Corollary 5.24. • 


Definition. If M is a P -primary R -module, where R is a PID, then the elementary divi¬ 
sors of M are the ideals (p" +] ), each repeated with multiplicity Up(n, M). 

If M is a finitely generated torsion R -module, then its elementary divisors are the ele¬ 
mentary divisors of all its primary components. 


The next definition is motivated by Corollary 5.30: If G is a finite abelian group with 
elementary divisors [pfi }, then 


ij 


Definition. If M is a finitely generated torsion /?-module, where R is a PID, then the 
order of M is the principal ideal generated by the product of its elementary divisors. 


namely, (j[ ij /r"). 


Example 9.16. 

If k is a field, how many A [x]-modules are there of order (x — l) 3 (x + l) 2 ? By the primary 
decomposition, every A[x]-module of order (x — l) 3 (x + l) 2 is the direct sum of primary 
modules of order (x — l) 3 and (x + l) 2 , respectively. There are three modules of order 
(x — l) 3 , described by the elementary divisors 

(x — 1, x — 1, x — 1), (x — 1, (x — l) 2 ), and (x — l) 3 ; 

there are two modules of order (x + l) 2 , described by the elementary divisors 

(x + 1, x + 1) and (x + l) 2 . 

Therefore, to isomorphism, there are six modules of order (x — l) 3 (x + l) 2 . 

The reader has probably noticed that this argument is same as that in Example 5.26 on 
page 264 classifying all abelian groups of order 72 = 2 3 3 2 . < 
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Theorem 9.17 (Fundamental Theorem of Finitely Generated Modules). If R is a 

PID, then two finitely generated R-modules are isomorphic if and only if their torsion 
submodules have the same elementary divisors and their free parts have the same rank. 

Proof By Theorem 9.10(ii), M = M' if and only if, for all primes P , the primary compo¬ 
nents Mp and M' p are isomorphic. Corollary 9.15, Proposition 9.5(ii), and Proposition 9.11 
now complete the proof. • 

Here is a second type of decomposition of a finitely generated torsion module into a 
direct sum of cyclics that does not mention primary modules. 

Proposition 9.18. If R is a PID, then every finitely generated torsion R-module M is a 
direct sum of cyclic modules 

M = R/(c i) © R/(c 2 ) 0 • • • 0 R/(c t ), 

where t > 1 and c i | c 2 \ ■ • ■ \ c t . 

Proof See Proposition 5.27. • 

Definition. If M is a finitely generated torsion /?-module, where R is a PID, and if 
M = R/(cf) 0 R/(c 2 ) © • • • © R/(c t ), 

where t > 1 and c\ \ c 2 \ ■ ■ ■ \ c t , then (cj), ( c 2 ), ..., (c t ) are called the invariant factors 
of M. 

Corollary 9.19. If M is a finitely generated torsion module over a PID R, then 

(c,) = {reR:rM = { 0}}, 

where (c ? ) is the last ideal occurring in the decomposition of M in Proposition 9.18. 

In particular, if R = k[x\ where k is afield, then c t is the polynomial of least degree 
for which c t M = {0}. 

Proof For the first statement, see Corollary 5.28. 

The second statement follows from the fact that every nonzero ideal in k[x] is generated 
by the monic polynomial of least degree in it. • 


Definition. If M is an R -module, then its exponent (or annihilator ) is the ideal 

ann(M) = {r e R : rM = {0}}. 

Corollary 9.19 computes the exponent of a finitely generated torsion module over a PID; 
it is the last invariant factor (c r ). 
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Corollary 9.20. If M is a finitely generated torsion R-module, where R is a PID, with 
invariant factors ci, ..., c t , then the order of M is (fl/=i c i)- 

Proof. See Corollary 5.30. The reader should check that the principal ideal generated by 
the product of the elementary divisors (which is the definition of the order of M ) is equal 
to the principal ideal (ri/=i c i ) • • 


Example 9.21. 

We displayed the elementary divisors of k[x]-modules of order (x — l) 3 (x + l ) 2 in Exam¬ 
ple 9.16; here are their invariant factors. 

Elementary divisors ** Invariant factors 


(x — 1, 

x — 1 , x — 1, 

X + 

1, 

X + 1) 


(x - 

- 1, u - l) 2 . 

x T 

1, 

X + 1) 



((* - l) 3 . 

X + 

1, 

X + 1) 


(x - 

- 1, x — 1, X 

-1, 

(x 

+ 1) 2 ) 



(x — 1, (x — 

l) 2 , 

(x 

+ 1) 2 ) 



((x - 

l) 3 , 

(x 

+ 1) 2 ) 



x - 1 | (x - l)(x + 1 ) | (x - l)(x + 1 ) 
(x - l)(x + 1 ) | (x - l) 2 (x + 1 ) 

X + 1 I (x - l) 3 (x + 1) 
x — 1 | x — 1 | (x — l)(x + l ) 2 
x - 1 I (x - l) 2 (x + l ) 2 
(x-l) 3 (x+l ) 2 « 


Theorem 9.22 (Invariant Factors). If R is a PID, then two finitely generated R- 
modules are isomorphic if and only if their torsion submodules have the same invariant 
factors and their free parts have the same rank. 

Proof. By Corollary 9.5(i), every finitely generated R-module M is a direct sum M — 
tM ® F. where F is free, and M = iVl' if and only if tM = tM' and F = F'. Corol¬ 
lary 9.5(ii) shows that the free parts F = M/tM and F' = M'/tM' are isomorphic, and 
a straightforward generalization of Theorem 5.32 shows that the torsion submodules are 
isomorphic. • 

The reader should now be comfortable when we say that a theorem can easily be gener¬ 
alized from abelian groups to modules over PID’s. Consequently, we will state and prove 
theorems only for abelian groups, leaving the straightforward generalizations to modules 
to the reader. 

Let us now consider modules that are not finitely generated. Recall that an abelian 
group D is divisible if, for each d e D and each positive integer n, there exists d' e D 
with d — nd'. Every quotient of a divisible group is divisible, as is every direct sum of 
divisible groups. Now Corollary 7.73 states that an abelian group D is an injective Z- 
module if and only if it is divisible, so that classifying divisible abelian groups describes 
all injective abelian groups. 


Proposition 9.23. A torsion-free abelian group D is divisible if and only if it is a vector 
space over Q. 
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Proof. If D is a vector space over Q, then it is a direct sum of copies of Q, for every 
vector space has a basis. But Q is a divisible group, and any direct sum of divisible groups 
is itself divisible. 

Let D be torsion-free and divisible; we must show that D admits scalar multiplication 
by rational numbers. Suppose that d e D and n is a positive integer. Since D is divisible, 
there exists d' e D with nd' — d [of course, d' is a candidate for (1 /n)d]. Note, since D 
is torsion-free, that d' is the unique such element: If also nd" = d, then n(d' — d”) = 0, 
so that d' — d" has finite order, and hence is 0. If m/n e Q, define ( m/n)d — md' , where 
nd' — d. It is a routine exercise for the reader to prove that this scalar multiplication is 
well-defined [if m/n = a/b , then ( m/n)d = ( a/b)d] and that the various axioms in the 
definition of vector space hold. • 

Definition. If G is an abelian group, then dG is the subgroup generated by all the divisible 
subgroups of G. 

Proposition 9.24. 

(i) For any abelian group G, the subgroup dG is the unique maximal divisible subgroup 
of G. 

(ii) Every abelian group G is a direct sum 

G = dG © R, 

where dR — [0[. Hence, R = G/dG has no nonzero divisible subgroups. 

Proof, (i) It suffices to prove that dG is divisible, for then it is obviously the largest such. 
If x e dG, then x — x\ + • • • + x t , where Xj e Dj and the Z), are divisible subgroups of 
G. If n is a positive integer, then there are y,- e Z), with Xj = ny,-, because D, is divisible. 
Hence, y = y\ + • • • + y t e dG and x = ny, so that dG is divisible. 

(ii) Since dG is divisible, it is injective, and Proposition 7.64 gives 

G = dG ® R, 

where R is a subgroup of G. If R has a nonzero divisible subgroup D, then R — I) © S for 
some subgroup S, by Proposition 7.64 on page 481. But dG © D is a divisible subgroup 
of G properly containing dG, and this contradicts part (i). • 


Definition. An abelian group G is reduced if dG = {0}; that is, G has no nonzero 
divisible subgroups. 

In Exercise 9.18 on page 665, we prove that an abelian group G is reduced if and only 
if Hom(Q, G) = {0}. 

We have just shown that G/dG is always reduced. The reader should compare the roles 
of the maximal divisible subgroup dG of a group G with that of tG, its torsion subgroup: 
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G is torsion if tG = G and it is torsion-free if tG = {0}; G is divisible if dG — G and it 
is reduced if dG = {0}. There are exact sequences 

0 -»• dG G -> G/dG 0 

and 

0 -> fG -> G -> G/tG -> 0; 

the first sequence always splits, but we will see, in Exercise 9. l(iii) on page 663, that the 
second sequence may not split. 

The following group has some remarkable properties. 

Definition. If p is a prime, a complex number z is a ptli-power root of unity if z p " = 1 
for some n > 1. The quasicyclic group (also called the Priifer group of type p°°) is 

Z(p°°) — {complex pth power roots of unity}. 

Of course, if z is a pth power root of unity, say, z p " — 1, then z is a power of the 
primitive p n th root of unity z n = e lnilpn . Note, for every integer n > 1, that the subgroup 
(z n ) is the unique subgroup of Z (p°°) of order p n , for the polynomial x p — 1 e C[x] has 
at most p" complex roots. 

Proposition 9.25. Let p be a prime. 

(i) Zip°°) is isomorphic to the p-primary component q/'Q/Z. 

(ii) Z (p°°) is a divisible p-primary abelian group. 

(iii) The subgroups of Zip 00 ) are 


{1} c (zi) c (z 2 ) c • • • C (z n ) C (z„+i> c • • • C Zip 00 ), 

and so they are well-ordered by inclusion. 3 

(iv) Zip 00 ) has the DCC on subgroups but not the ACC . 4 

Proof, (i) Define <p: ^ p Zip 00 ) —>■ Q/Z by <p: (e 27 ric p/ P np ) c p /p n P + Z, where 

c p e Z. It is easy to see that <p is an injective homomorphism. The proof that (p is 
suijective is really contained in the proof of Theorem 5.13, but here it is again. Let a/b e 
Q/Z, and write b — Y\ r P np ■ Since the numbers b/p n r are pairwise relatively prime, 
there are integers m p with 1 = ^ p m p ib/p" p ). Therefore, a/b — am p /p n p = 
(piiam p /p np )). 

(ii) Since a direct summand is always a homomorphic image. Zip 00 ) is a homomorphic 
image of the divisible group Q/Z; but every quotient of a divisible group is itself divisible. 


'The group Z(p°°) is called quasicyclic because every proper subgroup of it is cyclic. 

^Theorem 8.46, the Hopkins-Levitzki theorem, says that a ring with DCC must also have ACC. This result 
shows that the analogous result for groups is false. 
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(iii) Let S be a proper subgroup of Z (p°°). Since {z n : n > 1} generates Z (p°°), we 
may assume that z m f S for some m. It follows that zi f S for all i > m; otherwise 

p£—m 

Z m — Z( e S. If S f {0}, we claim that S contains some z„; indeed, we show that S 
contains z t- Now S must contain some element x of order p, and x = zf where 1 < c < p 
[for (zi > contains all the elements in Z (p°°) of order p]. Since p is prime, (c, p) — 1, and 
there are integers u, v with \ = cu + pv, hence, zi = z™ pv = z\ u — x u e S. Let d be 
the largest integer with zd € S. Clearly, (z,i) L S. For the reverse inclusion, let s e S. If 
,v has order p" > p d , then (s) contains z n * because (z n ) contains all the elements of order 
p" in Z (p°°). But this contradicts our observation that z.i (f S for all i > d. Hence, ,y has 
order < P d , and so s e (Zd }; therefore, S — {zd)- 

As the only proper nonzero subgroups of Z (p°°) are the groups ( z n ), it follows that the 
subgroups are well-ordered by inclusion. 

(iv) First, Z(p°°) does not have the ACC, as the chain of subgroups 

{1}C( Z1 >C( Z2 }C... 

illustrates. It is proved in Proposition A.3 of the Appendix that every strictly decreasing 
sequence in a well-ordered set is finite; it follows that Z(p°°) has the DCC on subgroups. 


Notation. If G is an abelian group and n is a positive integer, then 

G[n] = {g e G : ng = 0}. 

It is easy to see that G[n ] is a subgroup of G. Note that if p is prime, then G[p] is a 
vector space over ¥ p . 

Lemma 9.26. If G mid H are divisible p-primarv abelian groups, then G = H if and 
only if G[p] = H[p], 

Proof. If there is an isomorphism /; G —> H, then it is easy to see that its restriction 
f\G[p] is an isomorphism G[p] —»• H[p] (whose inverse is f~ l \H[p]). 

For sufficiency, assume that /: G[p] -> H[p] is an isomorphism. Composing with 
the inclusion H[p] —> H , we may assume that /; G[p] -> H. Since H is injective, / 
extends to a homomorphism F: G -> H \ we claim that any such F is an isomorphism. 

(i) F is an injection. 

If g £ G has order p, then F(g) — f(g) f 0, by hypothesis. Suppose that g has 
order p" for n > 2. If F(g) — 0, then F(p"~ [ g) — 0, and this contradicts the hypothesis, 
because p n ~ l g has order p. Therefore, F is an injection. 

(ii) F is a surjection. 

We show, by induction on n > 1, that if h e H has order p", then Ii e imf. If n — 1, 
then h e H[p] — im / C im F. For the inductive step, assume that h e H has order p" +l . 
Now p n h e H[p], so there exists g e G with F(g) = f(g) = p n h. Since G is divisible, 
there is g' e G with p n g' — g\ thus, p n (h — F(g')) = 0. By induction, there is x e G 
with F(x) = h — F(g'). Therefore, F(x + g') = h, as desired. • 
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The next theorem classifies all divisible abelian groups. 

Definition. If D is a divisible abelian group, define 

800 (D) - dim q(D/tD) 


and, for all primes p. 


8 p (D) = dimj^fD [/?]). 


Theorem 9.27. 

(i) An abelian group D is an injective Z -module if and only it is a divisible group. 

(ii) Every divisible abelian group is isomorphic to a direct sum of copies of Q and of 
copies ofL(p°°) for various primes p. 

(iii) Two divisible groups D and O' are isomorphic if and only if 800(D) = 8oo(D') and 
8 p (D) — 8 p (D' ) for all primes p. 

Proof, (i) This is proved in Corollary 7.73. 

(ii) If x e D has finite order, if n is a positive integer, and if x = ny, then y has finite order. 
It follows that if D is divisible, then its torsion subgroup tD is also divisible, and hence 


d = tD® y, 

where V is torsion-free (by Proposition 7.64 on page 481). Since every quotient of a 
divisible group is divisible, V is torsion-free and divisible, and hence it is a vector space 
over Q, by Proposition 9.23. 

Now tD is the direct sum of its primary components: tD = T p , each of which is 
p-primary and divisible, and so it suffices to prove that each T p is a direct sum of copies 
of Z (p°°). If Am\(T p [p]) — r (r may be infinite), define W to be a direct sum of r copies 
of Z (p°°), so that dim(W[p]) = r. Lemma 9.26 now shows that T p = W. 

(iii) By Proposition 9.2(ii), if D = />', then D/tD = D'/tD' and tD — tD': hence, the 
p-primary components (tD) p = (tD') p for all p. But D/tD and D'/tD' are isomorphic 
vector spaces over Q, and hence have the same dimension; moreover, the vector spaces 
(tD) p [p] and (tD') p [p] are also isomorphic, so they, too, have the same dimension. 

For the converse, write D — V © T p and />' = V' © T' p , where V and V' are 
torsion-free divisible, and T p and T' p are /^-primary divisible. By Lemma 9.26, 8 p (D ) = 
8 p(D') implies T p = T’ p , while 800 (D) — 8 oo(D') implies that the vector spaces V and 
V' are isomorphic. By Proposition 7.30, these isomorphisms can be assembled to give an 
isomorphism between D and />'. • 

We can now describe some familiar groups, but the reader may have to review a bit of 
field theory. 
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Corollary 9.28. Let k be an algebraically closed field, let k x be its multiplicative group, 
and let T be the torsion subgroup ofk x . 

(i) Ifk has characteristic 0, then T = Q/Z, and k x = (Q/Z) © V, where V is a vector 
space over Q. 

(ii) Ifk has prime characteristic p, then T = Ylq^tp Z((/°°). Ifk is the algebraic closure 
ofFp, then 5 

k x = (q°°). 

q¥=p 


Proof. Since k is algebraically closed, the polynomials x n — a have roots in k whenever 
a e k\ this says that every a has an nth root in k, which is the multiplicative way of saying 
that k x is a divisible group. An element a e k has finite order if and only if a" = 1 for 
some positive integer n; that is, a is an nth root of unity. It is easy to see that T is, itself, 
divisible. Hence, k x = T © V, by Lemma 9.24, where V is a vector space over Q (for V 
is torsion-free divisible). 

(i) If k = Q is the algebraic closure of Q, there is no loss in generality in assuming that 
k C C. Now the torsion subgroup T of k consists of all the roots of unity e 2nir , where 
r e Q. It follows easily that the map r i-^ e 2n,r is a surjection Q —> T having kernel Z, 
so that T = Q/Z. 

If k is any algebraically closed field of characteristic 0, then Q C. k implies Q C k. 
There cannot be any roots of unity in k not in Q, because Q already contains n roots of 
x n - 1. 

(ii) Let k = ¥ p . Every element a e k is algebraic over F p , and so F p (a)/F p is a finite 
field extension; say, [F p (a) ; F p ] = m for some m. Hence, |F p (a)| = p m and F p {a) is a 
finite field. Now every nonzero element in a finite field is a root of unity (for it is a root of 
x p — x for some m). But k x — T © V , where V is a vector space over Q. It follows that 
V = {0}, for every nonzero element of k is a root of unity. 

We now examine the primary components of k x . If q f p is a prime, then the poly¬ 
nomial f(x) — x q — 1 has no repeated roots (for gcd(/(x), fix)) = 1), and so there is 
some qth root of unity other than 1. Thus, the ^-primary component of k x is nontrivial, 
and so there is at least one summand isomorphic to Z (q°°). Were there more than one such 
summand, there would be more than q elements of order q , and this would provide too 
many roots for x q — 1 in the field k. Finally, there is no summand isomorphic to Z(p°°), 
for the polynomial x p — 1 = (x — 1 ) p in k[x], and so it has no roots other than 1. • 


Corollary 9.29. The following abelian groups are isomorphic: 

C x ; (Q/Z) ©R; R/Z; Y\Z(p°°)\ S 1 . 

P 

5 The additive group of k is easy to describe, for k is a vector space over ¥ p , and so it is a direct sum of 
(infinitely many) copies of F^. 
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Here S l is the circle group ; that is, the multiplicative group of all complex numbers z with 

|z| = 1. 

Proof. The reader may use Theorem 9.27, because, for every group G on the list, we 
have S p (G ) = 1 for all primes p and 800 (G) = c (the cardinal of the continuum). See 
Exercise 9.29 on page 666 for G — Y[ p Z(p°°). • 

Exercises 

9.1 Let G = Ylp( a p)> where p varies over all the primes, and (a p ) = I p . 

(i) Prove that tG — ^2 p {a p ). 

Hint. Use Exercise 5.4 on page 267. 

(ii) Prove that G/tG is a divisible group. 

(iii) Prove that tG is not a direct summand of G. 

Hint. Show that Hom(Q, G) = {0} but that Hom(Q, G/tG) 7^ {0}, and conclude that 
G/tG cannot be isomorphic to a subgroup of G. 

9.2 Let R be a PID, and let M be an P-module, not necessarily primary. Define a submodule 
S C M to be a pure submodule it S D rM = rS for all r e R. 

(i) Prove that if M is a (p)-primary module, where ( p) is a nonzero prime ideal in R, then 
a submodule SQM is pure as just defined if and only if S fl p" M = p" S for all n > 0. 

(ii) Prove that every direct summand of M is a pure submodule. 

(iii) Prove that the torsion submodule tM is a pure submodule of M. 

(iv) Prove that if M/S is torsion-free, then 5 is a pure submodule of M. 

(v) Prove that if <S is a family of pure submodules of a module M that is a chain under 

inclusion (that is, if S, S' e 5. then either S C S' or S' C 5), then 5 is a pure 

submodule of M. 

(vi) Give an example of a pure submodule that is not a direct summand. 

9.3 (i) If F is a finitely generated free P-module, where R is a PID, prove that every pure 

submodule of F is a direct summand. 

(ii) If I? is a PID and M is a finitely generated I?-module, prove that a submodule S C Mis 
a pure submodule of M if and only if 5 is a direct summand of M. 

9.4 Prove that if R is a domain that is not a field, then an P-module M that is both projective and 
injective must be {0}. 

Hint. Use Exercise 7.43 on page 487. 

9.5 If M is a torsion module over a domain R, prove that 

Horn p(M, M ) = J~”J Hom^(Mp, Mp), 

P 

where Mp is the P-primary component of M. 

9.6 (i) If G is a torsion group with p-primary components {Gp : p e P), where P is the set of 

all primes, prove that G = /(flpep Gp). 

(ii) Prove that (ripe/ 5 ^p)/(YlpeP G P ) * s torsion-free and divisible. 

Hint. Use Exercise 5.4 on page 267. 
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9.7 If M is an /{-module, where I? is a domain, and if r € R, let /i r : M —y M be multiplication 
by r; that is, p r : m rm [see Example 7.2(iii)]. 

(i) If Q = Frac(R), prove that an /{-module is a vector space over Q if and only if M is 
torsion-free and divisible. 

(ii) Prove that p r is an injection for every r / 0 if and only if M is torsion-free. 

(iii) Prove that fi r is a surjection for every r / 0 if and only if M is divisible. 

(iv) Prove that M is a vector space over Q if and only if, for every r ^ 0, the map p r : M —> 
M is an isomorphism. 

9.8 (i) Let I? be a domain, let r e R, and let M be an /{-module. If pt r : M —>• M is multipli¬ 

cation by r, prove, for every /{-module A, that the induced maps 

(/V)* : Homs(A, M) -*■ Hom^(A, M) 

and 

(g-r)* : Hom^(M, A) —>• Homjj(Af, A) 
are also multiplication by r. 

(ii) Let R be a domain with Q = Frac(/{). Using Exercise 9.7 on page 664, prove, for every 
/{-module M, that both Horn p (Q, M) and Horn/; (A/, Q) are vector spaces over Q. 

9.9 (i) If M and N are finitely generated torsion /{-modules, prove, for all primes P and all 

n > 0 , that 

U P (n, M ® /V) = Up(n, M) + U P (n, N ), 

(ii) If A, B, and C are finitely generated /{-modules, where R is a PID, prove that A ® B = 
A ® C implies B = C. 

(iii) If A and B are finitely generated /{-modules, where R is a PID, prove that A® A = 6® B 
implies A = B. 

9.10 If A is an abelian group, call a subset X of A linearly independent if, whenever JL m ;.v ( - = 0, 
where m, e Z and almost all m, = 0, then to; = 0 for all i. Define rank(A) to be the number 
of elements in a maximal linearly independent subset of A. 

(i) If X is linearly independent, prove that {X) — a direct sum of cyclic groups. 

(ii) If A is torsion, prove that rank(A) = 0. 

(iii) If A is free abelian, prove that the two notions of rank coincide [the earlier notion defined 
rank(A) as the number of elements in a basis of A]. 

(iv) Prove that rank(A) = dim(Q ®j j A), and conclude that every two maximal linearly 
independent subsets of A have the same number of elements; that is, rank(A) is well- 
defined. 

(v) If 0 —> A —> B —> C —> 0 is an exact sequence of abelian groups, prove that rank( B) = 
rank(A) + rank(C). 

9.11 ( Kulikov) If G is an abelian p-group, call a subset X C G pure-independent if X is linearly 
independent (see Exercise 9.10) and {X) is a pure subgroup. 

(i) Prove that G has a maximal pure-independent subset. 

(ii) If X is a maximal pure-independent subset of G, the subgroup B = {X} is called a basic 
subgroup of G. Prove that if B is a basic subgroup of G, then G/B is divisible. 

9.12 Prove that if G and H are torsion abelian groups, then G®g 7/ is a direct sum of cyclic groups. 
Hint. Use an exact sequence 0 —>■ B —> G —> G/B—>-0, where B is a basic subgroup, 
along with the following theorem proved in Rotman, An Introduction to Homological Algebra, 
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9.13 


9.14 

9.15 

9.16 

9.17 


9.18 

9.19 

9.20 

9.21 

9.22 

9.23 


9.24 


pages 94-96): If 0 —»■ A' — U- A -*■ A" —> 0 is an exact sequence of abelian groups and if 
i(A') is a pure subgroup of A, then, for every abelian group E, there is exactness of 

0 —»■ A! E -*■ A (g)^ E -*■ A” (&i E —> 0. 

Let M be a /’-primary //-module, where R is a PID and P = (p) is a prime ideal. Define, for 
all n > 0 , 

V P (n, M) = dim {(p n M n M[p])/(p n+1 M n M[p])^j , 

where M[p] — [m e M : pm = 0). (This invariant is introduced because we cannot subtract 
infinite cardinal numbers.) 

(i) Prove that Vp(n, M) = Up{n, M) when M is finitely generated 

(ii) Let M = J^iel Cj be a direct sum of cyclic modules Q, where / is any index set, 
possibly infinite. Prove that the number of summands Cj having order ideal ( p n ) is 
Vp(n, M), and hence it is an invariant of M. 

(iii) Let M and M' be torsion modules that are direct sums of cyclic modules. Prove that 
M = M' if and only if Vp(n, M) — Vp(n, M 1 ) for all n > 0 and all prime ideals P. 

(i) If p is a prime and G = ?(]”[/:> t i a k))- where { ak) is a cyclic group of order p , prove 
that G is an uncountable p-primary abelian group with V p (n, G) = 1 for all n > 0. 

(ii) Use Exercise 9.13 to prove that the primary group G in part (i) is not a direct sum of 
cyclic groups. 

Generalize Proposition 8.95 as follows: If R is a domain, D is a divisible //-module, and T is 
a torsion //-module with every element of finite order, then D T = {0}. 

Prove that there is an additive functor d : Ab —*■ Ab that assigns to each group G its maximal 
divisible subgroup dG. 

(i) Prove that Z( p°°) has no maximal subgroups. 

(ii) Prove that Z( p°°) = lint I p n. 

(iii) Prove that a presentation of Z (p°°) is 

( a „, n > 1 | pa\ = 0 , pa n+ j = a n for n > 1). 

Prove that an abelian group G is reduced if and only if Hom^fQ, G) = {0}. 

If 0 —> A —y B —> C —> 0 is exact and both A and C are reduced, prove that B is reduced. 
Hint. Use left exactness of Hom^CQ, ). 

If { Dj : i e /} is a family of divisible abelian groups, prove that n (€ / A' is isomorphic to a 
direct sum of divisible groups. 

Prove that Q x = I 2 © F, where F is a free abelian group of infinite rank. 

Prove that R x = Ij ® R. 

Hint. Usec r . 

(i) Prove, for every group homomorphism /: Q —> Q, that there exists r 6 Q with f(x) = 
rx for all x e Q. 

(ii) Prove that Hom^IQ, Q) = Q. 

(iii) Prove that End(Q) = Q as rings. 

For every abelian group A, prove that Hom^CA, Q) and Hom^IQ, A) are vector spaces 
over Q. 
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9.25 Prove that if G is a nonzero abelian group, then Hom^lG, Q/Z) {0}. 

9.26 Prove that an abelian group G is injective if and only if every nonzero quotient group is infinite. 

9.27 Prove that if G is an infinite abelian group all of whose proper subgroups are finite, then 
G = Z (p°°) for some prime pf 

9.28 (i) Let D — 5Z/=1 A’ where each Dj = Z (p9°) for some prime pp Prove that every 

subgroup of D has DCC. 

(ii) Prove, conversely, that if an abelian group G has DCC, then G is isomorphic to a sub¬ 
group of a direct sum of a finite number of copies of Z (p9°). 

9.29 Let G = \\ pe p Z(p°°), where P is the set of all primes. Prove that S p (G) = 1 for all p e P. 
and that Soo(G) = c, where c is the cardinal of the continuum. 

Hint. Use Exercise 9.6 on page 663 after noting that ]~[ p ^p Z(p°°) has cardinality c while 
YlpeP Zj(p°°) is countable. 

9.30 Let R = k[x, y] be the polynomial ring in two variables over a field k, and let I = (x, y). 

(i) Prove that x ® y — y ® x ^ 0 in I ® p 1. 

Hint. Show that this element has a nonzero image in (7// 2 ) ®p (I /7 2 ). 

(ii) Prove that x ® y — y ® x is a torsion element in I I, and conclude that the tensor 
product of torsion-free modules need not be torsion-free. 

9.31 Let C be the category of all finitely generated /^-modules, where R is a PID. 

(i) Compute the Grothendieck group Kq(C). 

(ii) Compute the Grothendieck group K'(C). 


9.2 Rational Canonical Forms 

In Chapter 3, we saw that if T: V -> V is a linear transformation and if A = x\,..., x n 
is a basis of V. then T determines the matrix A — xl^ lx whose /th column consists of 
the coordinate-set of T(x, ) with respect to A. If Y is another basis of V, then the matrix 
B = y[T]y may be different from A. On the other hand. Corollary 3.101 on page 176 says 
that two matrices A and B arise from the same linear transformation if and only if A and 
B are similar, that is, there exists a nonsingular matrix P with B — PAP~ l . 

Corollary 3.101. Let T : V -> V be a linear transformation on a vector space V over a 
field k. If X and Y are bases ofV, then there is a nonsingular matrix P with entries in k 
so that 

y[T]y = P{x[T]x)P~ l - 

Conversely, if B — PAP -1 , where B, A, and P are n x n matrices with entries in k and 
P is nonsingular, then there is a linear transformation T : k n —> k n and bases X and Y of 
k n such that B — y[T]y and A — x[T]x- 

We now consider how to determine whether two given matrices are similar; that is, 
whether they arise from the same linear transformation. 

6 There exist infinite nonabelian groups all of whose proper subgroups are finite. Indeed. Tarski monsters 
exist: These are infinite groups all of whose proper subgroups have prime order. 
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Example 9.30. 

Recall Example 7.1(v) on page 424: If T: V —> V is a linear transformation, where 
V is a vector space over a field k, then V admits a scalar multiplication by polynomials 
fix) e k[x]: 

m m 

fix) v = = Y^CfT’iv), 

i =0 i=0 

where T° is the identity map ly, and T l is the composite of T with itself i times if / > 1. 
We denote this k[x] -module by V 7 . 

We now show that if V is n -dimensional, then the A:[.r]-module V T is a torsion module. 
By Corollary 3.88, for each v e V, the list v, T(v), T 2 (v), ..., T n (v) must be linearly 
dependent (for it contains n + 1 vectors). Therefore, there are c, e k , not all 0, with 
^"_q Ci T 1 (v) = 0; but this says that g(x) = YH=o c ‘ x ‘ ^ es ' n the order ideal ann(t). ◄ 

There is an important special case of the construction of the /c[x]-module V r . If A is an 
n x n matrix with entries in k. define T: k" -> k" by T(v) — Av (recall that the elements 
of k " are n x 1 column vectors v, so that Av is matrix multiplication). We denote the 
£[x]-module (k") T by ( k n ) A \ thus, the action is given by 

m m 

f(x)v = £>■)„ = ^2 CiA'v. 
i =0 i=0 

We now interpret the results in the previous section about modules over general PIDs 
for the k [x] -modules V 1 and (k" ) A . If T: V -> V is a linear transformation, then a 
submodule W of V 1 is an invariant subspace', that is, W is a subspace of V with T(W) C 
W, and so the restriction T\W is a linear transformation on IV; that is, T\W: W -> W. 

Definition. If A is an r x r matrix and B is an s x s matrix, then their direct sum A® B 
is the (r + s) x (r + s) matrix 


A ® B 


A 0 
0 B 


Lemma 9.31. IfV 7 — W © IV', where W and W' are submodules, then 
bub'[T]bub' = b[T\W\b © b'[T\W] b 

where B — w\,... ,w r is a basis ofW and B' = w^,..., w' s is a basis ofW'. 

Proof. Since IV and IV' are submodules, we have T (IV) C W and T ( VV') C IV'; that is, 
the restrictions T \ W and 7’ | W' are linear transformations on IV and IV', respectively. Since 
V = W © IV', the union B U B' is a basis of V. Finally, bub'[T]bub' is a direct sum as in 
the statement of the lemma: T ( Wi ) e IV, so that it is a linear combination of w\,..., w r , 
and hence it requires no nonzero coordinates from the u)'.; similarly, T (it/-) e IV', and so 
its coordinates from the it; are all 0 . • 
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When we studied permutations, we saw that the cycle notation allowed us to recognize 
important properties that are masked by the conventional functional notation. We now ask 
whether there is an analogous way to denote matrices; more precisely, if V T is a cyclic 
/c[x]-module, can we find a basis 11 of V 7 so that the corresponding matrix b[T]b displays 
important properties of 77 


Lemma 9.32. A submodule W ofV 1 is cyclic of finite order if and only if there is a vector 
v e W and an integer s > 1 such that 

v, Tv, T 2 v, .. ., T s ~ l v 


is a basis of W. Moreover, if 


s— 1 

T s v + = 0, 

(=0 


then the order ideal ann(u) = (g), where g(x) — X s + c^-ix* 1 + • • • + cix + co; that is, 


W = k[x]/(g). 


Proof. Assume that W — (v) = {f(x)v : f(x) e k[x]}. Since V , hence W, is finite¬ 
dimensional, there is an integer .? > 1 and a linearly independent list v, Tv, T 2 v,.. 
T s ~ l v that becomes linearly dependent when we adjoin T s v. Hence, there are c,- e k with 


s— 1 

T s v + J 2 c i Tiv = 0. 

1=0 

If w e W, then w = f{x)v for some f(x) e k[x\. An easy induction on deg(/) shows 
that w lies in the subspace spanned by v, Tv, T 2 v,..., T s ~ l v, it follows that this list is a 
basis of W. 

To prove the converse, assume that there is a vector v e W and an integer ,v > 1 
such that the list v, Tv, T 2 v ,..., T s ~ { v is a basis of W. Clearly, W C (v), the cyclic 
submodule generated by v. The reverse inclusion is obvious, for we are assuming that W 
is a submodule; hence, f(x)v e W for every f(x) e k[x\. 

The polynomial g(x) lies in the order ideal ann(u). If h(x) e ann(u), the division 
algorithm gives q(x ) and r(x) with h — gq + r, where r — 0 or deg(r) < deg(g) = s. But 
r(x) e ann(u), so that r (x) — •.Lq CjxL Hence, ff jLo c jT 2 v — 0, where t < s — 1, and 

this contradicts the linear independence of the basis. Therefore, g(x) has smallest degree 
of all polynomials in ann(i>), so that ann(v) = (g). Therefore, W = k[x]/ ann(u) = 
k[x]/(g). . 


Definition. If g(x) — x + co, then its companion matrix C (g) is the lxl matrix [—co]; 
if.? >2 and g(x) = X s + c v _ix v_l + • • • + c \ x + cq, then its companion matrix C(g) is 
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the s x s matrix 


0 

0 

0 •• 

• 0 

-CO 

1 

0 

0 •• 

• 0 

-Cl 

0 

1 

0 •• 

• 0 

-C2 

0 

0 

1 •• 

• 0 

-C3 

0 

0 

0 •• 

• 1 

-C.5—| 


Obviously, we can recapture the polynomial g{x ) from the last column of the companion 
matrix C(g). 

Lemma 9.33. Let T: V -> V be a linear transformation on a vector space V over afield 
k, and let V r be a cyclic k[x]-module with generator v. If the order ideal ann(u) = (g), 
where g(x) = x s +c^_ix ,s_1 + • — \-c\x + cq, then B = v, Tv, T 2 v,..., T s ~ l v is a basis 
ofV and the matrix b[T]b is the companion matrix C(g). 

Proof. Let A — b[T\b- By definition, the first column of A consists of the coordinates 
of T(v), the second column the coordinates of T (Tv) — T 2 v, and, more generally, if 
i <5 — 1, then T(T'v) = T' +l w, that is, T sends each basis vector into the next one. 
However, on the last basis vector, T(T s ~ l v ) = T s v. But T s v = — c i T' v, where 
g(x) — x s + c > x ‘ ■ Thus, b [T]b is the companion matrix C(g). • 

Theorem 9.34. 

(i) Let A be an n x n matrix with entries in afield k. If 

(k") A = Wi ©•••© W r , 

where each Wi is cyclic, say, with order ideal (/, ), then A is similar to a direct sum 
of companion matrices 

C(/i)®...®C(/ r ). 

(ii) Every n x n matrix A over afield k is similar to a direct sum of companion matrices 

C(gi) © • • • © C(g,) 

in which the gi(x) are monicpolynomials and 

£i(*) I g 2 (x) | ••• | gt(x). 

Proof. Define V — k" and define T : V -* V by T(y) — Ay, where y is a column vector, 
(i) By Lemma 9.33, each W, has a basis If — Vi,TVj, T 2 Vi, ... and, with respect to this 
basis If, the restriction T\W, has matrix C(f ), the companion matrix of f (x). With 
respect to the basis B\ U • • • U If , the transformation T has the desired matrix, by Propo¬ 
sition 9.31. Finally, A is similar to C(/i) © • • • © C(/ r ), by Corollary 3.101. 
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(ii) By Example 9.30, the finitely generated A [.r]-module V 7 is a torsion module, and so 
the consequence of the basis theorem. Proposition 9.18, gives 

(k") A = Wj © W 2 © • • • © W t , 

where each W) is cyclic, say, with generator v,- having order ideal (g,), and gi(x) \ g 2 (x) I 
• • • | gr(x). The statement now follows from part (i). • 

Definition. A rational 7 canonical 8 form is a matrix R that is a direct sum of companion 
matrices, 

R = C(gi) © • • • © C(gt), 

where the g, (x) are monic polynomials with gi(x) | g 2 (x) | • • • | gt(x). 

If a matrix A is similar to the rational canonical form 

C(g i) © • • • © C(g t ), 

where gi(x) \ g 2 (x) | ••• | g r (x), then we say that the invariant factors of A are 
g\(x), g 2 (x) . g r (x). 

We have just proved that every n x n matrix over a field is similar to a rational canonical 
form, and so it has invariant factors. Can a matrix A have more than one list of invariant 
factors? 

Theorem 9.35. Two n x n matrices A and B with entries in a field k are similar if and 
only if they have the same invariant factors. Moreover, a matrix is similar to exactly one 
rational canonical form. 

Proof. By Corollary 3.101, A and B are similar if and only if <k" ) A = {k") B . By Theo¬ 
rem 9.22, ( k") A = {k n ) B if and only if their invariant factors are the same. 

There is only one rational canonical form for a given list of invariant factors gi (x), 
g 2 (x), ..., g t (x ), namely, C(g i) © • • • © C(g t ). If a matrix were similar to two distinct 
rational canonical forms, then it would have two different lists of invariant factors, contrary 
to the first statement of this theorem. • 

Here is a theorem analogous to Corollary 3.41, which states that if k is a subfield of a 
field K and if fix), g{x) e k[x ], then their gcd in A'[jc] is equal to their gcd in K[x). 

7 If E c R is an extension of Q, then every element e € E that is not in Q is irrational. More generally, if E/k 
is a field extension, then we call the elements of the ground field k rational. This is the usage of the adjective 
rational in rational canonical form, for all the entries of a rational canonical form lie in the field k and not in 
some extension of it. In contrast, the Jordan canonical form, to be discussed in the next section, involves the 
eigenvalues of a matrix that may not lie in k. 

s The adjective canonical originally meant something dictated by ecclesiastical law, as canonical hours being 
those times devoted to prayers. The meaning broadened to mean things of excellence, leading to the mathematical 
meaning of something given by a general rule or formula. 
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Corollary 9.36. 

(i) Let k be a subfield of a field K, and let A and B be n x n matrices with entries 
in k. If A and B are similar over K, then they are similar over k (i.e., if there is 
a nonsingular matrix P having entries in K with B — PAP -1 , then there is a 
nonsingular matrix Q having entries in k with B = QAQ -1 ). 

(ii) Ifk is the algebraic closure of a field k, then two n x n matrices A and B with entries 
in k are similar over k if and only if they are similar over k. 


Proof, (i) Suppose that g \ (x ), ..., g, (x) are the invariant factors of A regarded as a ma¬ 
trix over k, while G i(x)...., G q (x) are the invariant factors of A regarded as a matrix over 
K. By the theorem, the two lists of polynomials coincide, for both are invariant factors for 
A as a matrix over K. 

Now B has the same invariant factors as A, for they are similar over K ; since these 
invariant factors lie in k, however, A and B are similar over k. 

(ii) Immediate from part (i). • 

For example, suppose that A and B are matrices with real entries that are similar over 
the complexes; that is, if there is a nonsingular complex matrix P such that B = PAP -1 , 
then there exists a nonsingular real matrix Q such that B — QA Q -1 . 

The first step in analyzing a matrix A is to see whether it leaves any one-dimensional 
subspaces of k" invariant; that is, are there any nonzero vectors x with Ax = ax for some 
scalar a? We call a an eigenvalue of A and we call x an eigenvector of A for a. To say that 
Ax = ax for x nonzero is to say that x is a nontrivial solution of the homogeneous system 
(A — a I)x = 0; that is, A — a I is a singular matrix. But a matrix with entries in a field is 
singular if and only if its determinant is 0. Recall that the characteristic polynomial of A 
is 4ta{x) — det(x/ — A) e k[x], and so the eigenvalues of A are the roots of i/ot(x). If 
k is the algebraic closure of k, then fix (x) — YYl=\i x ~ «/)> and so the constant term of 
fi A (x) is (-1)" n a,-. On the other hand, the constant term of any polynomial /(x) is just 
/(0); setting x = 0 in i Ita{x) = det(x/ — A) gives i/^aCO) = (—1)" det(A). It follows that 
det(A) is the product of the eigenvalues. 

Here are some elementary facts about eigenvalues. 

Corollary 9.37. Let A be an n x n matrix with entries in afield k. 

(i) A is singular if and only if 0 is an eigenvalue of A. 

(ii) If a is an eigenvalue of A, then a n is an eigenvalue of A". 

(iii) If A is nonsingular and a is an eigenvalue of A, then a / 0 and a -1 is an eigenvalue 
of A -1 . 


9 We continue using familiar properties of determinants even though they will not be proved until Section 9.9. 
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Proof, (i) If A is singular, then the homogeneous system Ax = 0 has a nontrivial solution; 
that is, there is a nonzero x with Ax = 0. But this just says that Ax — Or, and so 0 is an 
eigenvalue. 

Conversely, if 0 is an eigenvalue, then 0 = det(07 — A) — ± det( A), so that det(A) = 0 
and A is singular. 

(ii) There is a nonzero vector v with Av — av. It follows by induction on n > 1 that 
A"i> = a n v. 

(iii) If x is an eigenvector for A and a, then 

x = A~ l Ax = A~ x ax — aA~ { x. 

Therefore, a 0 (because eigenvectors are nonzero) and a~ l x — A~ l x. • 

Let us return to canonical forms. 


Lemma 9.38. If g(x) e k[x], then det(x7 — C(g)) — g(x). 
Proof. If deg(g) — s > 2, then 



X 

0 

0 ••• 0 

co 


-1 

X 

0 ••• 0 

Cl 

xl - C(g ) = 

0 

-1 

x • • • 0 

C2 


_ 0 

0 

0 ••• -1 

X 

and Laplace expansion across the first row 

gives 


det(x/ - C(g)) = 

xdet(L) + (—1) 1+ " 

cq det(M), 


where L is the matrix obtained by erasing the top row and the first column, and M is 
the matrix obtained by erasing the top row and the last column. Now M is a triangular 
(s — 1) x (s — 1) matrix having —l’s on the diagonal, while L — xl — C ((g(x) — co)/x). 
By induction, det(L) = (g(x) — co)/x, while det(M) = (—l) s_1 . Therefore, 

det(x/ - C(g)) = x[(g(x) - c 0 )/x] + (-l) (1+s)+( * -1, c 0 = g(x). • 


If R = C(g\) © • • • © C(g t ) is a rational canonical form, then 

xl - R = [xl - C(gi)] © • • • © [xl - C(gr)], 

and so the lemma and Proposition 9.163, which says that det( B\ © • • • © B t ) = P|, det (Bj), 
give 

t t 

fR(x) = ]~[ fcig.fx) = ]""[ £;(*)• 
i=i f=i 

Thus, the characteristic polynomial is the product of the invariant factors; in light of Corol¬ 
lary 9.20, the characteristic polynomial of an n x n matrix A over a field k is the analog 
for (k") A of the order of a finite abelian group. 
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Example 9.39. 

We now show that similar matrices have the same characteristic polynomial. If B = 
PAP ~ l , then since xl commutes with every matrix, we have P(xl) — ( xI)P and, hence, 
P(xI)P- { — (xI)PP~ l — xl. Therefore, 

i fs(x) — det(x/ — B) 

= det (PxIP~ l - PAP ~ l ) 

= det(P[x/ - A]P _1 ) 

= det(P) det(x7 — A) det(P _1 ) 

= det(x/ — A) 

= ^a(x). 

It follows that if A is similar to C(g\) 0 • • • 0 C(g t ), then 

t 

f>A (x) - ]""[g,'(x). 

1=1 

Therefore, similar matrices have the same eigenvalues with multiplicities. -4 

Theorem 9.40 (Cayley-Hamilton). If A is an n x n matrix with characteristic polyno¬ 
mial VotCO = x" + b n - ix" _1 + • • • + b\x + bo, then i Ita(A) = 0; that is, 

A" + b n -\A n + • • • + b\ A + bol — 0. 

Proof. We may assume that A — C(g\) © • • • © C(g t ) is a rational canonical form, 
by Example 9.39, where if a (x) — g i(x) • • ■ g t (x). If we regard k n as the 7[x]-module 
(, k n ) A , then Corollary 9.19 says that gt(A)y — 0 for all y e k n . Thus, gt(A) = 0. As 
gt(x) | i/otCOi however, we have i I/a(A) — 0. • 

The Cayley-Hamilton theorem is the analog of Corollary 2.44. 

Definition. The minimum polynomial in a (x ) of an n x n matrix A is the monic polyno¬ 
mial f(x) of least degree with the property that f(A) — 0. 

Proposition 9.41. The minimum polynomial in a (x ) is a divisor of the characteristic 
polynomial if a (x), and every eigenvalue of A is a root ofmxix). 

Proof. The Cayley-Hamilton theorem shows that m a (x) | i//,t(x), while Corollary 9.19 
implies that c t (x) is the minimum polynomial of A, where c ? (x) is the invariant factor of 
A of highest degree. It follows from the fact that 

i fx(x) = cj(x) • • • c t (x), 

where ci(x) | C 2 (x) | ••• | c ? (x), that iha(x) — c t (x) is a polynomial having every 
eigenvalue of A as a root [of course, the multiplicity as a root of uia(x) may be less than 
its multiplicity as a root of the characteristic polynomial if a (x)] . • 
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Corollary 9.42. If all the eigenvalues of an n x n matrix A are distinct, then m A (x) = 
\jr A (A); that is, the minimum polynomial coincides with the characteristicpolyomial. 

Proof This is true because every root of \j/ A (x) is a root of m A (x). • 

Corollary 9.43. 

(i) An n x n matrix A is similar to a companion matrix if and only if 

m A {x) = f A (x). 

(ii) A finite abelian group G is cyclic if and only if its exponent equals its order. 

Proof (i) A companion matrix Cig) has only one invariant factor, namely, g(x); but 
Corollary 9.19 identifies the minimum polynomial as the last invariant factor. 

If m A (x) = f A (x), then A has only one invariant factor, namely, \J/ A (x), by Corol¬ 
lary 9.20. Hence, A and C(fr A {x)) have the same invariant factors, and so they are similar. 

(ii) A cyclic group of order n has only one invariant factor, namely, n; but Corollary 9.19 
identifies the exponent as the last invariant factor. 

If the exponent of G is equal to its order |G|, then G has only one invariant factor, 
namely, |G|. Hence, G and I|G| have the same invariant factors, and so they are isomorphic. 


Exercises 

9.32 (i) How many 10 x 10 matrices A over R are there, to similarity, with A 2 = /? 
(ii) How many 10 x 10 matrices A over ¥ p are there, to similarity, with A~ = /? 

Hint. The answer depends on whether p is odd or p = 2. 

9.33 Find the rational canonical forms of 


A = 

T 

2 

A 

B = 

'2 0 0" 
1 2 0 

, and C = 

'2 

1 

0 O' 
2 0 


J 

4 


0 0 3 


0 

1 2 


9.34 If A is similar to A' and B is similar to B ', prove that A ® B is similar to A! ® B'. 

9.35 Let k be a field, and let f(x) and g(x ) lie in k[x}. If g(x) | f(x) and if every root of f{x) is 
a root of g(x), show that there exists a matrix A having minimum polynomial m A (x) = g(x) 
and characteristic polynomial f A {x) = f(x). 

9.36 (i) Give an example of two nonisomorphic finite abelian groups having the same order and 

the same exponent. 

(ii) Give an example of two nonsimilar matrices having the same characteristic polynomial 
and the same minimum polynomial. 
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9.3 Jordan Canonical Forms 


If k is a finite field, then GL (n , k) is a finite group, and so every element in it has finite 
order. Consider the group-theoretic question: What is the order of A in GL(3, F 7 ), where 


A = 


0 

1 

0 


0 

0 

1 


1 

4 ? 
3 


Of course, we can compute the powers A 2 , A 3 ,..Lagrange’s theorem guarantees there 
is some n > 1 with A n = E, but this procedure for finding the order of A is rather tedious. 
We recognize A as the companion matrix of 


g(x) = a -3 — 3 a 2 — 4x — 1 = a 3 — 3 a 2 + 3 a — 1 = (a — l ) 3 


(remember that g( a) e F 7 IA]). Now A and PAP~ l are conjugates in the group GL(«, k) 
and, hence, they have the same order. But the powers of a companion matrix are compli¬ 
cated (e.g., the square of a companion matrix is not a companion matrix). We now give a 
second canonical form whose powers are easily calculated, and we shall use it to compute 
the order of A later in this section. 


Definition. A 1 x 1 Jordan block is a matrix J (a, 1) = [a]. If s > 2, then ans xs 
Jordan block is a matrix J (a, s) of the form 


/(a, s ) 


a 0 0 
1 a 0 
0 1 a 


0 0 
0 0 
0 0 


0 0 0 • • • a 0 
0 0 0 ••• la 


Here is a more compact description of a Iordan block. Let L denote the ,v x s matrix 
having all entries 0 except for l’s on the subdiagonal just below the main diagonal. In this 
notation, a Jordan block J(a, s) has the form 

J(a, s) — a I + L. 

Let us regard L as a linear transformation on k s . If e\,..., e s is the standard basis, then 
Lej = G+i if i < s while Le s = 0. It follows easily that the matrix L 2 is all 0’s except for 
l’s on the second subdiagonal below the main diagonal; L 3 is all 0 ’s except for l’s on the 
third subdiagonal; L s ~ l has 1 in the ,v, 1 position, with 0’s everywhere else, and L s — 0. 
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Lemma 9.44. If J — J {a, s) — a I + L is an s x s Jordan block, then for all m > 1, 


J 


m 



v. 


Proof Since L and a I commute (actually, a I commutes with every matrix), the collec¬ 
tion of all linear combinations of the powers of a I and the powers of L is a (commutative) 
ring, and so the binomial theorem applies. Finally, note that all terms involving L‘ for 
i > s are 0 because L s =0. • 


Example 9.45. 

Different powers of L are “disjoint”; that is, if m f n and the i j entry of L" is nonzero, 
then the ij entry of L m is zero: 


a 

0 " 

m 

a m 

0 ' 

1 

a 



a m 


a 

0 

0 

m 

a m 

0 

0 " 

1 

a 

0 

= 

ma m ~ l 

a' n 

0 

0 

1 

a 



mot"'- 1 

a m 


Lemma 9.46. If g{x) — (x — a) s , then the companion matrix C(g ) is similar to the sxs 
Jordan block J (a, s). 


Proof If T: k s -> k s is defined by z i-> C(g)z, then the proof of Lemma 9.33 gives a 
basis of k s of the form v, Tv, T 2 v, ..., r i-1 u. We claim that the list Y = yo ,..., y s - i is 
also a basis of k s , where 

yo = v, yt = (T - al)v -- y s -1 = (T — al) s ~ l v. 

It is easy to see that the list Y spans V, because T' v e (yo, ..., y, ) for all 0 < f <5 — 1. 
Since there are s elements in Y, Proposition 3.87 shows that Y is a basis. 

We now compute J — y\T]y. If j + 1 < s, then 

Tyj = T(T - al) j v 

= (T - aI) J Tv 

= (T - al) j [al + (T — aI)]v 

= a(T - al) j v + (T - al) j+l v. 


Hence, if j + 1 < s, then 


If j + 1 = 5 , then 


Tyj = uyj +yj+i- 

(T - al) j+l v — {T — aI) s v = 0, 


by the Cayley-Hamilton theorem |for i /rc( g )(x) = (x — a) J ]; hence, Ty s -1 = ay s -\- 
The matrix J is thus a Jordan block J(a, s). By Corollary 3.101, C(g) and J(a, s) are 
similar. • 



Sec. 9.3 Jordan Canonical Forms 


677 


It follows that Jordan blocks, as companion matrices, correspond to polynomials; in 
particular, J(ct, s) corresponds to (x — a) s . 

Theorem 9.47. Let A be an n x n matrix with entries in a field k. Ifk contains all the 
eigenvalues of A (in particular, ifk is algebraically closed), then A is similar to a direct 
sum of Jordan blocks. 

Proof. Instead of using the invariant factors gi | gi I • • • | gt, we are now going to use 
the elementary divisors f (x) occurring in the basis theorem itself; that is, each f (x) is a 
power of an irreducible polynomial in k[x ]. By Theorem 9.34(i), a decomposition of (, k") A 
into a direct sum of cyclic k[x]-modules W, yields a direct sum of companion matrices 

where (f ) is the order ideal of Wj, and U is similar to A. Since fix (x) — FI, fi(x), 
however, our hypothesis says that each f(x) splits over k; that is, j) (x) = (x — a, )*' for 
some si > 1, where a, is an eigenvalue of A. By the lemma, C(fi) is similar to a Jordan 
block and, by Exercise 9.34 on page 674, A is similar to a direct sum of Jordan blocks. • 

Definition. A Jordan canonical form is a direct sum of Jordan blocks. 

If a matrix A is similar to the Jordan canonical form 


J (a i, si) © • • • © J(a r , s r ), 


then we say that A has elementary divisors (x — ai) si ,..., (x — a r ) Sr . 

Theorem 9.47 says that every square matrix A having entries in a field containing all the 
eigenvalues of A is similar to a Jordan canonical form. Can a matrix be similar to several 
Jordan canonical forms? The answer is yes, but not really. 

Example 9.48. 

Let /,• be the r x r identity matrix, and let f be the sxj identity matrix. Then interchanging 
blocks in a direct sum yields a similar matrix; 


~B 

O' 


'0 

L- 

"a 

o' 

"0 

Is' 

0 

A 


_/v 

0 

0 

B 

Jr 

0 


Since every permutation is a product of transpositions, it follows that permuting the blocks 
of a matrix of the form Ai © A 2 © • • • © A, yields a matrix similar to the orginal one. 4 

Theorem 9.49. If A and B are n x n matrices over afield k containing all their eigen¬ 
values, then A and B are similar if and only if they have the same elementary divisors. 
Moreover, if a matrix A is similar to two Jordan canonical forms, say, H and H', then 
H and H' have the same Jordan blocks (i.e., H' arises from H by permuting its Jordan 
blocks). 



678 


Advanced Linear Algebra Ch. 9 


Proof. By Corollary 3.101, A and B are similar if and only if (k n ) A = {k n ) B . By Theo¬ 
rem 9.22, ( k") A = ( k n ) B if and only if their elementary divisors are the same. 

In contrast to the invariant factors, which are given in a specific order (each dividing the 
next), A determines only a set of elementary divisors, hence only a set of Jordan blocks. By 
Example 9.48, the different Jordan canonical forms obtained from a given Jordan canonical 
form by permuting its Jordan blocks are all similar. • 

Here are some applications of the Jordan canonical form. 

Proposition 9.50. If A is an n x n matrix with entries in afield k, then A is similar to its 
transpose A r . 

Proof First, Corollary 9.36(ii) allows us to assume that k contains all the eigenvalues of 
A. Now if B — PAP~ l . then B' — (P')~ { A'P'; that is, if B is similar to A, then B 1 is 
similar to A 1 . Thus, it suffices to prove that H is similar to H' for a Jordan canonical form 
//, and, by Exercise 9.34 on page 674, it is enough to show that a Jordan block J — J (a, s) 
is similar to J'. 

We illustrate the idea for J (a, 3). Let Q be the matrix having l’s on the "wrong” 
diagonal and 0’s everywhere else; notice that Q = Q~ l . 


"0 

0 

f 


a 

0 

0 


"0 

0 

r 


a 

1 

0" 

0 

1 

0 


1 

a 

0 


0 

1 

0 

= 

0 

a 

1 

1 

0 

0 


0 

1 

a 


1 

0 

0 


0 

0 

a 


We let the reader prove, in general, that Q — Q~ l and QJ (a. s)Q~ l — J {a. s)'. Per¬ 
haps the most efficient proof is to let tq,..., v s be a basis of a vector space W, to define 
Q: W -* W by Q: Vj i-> and to define J: W W by /: u, i->- avi + u,-+i for 

i < s and J: v s i-> av s . • 


Example 9.51. 

At the beginning of this section, we asked for the order of the matrix A in GL(3, F 7 ), where 


A = 


0 0 
1 0 
0 1 


1 

4 

3 


we saw that A is the companion matrix of (x — l) 3 . Since 1 Ita(x) is a power of x — 1, the 
eigenvalues of A are all equal to 1 and hence lie in F 7 ; by Lemma 9.46, A is similar to the 
Jordan block 


J = 


1 

1 

0 


0 0 
1 0 
1 1 


J 


m 



0 0 
1 0 

m 1 


By Example 9.45, 
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and it follows that J 1 — I because, in F 7 , we have 7 = 0 and ( 2 ) — 21 = 0. Hence, A 
has order 7 in GL(3, F 7 ). ◄ 

Exponentiating a matrix is used to find solutions of systems of linear differential equa¬ 
tions; it is also very useful in setting up the relation between a Lie group and its corre¬ 
sponding Lie algebra. An n x n complex matrix A consists of n 2 entries, and so A may 

2 

be regarded as a point in C" . This allows us to define convergence of a sequence of 
n x n complex matrices: A\, A 2 ,.... A 7 ..... converges to a matrix M if, for each i, j, the 
sequence of i, j entries converges. As in calculus, convergence of a series means conver¬ 
gence of the sequence of its partial sums. 

Definition. If A is an n x n complex matrix, then 

OO . 

gA _ \ ' _ _ j ^ _j_ 1^3 | 

fc] 26 

k=0 

It can be proved that this series converges for every matrix A, and that the function 
A i-»- e A is continuous; that is, if lim^oo A 7 — M, then 

lim e Ak — e M . 

k —>00 

Here are some properties of this exponentiation of matrices; we shall see that the Jordan 
canonical form allows us to compute e A . 

Proposition 9.52. Let A be an n x n complex matrix. 

(i) If P is nonsingular, then Pe A P~ l — e PAP . 

(ii) If AB — BA, then e A e B — e A+B . 

(iii) For every matrix A, the matrix e A is nonsingular, indeed, 

(e A r l =e~ A . 

(iv) If L is an n x n matrix having 1 ’s just below the main diagonal and 0 ’s elsewhere, 
then e L is a lower triangular matrix with 1 ’s on the diagonal. 

(v) If D is a diagonal matrix, say, D — diag(ai, < 22 , ■ ■ ■, oi„), then 

e D = diag(e“‘, e“ 2 ,..., e 01 "). 

(vi) Ifai, ... ,a n are the eigenvalues of A (with multiplicities ), then e 011 , ..., e a " are the 
eigenvalues of e A . 

(vii) We can compute e A . 

(viii) 7/tr(A) = 0, then det(e A ) = 1. 
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Proof, (i) We use the continuity of matrix exponentiation. 

Pe A P~ l = p( lim Y — A k )p~ l 

k=0 ' 

= lim Y — (PA k P~ l ) 

«—>oo ^— / k I v 7 

k=0 ’ 

= lim Y — [PAP~ l ) k 

>oo ' /c! v ’ 

k=0 


(ii) The coefficient of the Ath term of the power series for e A+B is 

1 k 

— (A + B) k , 
k\ 

while the coefficient of the Ath term of e A e B is 



1 


i\{k — i)\ 


A i B k ~ i 



Since A and B commute, the binomial theorem shows that both Ath coefficients are equal. 
See Exercise 9.44 on page 682 for an example where this is false if A and B do not com¬ 
mute. 

(iii) This follows immediately from part (ii), for — A and A commute and e° = I. 

(iv) The equation 

e B — I + L + + • • • + L s 1 

holds because L s — 0, and the result follows by Lemma 9.44. For example, when .v = 5, 


" 1 0 0 0 0 " 
110 0 0 
\ 1 1 0 0 
5 2 110 


(v) This is clear from the definition: 

e D = I + D + \_D 2 + \D 3 H-, 
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for 

D k = diag (a\, a\, ■ ■ ■, ct k n ). 


(vi) Since C is algebraically closed, A is similar to its Jordan canonical form J: There is 
a nonsingular matrix P with PAP~ l — J. Now A and J have the same characteristic 
polynomials, and hence the same eigenvalues with multiplicities. But / is a lower trian¬ 
gular matrix with the eigenvalues a\,..., a n of A on the diagonal, and so the definition of 
matrix exponentiation gives e J lower triangular with e a ',.... e“" on the diagonal. Since 
e A = e p JP — P~ l e J P, it follows that the eigenvalues of e A are as claimed. 

(vii) By Exercise 9.38, there is a nonsingular matrix P with PAP~ l = A + L, where A 
is a diagonal matrix, L n — 0, and A L = LA. Hence, 

Pe A P~ 1 =e PAP ~ l =e A+L =e A e L . 

But e A is computed in part (v) and e L is computed in part (iv). Hence, e A — P~ l e A e L P 
is computable. 

(viii) By definition, the trace of a matrix is the sum of its eigenvalues, while the determinant 
of a matrix is the product of the eigenvalues. Since the eigenvalues of e A are e 011 ,..., e“", 
we have 

det(e A ) = Y\e ai = = e trU) . 

i 

Hence, tr(A) = 0 implies det(e A ) = 1. • 


Exercises 

9.37 Find all n x n matrices A over a field k for which A and A~ are similar. 

9.38 (Jordan Decomposition) 

Prove that every n x n matrix A over an algebraically closed field k can be written 

A = D + N, 

where D is diagonalizable (i.e., D is similar to a diagonal matrix), N is nilpotent (i.e., N m — 
0 for some m > 1), and DN — ND. 

9.39 Give an example of an n x n matrix that is not diagonalizable. 

Hint. It is known that every symmetric real matrix is diagonalizable. Alternatively, a rotation 
(not the identity) about the origin on R 2 sends no line through the origin into itself. 

9.40 (i) Prove that all the eigenvalues of a nilpotent matrix are 0. 

(ii) Use the Jordan form to prove the converse: If all the eigenvalues of a matrix A are 0, 
then A is nilpotent. (This result also follows from the Cayley-Hamilton theorem.) 

9.41 How many similarity classes of 6 x 6 nilpotent real matrices are there? 

9.42 If A is a nonsingular matrix and A is similar to B, prove that A -1 is similar to B~ l . 
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9.43 (i) Prove that every nilpotent matrix N is similar to a strictly lower triangular matrix (i.e., 

all entries on and above the diagonal are 0 ). 

(ii) If IV is a nilpotent matrix, prove that / + N is nonsingular. 

9.44 Let 

. _ r 1 °i , d _ r° i_ 

A |_o oj and B |_0 0 ' 

Prove that e A e B ^ e B e A , and conclude that e A e B ^ e A+B . 

9.45 How many conjugacy classes are there in GL(3, F 7 )? 

9.46 We know that PSL(3, F 4 ) is a simple group of order 20 160 = 78 !. Now Ag contains an 
element of order 15. namely, (1 2 3 4 5 )(6 7 8 ). Prove that PSL(3, F 4 ) has no element of 
order 15, and conclude that PSL(3, F 4 ) ^ Ag. 

Hint. Use Corollary 9.36 to replace F 4 by a larger field containing any needed eigenvalues of 
a matrix. Compute the order [in the group PSL(3, F 4 )] of the possible Jordan canonical forms 

a 0 0 a 0 0 a 0 0 

A=1 a 0,6=0 b 0 , and C = 0 b 0 . 

_0 1 flj |_0 1 bj |_0 0 c _ 


9.4 Smith Normal Forms 

There is a defect in our account of canonical forms: How do we find the invariant factors of 
a given matrix A? The coming discussion will give an algorithm for computing its invariant 
factors. In particular, it will compute the minimum polynomial of A. 

In Chapter 3, we showed that a linear transformation T: V —»• W between finite¬ 
dimensional vector spaces determines a matrix, once bases Y of V and Z of W are chosen, 
and Proposition 3.98 shows that matrix multiplication arises as the matrix determined by 
the composite of two linear transformations. We now generalize that calculation to /(-maps 
between free /(-modules, where R is any commutative ring. 

Definition. Let R be a commutative ring, and let T : R' —> R" be an /(-map, where /(' 
and R n are free /(-modules of ranks t and n, respectively. If Y = y 1 ..... y, is a basis of 
R' and Z = zi ,..., z n is a basis of R n , then 

z[T]y — [a u ] 

is the nxl matrix over R whose zth column, for all i, consists of the coordinates of T (>’, ); 
that is, 

n 

T(yi) = J2 a J iZ J- 
7=1 

We now compare matrices for an /(-homomorphism T arising from different choices 
of bases in R 1 and in R n . The next proposition generalizes Corollary 3.101 from vector 
spaces to modules over a commutative ring. 
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Proposition 9.53. Let R he a commutative ring, let R' and R" be free R-modules of 
ranks t and n, respectively, and let T : R 1 —>• R n be an R-homomorphism. Let Y and Y’ 
be bases of R', and let Z and Z' be bases of R n . IfT = z[T]y and F' = z'\T\y', then 
there exist invertible matrices P and Q, where P is t x t and Q is n x n, with 

r' = qvp~ 1 . 

Conversely, ifY and T' are n xt matrices over R with T' = Qr P _1 for some invertible 
matrices P and Q, then there is an R-homomorphism 7 : R' —> R", bases Y and Y' of R', 
and bases Z and Z' of R", respectively, such that T = z[T]y and T' = z'[T]yi. 

Proof. This is the same calculation we did in Proposition 3.101 on page 176 when we 
applied the formula 

z[Siyy[T] x = Z [ST] X , 

where T: V —»■ V' and S: V' —> V" and X , T, and Z are bases of V, V\ and V", 
respectively. Note that the original proof never uses the inverse of any matrix entry, so that 
the earlier hypothesis that the entries lie in a field is much too strong; they may lie in any 
commutative ring. • 

Definition. Two n x t matrices T and F' with entries in a commutative ring R are 
R-equivalent if there are invertible matrices P and Q with entries in R with 

r' = qyp 

(writing P is just as general as writing P~ l ). 

Of course, equivalence as just defined is an equivalence relation on the set of all (rect¬ 
angular) n xt matrices over R. 

Proposition 9.54. If R is a commutative ring, then finite presentations of (finitely pre¬ 
sented) R-modules M and M' give exact sequences 

fi'4s"4M^0 and R '' 4 R"' K M' -* 0, 

and choices of bases Y, Y’ of R' and Z, Z 1 of R" give matrices Y = zMy and T' = 
z'[L']yi. Ift’ = t, n' = n, and Y and F' are R-equivalent, then M = M'. 

Proof. Since F and F' are P-equivalent, there are invertible matrices P and Q with F' = 
QYP~ l \ now P determines an /7-isomorphism 6 : R n —> R", and Q determines an R- 
isomorphism <p : R r —> R 1 . The equation F' = Q Y P~ 1 implies that the following diagram 
commutes; 

R> —^ R n - JL -> M ->- 0 

B : v 

y 

R' - 7 ^ R n - M' -=- 0 

y it' 
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Define an A 1 -map v: M —» M' as follows: if m e M , then surjectivity of n gives an 
element u e R n with :r(u) — nr, set v{m) — tc'6{u). Proposition 8.93 is a proof by 
diagram chasing that v is a well-defined isomorphism. • 

Proposition 9.54 is virtually useless; for most commutative rings R , there is no way 
to determine whether matrices F and T' with entries in R are R -equivalent. However, 
when R is a euclidean ring, we will be able to use the criterion in the proposition to find a 
computable normal form of a matrix. 

If T: V —»• V is a linear transformation on a vector space V over a field k, the next 
theorem gives a finite presentation of the k[x]-module V 7 . The next definition creates a 
free k[x]-module from any vector space V over a field k; the construction is based on the 
formal definition, in Chapter 3, of k[x ] as sequences in k almost all of whose coordinates 
are zero. 

Definition. If V is a ^-module over a commutative ring k, define 

V[x] = £Vj, 

i> 0 


where V, = V for all i. In more detail, we denote the elements of V,- by x‘ v, where v e V 
(so that x l merely marks the coordinate position; in particular, we let x°v — v, so that 
Vo = V). Thus, each element u e V[x] has a unique expression of the form 


= E 


X Vi 


i> 0 


where Vj e V and almost all v,- = 0. The k-module V[x ] is a k[x]-module if we define 

= 'y' i x' +l v i . 

i i 

For readers comfortable with tensor product, the module V[x] just constructed is merely 
k[jr] <S>k V. Indeed, the next lemma uses the fact that tensor product commutes with direct 
sums, for a subset B is a basis of V if and only if V = kb is a direct sum. 

Lemma 9.55. If V is a free k-module over a commutative ring k, then V [x] is a free 
k[x]-module. In fact, a basis E ofV is also a basis of V[x] as a free k[x]-module. 

Proof. Each element u e V[x] has an expression of the form u = JV> 0 x l Vi. Since 
x'ei ,..., x‘e„ is a basis for V,- = x' V, each i>; = / a ji e j f° r a ji e k. Collecting terms, 

u = fi(x)e\ H-h f„(x)e n . 


where fj(x) — oijo + aj\x + ■ ■ ■ + a/ t x' for some t. 
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To prove uniqueness of this expression, suppose that 

gl(x)ei H-1 - g„(x)e n = 0, 

where gj(x) = fjo + fij\x + ■ ■ ■ + fijtx 1 for some t. For each i, this gives the equation 
fijiX 1 6j — 0 in Vi. Since x l e\,..., x'e„ is a basis of Vj, it is linearly independent, and 
so all Pji — 0. • 

We can now give a finite presentation of V 1 . Viewing V[x] as sequences (rather than 
as k[x~\ ®k V) is convenient in this proof. 

Theorem 9.56 (Characteristic Sequence). 

(i) If V is a finitely generated k-module over a commutative ring k and T: V —> V is a 
k-homomorphism, then there is an exact sequence of k[x]-modules 

0 -» V[x] 4 V[x] 4 V T -> 0 , 

where, for all i > 0 and all v e V, X(x‘v) — x l+1 v — x' Tv and nix 1 v ) = T' v. 

(ii) If A is an n x n matrix over k and E is the standard basis E = e \,..., e„ ofk n , then 
the matrix eYI-\e arising from the presentation of (k n ) A in part (i) is xl — A. 


Proof, (i) It is easily checked that both X and jt are well-defined /.'-maps; they are also 
A:[x]-maps; for example, 

A(x(x'u)) = xX(x'v), 
because both equal x ,+2 v — x ,+l T v. 

(1) n is suijective. If v e V T , then 7r(i>) = T°v = v. 

(2) im X C kerjr. 

ttX(x‘v) = 7t(x i+l v -x''Tv) = T i+1 v - T i+1 v = 0. 

(3) kerjr C imA. If u — Yl'iLox'vi e ker7r, then Y^T=0 T l Vi — 0. Hence, 

m m 

u — ’y^x'vj — ^2 T ,y i 
1=0 1=0 

m 

= Y^( X ‘ Vi ~ T ‘ Vi )’ 

i = l 

because 

0 T' 0 A 

X u i>0 -T Vo — V 0 - Vo — 0. 

For any i > 1, we are going to rewrite the /th summand x l Vj — T' v, of u as a telescoping 



686 


Advanced Linear Algebra Ch. 9 


sum, each of whose terms lies in im A; this will suffice to prove that kerjr C im X. 
i —1 i— 1 

Y,Hx i - l - j T J v i ) = J2 (x i ~ j T j v i - x i ~ x ~ i T j+l v 


7=0 


7=0 

= (x'u* — x'~ l Tvi ) + (x'~ l T v, — x'~ 2 T 2 Vi)+ 

• • • + (xT'~ l Vi - T‘vi) 
i— 1 

= + [T>— r'u, 

7=1 

= x'vi — T l Vi. 


(4) A is injective. As a ^-module, V[x] is a direct sum of submodules Vj, and, for all 
m > 0, y,„ = V via f m : x m v i-> v; it follows that if x m v ^ 0, then f~+ l fm(.x m v) = 
x m+ 1 t> ^ 0 . 

Suppose now that 

m 

u — x‘Vi e kcr a, 

i=0 

where x m v m ^ 0; it follows that x m+l v m ^ 0. But 

m m 

0 = X(u) — k(Yx'vj) — £>‘ +1 u,- — x'T Vj). 

i= 0 i=0 


Therefore, 

m— 1 m 

X m+1 v m = - £(x' + 1 u,-) + Y_ x l TVj. 

i =0 1=0 

Thus, x m+1 u m e y m+ i fl YI'i'Lq Vi — f0}’ so that x" 1+1 u,„ = 0. But we have seen that 
x m v ^ 0 implies x m+1 v m ^ 0, so that this contradiction gives ker A = {0}. 

(ii) In the notation of part (i), let V — k" and let T: k n —> k n be given by v i-> Av, where 
v is an n x 1 column vector. If e\, ..., e n is the standard basis of k'\ then e\, ..., e n is a 
basis of the free &[x]-module V[x], and so it suffices to find the matrix of X relative to this 
basis. Now 

X(ei) = xej — T ei — xe; — Y. a .ji e i • 

7 

Since [<5,y] = /, where Sjj is the Kronecker delta, we have 

xej — Y^ajiej — Y^x&jtej — ’Y.ajiej 

j j j 

= ~ a ji) e j * 

j 

Therefore, the matrix of X is xl — A. • 
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Corollary 9.57. Two n x n matrices A and B over a field k are similar if and only if the 
matrices T — xl — A and T' — xl — B are k[xf-equivalent. 

Proof. If A is similar to B , there is a nonsingular matrix P with entries in k such that 
B = PAP~ l . But 


P(xl - A)P~ l = xl - PAP~ l = xl - B, 

because the scalar matrix xl commutes with P (it commutes with every matrix). Thus, 
xl — A and xl — B are k[x] -equivalent. 

Conversely, suppose that the matrices xl — A and xl — B are k[x] -equivalent. By 
Theorem 9.56(ii), (k n ) A and (k") B are finitely presented A: [x]-modules with presentations 
that give the matrices xl — A and xl — B, respectively. Now Proposition 9.54 shows that 
(k n ) A = (k") R , and so A and B are similar, by Corollary 7.4. • 

Corollary 9.57 reduces the question of similarity of matrices over a field k to a problem 
of equivalence of matrices over k[x~\. Fortunately, Gaussian elimination, a method for 
solving systems of linear equations whose coefficients lie in a field, can be adapted here. 
We now generalize the ingredients of Gaussian elimination from matrices over fields to 
matrices over arbitrary commutative rings. 

In what follows, we denote the /th row of a matrix A by ROW(l) and the yth column by 
COL(y'). 


Definition. There are three elementary row operations on a matrix A with entries in a 
commutative ring R: 

Type I: Multiply ROW(y) by a unit u e R. 

Type II: Replace ROW(/) by ROW(i) + cyROW(y), where j f i and cj € R. 

Type III: Interchange ROW(i) and ROW (j). 

There are analogous elementary column operations. 

Notice that an operation of type III (i.e., an interchange) can be accomplished by oper¬ 
ations of the other two types. We indicate this schematically. 


a 

c 



b — d 
d 



b — d 
b 


—c 

a 



d 

b 


Definition. An elementary matrix is the matrix obtained from the identity matrix I by 
applying an elementary row 10 operation to it. 

Thus, there are three types of elementary matrix. It is shown in elementary linear algebra 
courses (and it is easy to prove) that performing an elementary operation is the same as 
multiplying by an elementary matrix. In more detail, if L is an elementary matrix of type 
I, II, or III, then applying an elementary row operation of this type to a matrix A gives 

10 Applying elementary column operations to I gives the same collection of elementary matrices. 



688 


Advanced Linear Algebra Ch. 9 


the matrix LA, whereas applying the corresponding elementary column operation to A 
gives the matrix AL. It is also easy to see that every elementary matrix is invertible, and its 
inverse is elementary of the same type. It follows that every product of elementary matrices 
is invertible. 


Definition. If R is a commutative ring, then a matrix F' is Gaussian equivalent to a 
matrix F if there is a sequence of elementary row and column operations 

r = r 0 Li -*-> r r = r'. 

Gaussian equivalence is an equivalence relation on the family of all n x t matrices 
over R. 

It follows that if F' is Gaussian equivalent to T, then there are matrices P and Q, each 
a product of elementary matrices, with F' = PTQ. Recall that two matrices F' and F are 
R-equivalent if there are invertible matrices P and Q with T' = PT Q. It follows that if F' 
is Gaussian equivalent to T, then F' and F are R-equivalent. We shall see that the converse 
is true when R is euclidean. 


Theorem 9.58 (Smith 11 Normal Form). Every nonzero n x t matrix F with entries in 
a euclidean ring R is Gaussian equivalent to a matrix of the form 

E 0" 

0 0 ’ 


where £ = diag(cri, ..., a q ) and o\ \ 02 \ ■ ■ ■ \ a q are nonzero (the lower blocks ofO’s or 
the right blocks ofO’s may not be present). 


Proof The proof is by induction on the number n > 1 of rows of F. If it e R, let 
(Her ) denote its degree in the euclidean ring R. Among all the entries of matrices Gaussian 
equivalent to T, let o\ have the smallest degree, and let A be a matrix Gaussian equivalent 
to T that has eri as an entry, say, in position k, £. 

We claim that a\ | t]kj for all q jy in ROW (k) of A. Otherwise, there is j l and an 
equation qtcj — ko\ + p, where 3 (p) < 3(<ti). Adding (—at)COL(£) to COL(y) gives a 
matrix A' having p as an entry. But A' is Gaussian equivalent to F and has an entry p 
whose degree is smaller than 3 (or), a contradiction. The same argument shows that o\ 
divides any entry in its column. We claim that o\ divides every entry of A'. Let a be an 

entry not in en’s row or column; schematically, we have | ^ 

V c °L 

c = Dai. Replace row(1) by row(1)+(1 — m)row(2) = {a + (1 — u)c <ji). As above, 
o 1 | a + (1 — u)c. Since o\ \ c, we have a\ \ a. 

Let us return to A, a matrix Gaussian equivalent to F that contains cri as an entry. By 
interchanges, there is a matrix A' that is Gaussian equivalent to T and that has cti in the 1,1 


where b = ua\ and 


11 This theorem and the corresponding uniqueness result, soon to be proved, were found by H. J. S. Smith in 
1861 . 
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position. If rj\j is another entry in the first row, then rj \ ; = KjO\, and adding (—/Cy)COL(l) 
to COL(j) gives a new matrix whose 1, j entry is 0. Thus, the matrix A is Gaussian 
equivalent to a matrix having o\ in the 1,1 position and with 0’s in the rest of the first row. 
This completes the base step n = 1 of the induction, for we have just shown that a nonzero 
1 x t matrix is Gaussian equivalent to [cri 0 ... 0], Furthermore, since o\ divides all entries 
in the first column, F is Gaussian equivalent to a matrix having all 0’s in the rest of the first 
column as well; thus, F is Gaussian equivalent to a matrix of the form 

crj 0 

0 Q ' 

By induction, the matrix Q is Gaussian equivalent to a matrix 

"£' O' 

0 0 ’ 


where £' = diag(cr 2 , ■ ■ ■, er q ) and 02 | <73 | • • • | cr q . Hence, T is Gaussian equivalent to 


G 1 
0 
0 


0 

£' 

0 


0 

0 

0 


. It remains to observe that n\ \ 02 ', this follows from our initial remarks. 


for the ultimate matrix is Gaussian equivalent to T and contains cri as an entry. • 


Definition. The matrix 
form of T. 


£ 0 
0 0 


in the statement of the theorem is called a Smith normal 


Thus, the theorem states that every nonzero (rectangular) matrix with entries in a eu¬ 
clidean ring R is Gaussian equivalent to a Smith normal form. 


Corollary 9.59. Let R be a euclidean ring. 

(i) Every invertible n x n matrix T with entries in R is a product of elementary matrices. 

(ii) Two matrices T and F' over R are R-equivalent if and only if they are Gaussian 
equivalent. 

Proof, (i) We now know that T is Gaussian equivalent to a Smith normal form [ q 0 ] ’ 
where £ is diagonal. Since T is a (square) invertible matrix, there can be no blocks of 0’s, 
and so F is Gaussian equivalent to £; that is, there are matrices P and Q that are products 
of elementary matrices such that 

PTQ = £ = diag(ai-- a„). 

Hence, Y = P~ l T,Q~ l . Now the inverse of an elementary matrix is again elementary, so 
that P~ l and Q~ x are products of elementary matrices. Since £ is invertible, det(£) = 
CT] • • ■ o n is a unit in R. It follows that each cr, is a unit, and so £ is a product of n 
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elementary matrices [arising from the elementary operations of multiplying ROW(7) by the 
unit <7j ]. 

(ii) It is always true that if F' and T are Gaussian equivalent, then they are R -equivalent, 
for if F' = P T Q, where P and Q are products of elementary matrices, then P and Q are 
invertible. Conversely, if T' is R -equivalent to F, then F' = PL Q, where P and Q are 
invertible, and part (i) shows that F' and F are Gaussian equivalent. • 

There are examples showing that this proposition may be false for PIDs that are not 
euclidean . 12 Investigating this phenomenon was important in the beginnings of Algebraic 
K-Theory (see Milnor, Introduction to Algebraic K-Theory). 

Theorem 9.60 (Simultaneous Bases). Let R be a euclidean ring, let F be a finitely gen¬ 
erated free R-module, and let S be a submodule of F. Then there exists a basis z\, ... ,Z n 

of F and nonzero o\ . o q in R, where 0 < q < n, such that o\ \ ■■■ \ o q and 

ct\zi, • ■ •, QqZ q is a basis of S. 

Proof If M — F/S, then Corollary 9.4 shows that S is free of rank < n, and so 

is a presentation of M, where X is the inclusion. Now any choice of bases of S and F 
associates a matrix L to X (note that L may be rectangular). According to Proposition 9.53, 
there are new bases of 5 and F relative to which F is //-equivalent to a Smith normal form, 
and these new bases are as described in the theorem. • 

Corollary 9.61. Let F be an n x n matrix with entries in a euclidean ring R. 

(i) If r is R-equivalent to a Smith normal form diag (p\,cr q ) ® 0, then those 
<Ti, ... ,o q that are not units are the invariant factors ofL. 

(ii) //diag(? 7 i, .... i ) s ) © 0 is another Smith normal form ofL, then s — q and there are 
units Uj with = M,cr, for all i\ that is, the diagonal entries are associates. 

Proof (i) We may regard T as the matrix associated to an //-map X : R" —> R" relative 
to some choice of bases. Let M — R' 1 / im X. If diag(ai,..., o q ) © 0 is a Smith normal 
form of F, then there are bases yi, .... y„ of R n and zi, ■.., z. n of R" with X(y i) = 
oqzi,..., X(y q ) = cr q z q and X(yj) = 0 for all yj with j > q, if any. If <y s is the first 07 
that is not a unit, then 

M = R n ~ q © R/(o s ) © • • • © R/{o q ), 

a direct sum of cyclic modules for which cr s \ ■ ■ ■ \ o q . The fundamental theorem of finitely 
generated //-modules identifies o s ,... ,o q as the invariant factors of M. 

(ii) Part (i) proves the essential uniqueness of the Smith normal form, for the invariant 
factors, being generators of order ideals, are only determined up to associates. • 

'“There is a version for general PIDs obtained by augmenting the collection of elementary matrices by sec¬ 
ondary matrices; see Exercise 9.50 on page 694. 
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With a slight abuse of language, we may now speak of the Smith normal form of a 
matrix. 

Corollary 9.62. 7vvo n x n matrices A and B over a field k are similar if and only if 
xl — A and xl — B have the same Smith normal form over k[x]. 

Proof. By Theorem 9.57, A and B are similar if and only if xl — A is k[x]-equivalent to 
xl — B, and Corollary 9.61 shows that xl — A and xl — B are k[x]-equivalent if and only 
if they have the same Smith normal form. • 


Corollary 9.63. Let F be a finitely generated free abelian group, and let S be a subgroup 
of F having finite index ; let y\,..., y n be a basis of F, let zi, . ■ ■, z. n be a basis of S, and 
let A — [aij] be the n x n matrix with Zi — j cijiyj • Then 


[F : 5] = | det(A)|. 

Proof. Changing bases of 5 and of F replaces A by a matrix B that is Z-equivalent to it: 

B = QAP , 

where Q and P are invertible matrices with entries in Z. Since the only units in Z are 1 
and —1, we have | det(/i) = | det(A)|. In particular, if we choose 11 to he a Smith normal 

form, then B = diag(gi, .... g n ), and so | det(fi) = g\ ■ ■ ■ g n . But gi, - g n are the 

invariant factors of F/S; by Corollary 5.30, their product is the order of F/S, which is the 
index [F : 5]. • 

We have not yet kept our promise to give an algorithm computing the invariant factors 
of a matrix with entries in a field. 


Theorem 9.64. Let £ = diag(eri, ..., o q ) be the diagonal block in the Smith normal 
form of a matrix T with entries in a euclidean ring R. If we define c/,(T) by c/o(T) = 1 
and, for i > 0 , 

di(D — gcd( all i x i minors o/T), 


then, for all i > 1 , 


at = di{F)/di-\{T). 


Proof. We are going to show that if T and F' are A 1 -equivalent, then 


di(T) ~ di(T') 


for all i, where we write a ~ b to denote a and b being associates in R. This will suffice 
to prove the theorem, for if F' is the Smith normal form of V whose diagonal block is 
diagloq,..., o q ), then djiT') = o\cr 2 • • • cr ( -. Hence, 


Oiix) = ~ di(T)/di-i(T). 
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By Proposition 9.59, it suffices to prove that 

di (T) ~ di (LV) and d t (D ~ dj ( TL ) 

for every elementary matrix L. Indeed, it suffices to prove that dj(VL) ~ d± (T), because 
di (r L) = d, ([T L]’) — di (L'r') [the i x i submatrices of are the transposes of the i x i 
submatrices of F; now use the facts that L' is elementary and that, for every square matrix 
M, we have det(M f ) = det(Af)]. 

As a final simplification, it suffices to consider only elementary operations of types I 
and II, for we have seen on page 687 that an operation of type III, interchanging two rows, 
can be accomplished using the other two types. 

L is of type I. 

If we multiply ROW (£) of T by a unit u , then an i x i submatrix either remains unchanged 
or one of its rows is multiplied by u. In the first case, the minor, namely, its determinant, 
is unchanged; in the second case, the minor is changed by a unit. Therefore, every i x i 
minor of LT is an associate of the corresponding i x i minor of T, and so dj (FT) ~ c/, (T). 

L is of type II. 

If L replaces ROW(f) by ROW(f) + rROWfj ), then only ROW(f) of T is changed. Thus, 
an i x i submatrix of F either does not involve this row or it does. In the first case, the 
corresponding minor of LT is unchanged; in the second case, it has the form m + rm ', 
where m and in' are i x i minors of F (for det is a multilinear function of the rows of a 
matrix). It follows that d,(r) I di(L F), for di(F) \ m and djiV) \ m'. Since L~ l is also an 
elementary matrix of type II, this argument shows that dj(L~ l (L F)) | c/,( LI ’). Of course, 
L _1 (LF) = T, so that c/,(T) and dj{LY) divide each other. As R is a domain, we have 
di(LV) ~ di(T). • 


Theorem 9.65. There is an algorithm to compute the elementary divisors of any square 
matrix A with entries in a field k. 

Proof. By Corollary 9.62, it suffices to find a Smith normal form for I’ = xl — A over 
the ring k[x\, by Corollary 9.61, the invariant factors of A are those diagonal entries which 
are not units. 

There are two algorithms: compute (xI — A) for all i (of course, this is not a very 
efficient algorithm for large matrices); put xl — A into Smith normal form using Gaussian 
elimination over k[x]. The reader should now have no difficulty in writing a program to 
compute the elementary divisors. • 


Example 9.66. 

Find the invariant factors, over Q, of 

'2 
1 
0 



A = 


3 

2 

0 
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We are going to use a combination of the two modes of attack: Gaussian elimination and 
gcd’s of minors. Now 

~x — 2 -3 -1 ' 

xl — A = —1 x — 2 —1 

0 0 x + 4 

It is plain that gi = 1, for it is the gcd of all the entries of A, some of which are nonzero 
constants. Interchange ROW(l) and ROW(2), and then change sign in the top row to obtain 

"1 —x + 2 1 

x — 2 -3 -1 . 

0 0 x + 4 

Add — (x — 2 )row( 1) to row( 2) to obtain 

"1 —x + 2 1 1 [1 0 0 ' 

0 x 2 — 4x + 1 — x + 1 —> 0 x 2 — 4x + 1 — x + 1 . 

_0 0 x + 4 _ _0 0 x + 4 _ 


The gcd of the entries in the 2x2 submatrix 

x 2 — 4x + 1 —x + 1 
0 x + 4 

is 1, for —x + 1 and x + 4 are distinct irreducibles, and so g 2 = 1. We have shown that 
there is only one invariant factor of A. namely, (x 2 — 4x + l)(x + 4) = x 3 — 15x + 4, and 
it must be the characteristic polynomial of A. It follows that the characteristic and minimal 
polynomials of A coincide, and Corollary 9.43 shows that the rational canonical form of A 


is 


0 0 
1 0 
0 1 



Example 9.67. 

Find the abelian group G having generators a, b, c and relations 

In A 5b 4~ 2c — 0 
3a + 3b =0 
13a + lib + 2c — 0. 

Using elementary operations over Z, we find the Smith normal form of the matrix of rela¬ 
tions: 

7 5 2l [10 0 

3 3 0 0 6 0 . 

13 11 2j |_0 0 0_ 

It follows that G = (Z/1Z) © (Z/ 6 Z) © (Z/OZ). Simplifying, G = Ig © Z. ◄ 
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Exercises 


9.47 Find the invariant factors, over Q, of the matrix 

'-4 6 3' 

-3 5 4 

_4 -5 3 

9.48 Find the invariant factors, over Q, of the matrix 


-6 

2 

-5 

-19' 

2 

0 

1 

5 

-2 

1 

0 

-5 

3 

-1 

2 

9 


9.49 If k is a field, prove that there is an additive exact functor ^-Mod —»■ £[>]Mod taking any vector 
space V to V[x ]. [See Theorem 9.56(ii).] 

9.50 Let R be a PID, and let a, b e R. 


(i) If d is the gcd of a and b , prove that there is a 2 x 2 matrix Q — 
det(2) = 1 so that 

Q 


with 


& 

* 


^3 


l b 



K 

*J 


where d \ d'. 

Hint. If d = xa + yb, define x’ = b/d and y 1 = —a/d. 
(ii) Call an n x n matrix U secondary if it can be partitioned 


Q 
o / 


where Q is a 2 x 2 matrix of determinant 1. Prove that every n x n matrix A with entries 
in a PID can be transformed into a Smith canonical form by a sequence of elementary 
and secondary matrices. 


9.5 Bilinear Forms 

In this section, k will be a field and V will be a vector space over k, usually finite¬ 
dimensional. Even though we have not yet proved the basic theorems about determinants 
(they will be proved in Section 9.9), we continue to use their familiar properties. 

Definition. A bilinear form (or inner product) on V is a bilinear function 

/: V x V -> k. 

The ordered pair (V, /) is called an inner product space. 
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Of course, ( k n , /) is an inner product space if / is the familiar dot product 

f(u, v ) = y, UjVi, 
i 


where u — (mi, ..., «„)' and v = (vi, ■ ■ ■ , v„)’ (the superscript t denotes transpose; re¬ 
member that the elements of k" are n x 1 column vectors). In terms of matrix multiplication, 
we have 

f (u, v) = u { v. 

There are two types of bilinear forms of special interest. 

Definition. A bilinear form /: V x V —»• k is symmetric if 

/(«, v) = f(v, u) 

for all u, v e V ; we call an inner product space (V, /) a symmetric space when / is 
symmetric. 

A bilinear form / is alternating if / (v , v) = 0 for all v e V; we call an inner product 
space (V , /) an alternating space when / is alternating. 


Example 9.68. 

(i) If V — k 2 and its elements are viewed as column vectors, then det: V x V -* k, given 
by 


det 


a c 
b d 


— ad — be. 


is an example of an alternating bilinear form. 

(ii) In Chapter 8 , we defined a (Hermitian) form on the complex vector space cf(G) of all 
class functions on a finite group G. More generally, define a function /: C" x C" -> C 
by 

f(u, v) = 'Y^ujVj, 
j 

where u — (mi, .... u n ) r , v — (i>i, ..., v,,)', and c denotes the complex conjugate of 
a complex number c. Such a function is not C-bilinear because f(u,cv) — cf(u, v) 
instead of cf(u, v). Hermitian forms are examples of sesquilinearforms', such forms can 
be constructed over any field k equipped with an automorphism of order 2 (to play the role 
of complex conjugation). ■* 


Every bilinear form can be expressed in terms of symmetric and alternating bilinear 
forms. 
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Proposition 9.69. Let k be a field of characteristic f 2, and let f be a bilinear form 
defined on a vector space V over k. Then there are unique bilinear forms f s and f a , where 
f s is symmetric and f a is alternating, such that f = f s + fa- 

Proof By hypothesis, ^ e k, and so we may define 

f s (u, v) = j(/(M, v) + f(v, u )) 

and 

fa(u, V ) = [(fill, V ) - f(V,u)). 

It is clear that / = fs + fa- that f s is symmetric, and that f a is alternating. Let us 
prove uniqueness. If / = f ' + ff where f ' is symmetric and / ( ' is alternating, then 
f s + f a — f' + fa- so that f s — f' = f^ — f a . If we define g to be the common value, 
f s — f^ = g — fa — f a - then g is both symmetric and alternating. By Exercise 9.54 on 
page 713, we have g — 0, and so f s = f' and /« = /„. • 

Remark. If (V, g) is an inner product space, then g is called skew if 

g(v, u) = g(u, v) 

for all u, v e V. We now show that if k does not have characteristic 2, then g is alternating 
if and only if g is skew. 

If g is any bilinear form, we have 

g(u +v,u + v) — g(u, u) + g(u , v ) + g(v, u) + g(v, v). 

Therefore, if g is alternating, then 0 = g(u, v) + g(v, u), so that g is skew. (We have not 
yet used the hypothesis that the characteristic of k is not 2.) 

Conversely, if g is skew, then set u = v in the equation g(u, v) — —g(v, u) to get 
g(u, u) — — g{u , m); that is, 2 giu, u) = 0. Thus, g(u, u) — 0, because k does not have 
characteristic 2, and so g is alternating. ◄ 

Definition. Let / be a bilinear form on a vector space V over a field k, and let E = 
e I..... e„ he a basis of V. Then an inner product matrix of / relative to E is 

A = [f{ei,ej)i. 

Suppose that (V, /) is an inner product space, e\, ..., e n is a basis of V, and A = 
[fie, , ej)] is the inner product matrix of / relative to E. If b — b, e,- and c = c,e/ 
are vectors in V, then 

fib, c ) = /£>„!» = ^ bjfjej , ef)Cj. 

i,j 

If B and C denote the column vectors (b i, ..., b„Y and (ci,..., respectively, then 
this last equation can be written in matrix form: 

f(b,c) = B' AC. 

Thus, an inner product matrix determines / completely. 
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Proposition 9.70. Let V be an n-dimensional vector space over afield k. 

(i) Every n x n matrix A over a field k is the inner product matrix of some bilinear form 
f defined on V x V. If f is symmetric, then its inner product matrix A relative to 
any basis of V is a symmetric matrix ( i.e., A' = A). If f is alternating, then any 
inner product matrix relative to any basis ofV is skew-symmetric (i.e., A' — —A). 

(ii) If B' AC — B' A'C for all column vectors B and C, then A — A'. 

(iii) Let A and A' be inner product matrices of bilinear forms f and f' on V relative to 
bases E and E', respectively. Then f — f if and only if A and A' are congruent ; 
that is, there exists a nonsingular matrix P with 

A! = P'AP. 


Proof, (i) For any matrix A, the function f:k n x k" -* k, defined by f(b, c ) = b' Ac, is 
easily seen to be a bilinear form, and A is its inner product matrix relative to the standard 
basis ei,... ,e n . The reader may easily transfer this construction to any vector space V 
once a basis of V is chosen. 

If / is symmetric, then so is its inner product matrix A = [a//], for ajj — f(ej, ej) — 
f(ej, ei) = ajp, similarly, if / is alternating, then a\j — /(e,, ef)= —f(ej, e;) = —<a/,-. 

(ii) If b = bjCi and c — ff, c,e,-, then we have seen that fib. c) — B' AC, where B 
and C are the column vectors of the coordinates of b and c with respect to E. In particular, 
if b — ei and c — ej, then f (ei, ej) = atj is the ij entry of A. 

(iii) Let the coordinates of b and c with respect to the basis E' be B' and C', respec¬ 
tively, so that f'(b, c) = (B')'A'C', where A' = [f(e’ i , el)]. If P is the transition matrix 
E [1] E ', then B = PB' and C = PC'. Hence, f(b,c) = B'AC = (PB')'A(PC') = 
(B'Y(P‘AP)C'. By part (ii), we must have P'AP = A'. 

For the converse, the given matrix equation A! — P' AP yields equations: 

[f'(e' i ,e' j )]=A' 

= P'AP 

= | \ 52 ,Puf( e t, e q)Pqj )] 

lq 

t q 

= [f(e'i,e'j)]. 

Hence, f'(e\, e'j) — f (e\, el) for all i, j , from which it follows that f'(b, c) = f(b , c) for 
all b, c e V. Therefore, f = f. • 
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Corollary 9.71. If (V, /) is an inner product space and if A and A' are inner product 
matrices of f relative to different bases ofV, then there exists a nonzero a e k with 

detCA') = a 2 det(A). 

Consequently, A' is nonsingular if and only if A is nonsingular. 

Proof This follows from the facts: det(P r ) = det(P); det(Afi) = det(A) dct( B): and P 
is nonsingular if and only if dct( P) f 0. • 

The most important bilinear forms are the nondegenerate ones. 

Definition. A bilinear form / is nondegenerate if it has a nonsingular inner product 
matrix. 

For example, the dot product on k n is nondegenerate, for its inner product matrix relative 
to the standard basis is the identity matrix I. 

The discriminant of a bilinear form is essentially the determinant of its inner product 
matrix. However, since the inner product matrix depends on a choice of basis, we must 
complicate the definition a bit. 

Definition. If k is a field, then its multiplicative group of nonzero elements is denoted 
by k x . Define (k y ) 2 — {a 2 : a e k x }. The discriminant of a bilinear form / is either 0 or 

det(A)(T x ) 2 e k x /(k x ) 2 , 

where A is an inner product matrix of /. 

It follows from Corollary 9.71 that the discriminant of / is well-defined. Quite often, 
however, we are less careful and say that det(A) is the discriminant of /, where A is some 
inner product matrix of /. 

The next definition will be used in characterizing nondegeneracy. 

Definition. If (V, /) is an inner product space and W C V is a subspace of V, then the 
left orthogonal complement of W is 

W LL — {b e V : f(b , w) = 0 for all w e W}; 

the right orthogonal complement of W is 

W LR — {c e V : f(w, c) — 0 for all w e W}. 

It is easy to see that both W^ L and W J - R are subspaces of V . Moreover, W^ L — W A - R 
if / is either symmetric or alternating, in which case we write W^. 

Let (V, f) be an inner product space, and let A be the inner product matrix of / relative 
to a basis e\,... ,e n of V. We claim that b e V ±L if and only if b is a solution of the 
homogeneous system A'x = 0. If b e V 2 L . then f(b,ej) — 0 for all j. Writing 
b = b i e i> we see that 0 = fQ>, ej) = f(J2i b i e i,ej) = J2j b if^i, ej). In matrix 
terms, b = {b \,..., b n ) r and B' A = 0; transposing, b is a solution of the homogeneous 
system A'x = 0. The proof of the converse is left to the reader. A similar argument shows 
that c e V ±K if and only if c is a solution of the homogeneous system Ax = 0. 
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Proposition 9.72. If (V, /) is an inner product space, then f is nondegenerate if and 
only ifV^ L — {0} = V 2 ~ R ', that is, if fib, c) = 0 for all c e V, then b = 0, and if 
fib, c) = 0 for all b e V, then c = 0. 

Proof. Our remarks above show that be V 1 L if and only if b is a solution of the homo¬ 
geneous system A' x = 0. Therefore, V L L f {0} if and only if there is a nontrivial solution 
b, and Exercise 3.70 on page 170 shows that this holds if and only if det(A') = 0. Since 
det(A') = det(A), we have / degenerate. A similar argument shows that ^ {0} if 
and only if there is a nontrivial solution to Ax = 0. • 


Example 9.73. 

Let (V , /) be an inner product space, and let W C V he a subspace. It is possible that / 
is nondegenerate, while its restriction f\(W x If) is degenerate. For example, let V = k 2 , 
and let / have the inner product matrix A = [ j q ] relative to the standard basis e \, e 2 - It 
is clear that A is nonsingular, so that / is nondegenerate. On the other hand, if W = (ei), 
then f\(W x W) — 0, and hence it is degenerate. 4 

Here is a characterization of nondegeneracy in terms of the dual space. This is quite 
natural, for if / is a bilinear form on a vector space V over a field k, then for any fixed 
u e V, the function /( , u): V —> A is a linear functional. 

Proposition 9.74. Let (V, /) be an inner product space, and let e\,..., e n be a basis of 
V. Then f is nondegenerate if and only if fi , e\), ..., /( , e n ) is a basis of the dual 
space V* (we call the latter the dual basis). 

Proof. Assume that / is nondegenerate. Since dim(V*) = n, it suffices to prove linear 
independence. If there are scalars c\,... ,c n with Cjf( , e,) = 0, then 

T: ci f(v, ei) — 0 for all v e V. 

i 

If we define u = ff , Cjei , then /( v, u) = 0 for all v, so that nondegeneracy gives u — 0. 
But e\,... ,e n is a linearly independent list, so that all c,- = 0; hence, /( , e\), ..., 

f( , e„) is also linearly independent, and hence it is a basis of V*. 

Conversely, assume the given linear functionals are a basis of V*. If f(v, u) — 0 for 
all v e V, where u — JT c/e,-, then ff , ctf( , e;) = 0. Since these linear functionals are 
linearly independent, all Cj — 0, and so u — 0; that is, / is nondegenerate. • 


Corollary 9.75. If (V, f) is an inner product space with f nondegenerate, then every 
linear functional g e V* has the form 


g = f( , u) 


for a unique u e V. 
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Proof. Let e\,e n be a basis of V , and let /( , e \),...,/( , e n ) be its dual basis. 
Since g e V *, there are scalars c, with g — c,/( , e,-). If we define u = Cj ei , then 

g(v) = f(v, u). 

To prove uniqueness, suppose that /( ,u) = f( , u'). Then f(v, u — if) — 0 for all 
v e V , and so nondegeneracy of / gives u — if — 0. • 

Corollary 9.76. Let (V, /) be an inner product space with f nondegenerate. If e\, .... e n 
is a basis ofV, then there exists a basis b\, ... ,b n of V with 

f (ei, bj) = 8jj. 

Proof. Since / is nondegenerate, the function V -> V *, given by v i-»- /( , v), is an 
isomorphism. It follows that the following diagram commutes: 

V x V — — >-k, 
v 

y 

Vxf 

where ev is evaluation (x,g) i-> g(x ) and <p: (x,y) i->- (x,/( , y)). For each i, let 
gi e V* be the ;th coordinate function: If v e V and v = j Cjej, then g,(v) = Ci. By 
Corollary 9.75, there are b \,..., b n e V with g ,• = /( , bj) for all i. Commutativity of the 
diagram gives 

/ (ei, bj) = ev(e,,gj) = S , 7 . • 

Proposition 9.77. Let (V, f) be an inner product space, and let W be a subspace ofV. 
If f\(W x W) is nondegenerate, then 

V = W © w- 1 . 

Remark. We do not assume that / itself is nondegenerate; even if we did, it would not 
force f\(W x W) to be nondegenerate, as we have seen in Example 9.73. ◄ 

Proof. If u e W fl W ■*“, then f(vu, u) = 0 for all w e W. Since f\(W x W ) is nonde¬ 
generate and u e W, we have u = 0; hence, W fl W 1 - — {0}. If v e V, then /( , v)\W 
is a linear functional on W ; that is, /( , v)\W e W*. By Corollary 9.75, there is wq e W 
with f(w, v) = f(w, wq) for all w e W. Hence, v = wq + (v — wo), where wq e W and 
v — wq e W^~. • 

There is a name for direct sum decompositions as in the proposition. 

Definition. If ( V, /) is an inner product space, then we say that a direct sum 

V = Wl © • • • © W r 

is an orthogonal direct sum if, for all i f j , we have /(«),, wj) = 0 for all «j, e VV, and 
wj e Wj. 
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We are now going to look more carefully at special bilinear forms; first we examine 
alternating forms, then symmetric ones. 

We begin by constructing all alternating bilinear forms / on a two-dimensional vector 
space V over a field k. As always, / = 0 is an example. Otherwise, there exist two vectors 
e\, e-± e V with f(e i, ez) f 0; say, f(e i, e 2 ) = c. If we replace e\ by e\ — c _1 e i, then 
f (e\ , ej) = 1. Since / is alternating, the inner product matrix A of / relative to the basis 

, . , TO ll 

e P e 2is A= _j Q . 

Definition. A hyperbolic plane over a field k is a two-dimensional vector space over k 
equipped with a nonzero alternating bilinear form. 

We have just seen that every two-dimensional alternating space (V, /) in which / is 
not identically zero has an inner product matrix A = [ _j q]. 

Theorem 9.78. Let (V, /) be an alternating space, where V is a vector space over a 
field k. If f is nondegenerate, then there is an orthogonal direct sum 

V — H\ © • • • © H m , 

where each Hj is a hyperbolic plane. 

Proof. The proof is by induction on dim( V) > 1. For the base step, note that dim(V) > 2, 
because an alternating form on a one-dimensional space must be 0, hence degenerate. If 
dim(V) = 2, then we saw that V is a hyperbolic plane. For the inductive step, note that 
there are vectors ei,e 2 € V with f(e 1,^2) 7^ 0 (because / is nondegenerate, hence, 
nonzero), and we may normalize so that f(e 1, ^2) = 1: if f{e 1, ^2) = d, replace e 2 by 
d~ x e 2 - The subspace H\ — {e\ , ef) is a hyperbolic plane, and the restriction f\ (H\ x H\) 
is nondegenerate. Thus, Proposition 9.77 gives V — H\ ® . Since the restriction of / to 

H^~ is nondegenerate, by Exercise 9.56 on page 713, the inductive hypothesis applies. • 

Corollary 9.79. Let (V, /) be an alternating space, where V is a vector space over a 
field k. If f is nondegenerate, then dim( V) is even. 

Proof. By the theorem, V is a direct sum of two-dimensional subspaces. • 

Definition. Let (V, f) be an alternating space in which / is nondegenerate. A symplec- 
tic 13 basis is a basis x\,y\, ..., x m , y m such that f(xj , y,-) = 1, x,-) = — 1 for all i, 

and all other f{xi,xj), /(>•,-, yj), /(x,-, }'/), and f(yj,Xi ) are 0. 


1 'The term symplectic was coined by H. Weyl. On page 165 of his book. The Classical Groups: Their Invari- 
ants and Representations, he wrote, “The name ‘complex group’ formerly advocated by me in allusion to line 
complexes, as these are defined by the vanishing of antisymmetric bilinear forms, has become more and more 
embarrassing through collision with the word ‘complex’ in the connotation of complex number. I therefore pro¬ 
pose to replace it by the corresponding Greek adjective ‘symplectic.’ Dickson calls the group the ‘Abelian linear 
group’ in homage to Abel who first studied it.” 
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Corollary 9.80. Let (V, /) be an alternating space in which f is nondegenerate , 14 and 
let A be an inner product matrix for f (relative to some basis ofV). 


(i) There exists a symplectic basis x \, yi,..., x „,, y m for V, and A is a 2m x 2m matrix 
for some m > 1. 

‘ 0 f 


(ii) A is congruent to a matrix direct sum of blocks of the form 


-1 0 


and the latter 


is congruent to 


0 I 
-I 0 


where I is the m x m identity matrix. 


(iii) Every nonsingular skew-symmetric matrix A over a field k is congruent to a direct 


sum of 2 x 2 blocks 


0 

-1 


1 

0 


Proof (i) A symplectic basis exists, by Theorem 9.78, and so V is even-dimensional. 

(ii) An inner product matrix A is congruent to the inner product matrix relative to a sym¬ 
plectic basis arising from a symplectic basis x\ . y \...., x m , y m . The second inner product 
matrix arises from a reordered symplectic basis x\,... ,x m ,y\,..., y m . 

(iii) A routine calculation. • 

We now consider symmetric bilinear forms. 

Definition. Let (V, /) be a symmetric space, and let £ = e\, ... , e n be a basis of V. 
Then E is an orthogonal basis if /(e,-, ej) — 0 for all i f j, and E is an orthonormal 
basis if /(e,-, ej) — <5, /, where Sjj is the Kronecker delta. 

If e\,..., e n is an orthogonal basis of a symmetric space (V, /), then V — (<?i) © • • • © 
(e„) is an orthogonal direct sum. In Corollary 9.76, we saw that if (V, /) is a symmetric 
space with / nondegenerate, and if e \,..., e n is a basis of V, then there exists a basis 
b\, ..., b n of V with f(et, b ,) = <5, ;. If E is an orthonormal basis, then we can set bj — e; 
for all i. 


Theorem 9.81. Let (V, /) be a symmetric space, where V is a vector space over afield k 
of characteristic not 2. 

(i) V has an orthogonal basis, and so every symmetric matrix A with entries in k is 
congruent to a diagonal matrix. 

(ii) If C — diag[c^(ii,..., c„d n ], then C is congruent to D — diag[c/i, ..., d„], 

(iii) If f is nondegenerate and if every element in k has a square root in k, then V has an 
orthonormal basis. Every symmetric matrix A with entries in k is congruent to I. 


I4 [f the form / is degenerate, then A is congruent to a direct sum of 2 x 2 blocks [ q and a block of 0’s. 
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Proof, (i) If / = 0, then every basis is an orthogonal basis. We may now assume that / ^ 
0. By Exercise 9.54 on page 713, which applies because k does not have characteristic 2, 
there is some v e V with f(v, r) /0 (otherwise, / is both symmetric and alternating). If 
W = (v), then f\(W x W) is nondegenerate, so that Proposition 9.77 gives V — W © W^. 
The proof is now completed by induction on dim(W). 

If A is an n x n symmetric matrix, then Proposition 9.70(i) shows that there is a 
symmetric bilinear form / and a basis U — it \ . u n so that A is the inner prod¬ 

uct matrix of / relative to U. We have just seen that there exists an orthogonal basis 
v \,..., v n , so that Proposition 9.70(iii) shows that A is congruent to the diagonal matrix 
diag[/( dj, Vi),..., / (v n , v n )]. 

(ii) If an orthogonal basis consists of vectors i >,• with f(vi, Vj) = cjdi, then replacing each 
vj by v'j = c~ l Vj gives an orthogonal basis with f(vL v'A — dj. It follows that the inner 
product matrix of / relative to the basis v[,..., v' n is D — diag[t/i,..., d n \. 

(iii) By part (i), there exists an orthogonal basis ui,..., v „; let /’(t;,. v,) = c, for each i. 
Since / is nondegenerate, c, f 0 for all i (the determinant of the inner product matrix 
relative to this orthogonal basis is C 1 C 2 • • • c n )\ since each c; is a square, by hypothesis, we 
may replace each u,- by u- = (^/c7) _1 u ( -, as in part (ii); this new basis is orthonormal. The 
final statement follows because the inner product matrix relative to an orthonormal basis is 
the identity I. • 

Notice that Theorem 9.81 does not say that any two diagonal matrices over a field k 
of characteristic not 2 are congruent; this depends on k. For example, if k = C, then all 
(nonsingular) diagonal matrices are congruent to I , but we now show that this is false if 
k = R. 

Definition. A symmetric bilinear form / on a vector space V over R is positive definite 
if f(v, v) > 0 for all nonzero v e V. while / is negative definite if /( v, v) < 0 for all 
nonzero v e V . 

The next result, and its matrix corollary, was proved by 1.1. Sylvester. When n — 2, it 
classifies the conic sections, and when n — 3, it classifies the quadric surfaces. 

Lemma 9.82. If f is a symmetric bilinear form on a vector space V over R of dimension 
m, then there is an orthogonal direct sum 

V = W+ © W- © Wo, 

where /|W + is positive definite, /|W_ is negative definite, and /|Wo is identically 0. 
Moreover, the dimensions of these three subspaces are uniquely determined by f. 

Proof. By Theorem 9.81, there is an orthogonal basis v \,..., v m of V. Denote /(t>;, v/) 
by di. As any real number, each dj is either positive, negative, or 0, and we rearrange the 
basis vectors so that v\,... ,v p have postive di, v p+ \,..., v p + r have negative di, and the 
last vectors have di = 0. It follows easily that V is the orthogonal direct sum 

V = |ui,...,UpJ® [vp+ 1, - . • , Vp+f^ ® (up-|-/--t-l, • . . , V m J , 
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and that the restrictions of / to each summand are positive definite, negative definite, and 
zero. 

Now Wo = V 1 - depends only on /, and hence its dimension depends only on / as well. 
To prove uniqueness of the other two dimensions, suppose that there is a second orthogonal 
direct sum V — W+©W(_©Wo. IfT: V -> W+ is the projection, then ker T — W_©Wo. 
It follows that if ip = T \ W' + , then 

ker ^9 = W' + n ker T = W' + n (W_ © W 0 ). 

However, if v e W' + , then /( v, v) > 0, while if v € W_ © Wo, then f(v, v) < 0; 
hence, if v e ker <p, then f(v, v ) = 0. But f\W' + is positive definite, for this is one of 
the defining properties of W' + , so that /(v, v) = 0 implies v — 0. We conclude that 
ker <p = {0}, and so <p: W r + —> W+ is an injection; therefore, dim(W+) < dim(W+). The 
reverse inequality is proved similarly, so that dim(W^_) = dim(W+). Finally, the formula 
dim(W_) = dim(V) — dim(W+) — dim(Wo), and its primed version, give dim(Wi) = 
dim(W_). • 

Theorem 9.83 (Law of Inertia). Every symmetric n x n matrix A over R is congruent 
to a matrix of the form 

~l P 0 0" 

0 —l r 0 . 

0 0 0 

Moreover, the signature s of f, defined by s — p — r is well-defined, and two n x n 
symmetric real matrices are congruent if and only if they have the same rank and the same 
signature. 

Proof. By Theorem 9.81, A is congruent to a diagonal matrix diag[c/|,.... d n ], where 
d\,.... dp are positive, d p +\,..., d p + r are negative, and d p + r + 1 ,..., d n are 0. But every 
positive real is a square, while every negative real is the negative of a square; it now follows 
from Theorem 9.81 (ii) that A is congruent to a matrix as in the statement of the theorem. 

It is clear that congruent n x n matrices have the same rank and the same signature. 
Conversely, let A and A! have the same rank and the same signature. Now A is congruent 
to the matrix direct sum I p © — I, © 0 and A' is congruent to I p i © —I r r © 0. Since 
rank(A) = rank(A'), we have p' + r' = p + r; since the signatures are the same, we have 
p’ — r' = p — r. It follows that p' — p and r = r, so that both A and A' are congruent 
to the same diagonal matrix of l’s, — l’s, and 0’s, and hence they are congruent to each 
other. • 

It would be simplest if a symmetric space (V, f) with / nondegenerate always had 
an orthonormal basis; that is, if every symmmetric matrix were congruent to the identity 
matrix. This need not be so, for the 2x2 real matrix —I is not congruent to I because 
their signatures are different (/ has signature 2 and — I has signature —2). 

Closely related to bilinear forms are quadratic forms', they arise from a bilinear form / 
defined on a vector space V over a field k by considering the function Q : V —> k given by 
Q(v) = /( v, v). 
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Definition. Let V be an a vector space over a field k. A quadratic form is a function 
Q : V -» k such that 

(i) Q(cv ) = c 2 Q(v) for all v e V and c e k\ 

(ii) the function g: V x V —> k, defined by 

g(u, v) = Q(u + v) - Q(u) - Q(v), 


is a bilinear form. 


Example 9.84. 

(i) If / is a bilinear form on a vector space V, define Q(v) = f(v, v) for all v e V; we 
show that Q is a quadratic form. Now Q(cv) — f(cv, cv) = c 2 f(v, v) — c 2 Q(v ), giving 
the first axiom in the definition. If u, v e V, then 

Q(u + v) = f(u + v, u + v) 

= f(u , u ) + f(u, v ) + f(v, u) + f(v, v ) 

= Q(u) + Q(v) + g(u, v), 


where 

g(u,v) = f(u,v) + f(v,u). 

It is easy to check that g is a bilinear form that is symmetric. 

(ii) We have just seen that every bilinear form / determines a quadratic form Q. If f is 
symmetric and k does not have characteristic 2, then Q determines /. In fact, the formula 
2 f(u, v) — g(u, v) gives f(u, v) — ^g(u , v) in this case. 

(iii) If / is the usual dot product defined on R", then the corresponding quadratic form is 
Q(v ) = ||u|| 2 , where ||i>|| is the length of the vector v. 

(iv) If / is a bilinear form on a vector space V with inner product matrix A = [a, ; ] relative 

to some basis e\,... ,e n , then if u — c,e/, 


Q( u ) = ^aijdCj. 

ij 

If n = 2, for example, we have 

Q(u) — anc 2 + (a 12 + a 2 \)c\C 2 + a 22 c\. 

Thus, quadratic forms are really homogeneous quadratic polynomials. < 

We have just observed, in the last example, that if a field k does not have characteris¬ 
tic 2, then symmetric bilinear forms and quadratic forms are merely two different ways of 
viewing the same thing, for each determines the other. 



706 


Advanced Linear Algebra Ch. 9 


We have classified quadratic forms Q over C and over R. The classification over the 
prime fields is also known, as is the classification over the finite fields, and we now state 
(without proof) the results when Q is nondegenerate. Given a quadratic form Q defined on 
a finite-dimensional vector space V over a field k, its associated bilinear form is 

fix, y ) = Q(x + y) - Q(x ) - Q(y). 

Call two quadratic forms equivalent if their associated bilinear forms have congruent inner 
product matrices, and call a quadratic form nondegenerate if its bilinear form is non- 
degerate. As we have just seen in Example 9.84, / is a symmetric bilinear form (which 
is uniquely determined by Q when k does not have characteristic 2). If k is a finite field 
of odd characteristic, then two nondegenerate quadratic forms over k are equivalent if and 
only if they have the same discriminant (see Kaplansky, Linear Algebra and Geometry; A 
Second Course , pp. 14-15). If k is a finite field of characteristic 2, the theory is a bit more 
complicated. In this case, the associated symmetric bilinear form 

g(x, y) = Q(x + y) + Q(x) + Q(y) 

must also be alternating, for g(x, x) = Q(2x)+2Q(x) = 0. Therefore, V has a symplectic 
basis x \, y \,..., x m , y m . The Arf invariant of Q is defined by 

m 

Arf XQ) = Q(xi)Q(yi) 

i =1 

[it is not at all obvious that the Arf invariant is an invariant; i.e., that Art) Q) does not 
depend on the choice of symplectic basis; see R. L. Dye, “On the Arf Invariant,” Journal 
of Algebra 53 (1978), pp. 36-39, for an elegant proof]. If k is a finite field of characteristic 
2, then two nondegenerate quadratic forms over k are equivalent if and only if they have 
the same discriminant and the same Arf invariant (see Kaplansky, Linear Algebra and 
Geometry; A Second Course , pp. 27-33). The classification of quadratic forms over Q is 
much deeper (see Borevich and Shafarevich, Number Theory , pp. 61-70). Just as R can 
be obtained from Q by completing with respect to the usual metric d(a, b) — \a — b\ (that 
is, by adding points to force Cauchy sequences to converge), so, too, can we complete Z, 
for every prime p , with respect to the p-adic metric (see the discussion on page 503). The 
completion 7L p is called the p-adic integers. The p-adic metric on Z can be extended to 
Q, and its completion Q p [which turns out to be Frac(Z / ,)] is called the p-adic numbers. 
The Hasse-Minkowski theorem says that two quadratic forms over Q are equivalent if and 
only if they are equivalent over R and over Q p for all primes p. 

The first theorems of linear algebra consider the structure of vector spaces in order to 
pave the way for a discussion of linear transformations. Similarly, the first theorems of 
inner product spaces enable us to discuss the appropriate linear transformations. 

Definition. If (V, f) is an inner product space, where V is a finite-dimensional vector 
space over a field k and / is a nondegenerate bilinear form, then an isometry is a linear 
transformation (p: V —> V such that, for all u, v e V, 


f(u, v ) = f((pu , (pv ). 
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Proposition 9.85. Let (V, /) be an inner product space, where f is a nondegenerate 
bilinear form, let E = e\, ..., e n be a basis ofV, and let A be the inner product matrix 
relative to E. Then cp e GL(V) is an isometry if and only if its matrix M — e[<P]e satisfies 
the equation M' AM = A. 

Proof. Recall the equation 

f(b, c) — B r AC, 

where b, c e V and B. C e k" are their coordinate vectors relative to the basis E. In this 
notation, E\,..., E n is the standard basis of k". Now 

<P(e ,0 = MEj 

for all i, because MEj is the /th column of M that is the coordinate vector of <p(ej). There¬ 
fore, 

f{<pei,<pej) = (M Ej) 1 A(M E j) = E\{M t AM)E j . 

If <p is an isometry, then 

f((pei,<pej ) = f(ej,ej ) = E\AEj, 

so that f{ej, ej) — E'AEj — E'fiM’ AM)Ej. Hence, Proposition 9.70(h) gives M' AM = 
A. 

Conversely, if M' AM — A, then 

f(<pej,<pej ) = E\{M'AM)Ej = E^AEj - fiej.ej), 
and tp is an isometry. • 

Proposition 9.86. Let (V, f) be an inner product space, where V is a finite-dimensional 
vector space over afield k and f is a nondegenerate bilinear form. Then Isom(V, /), the 
set of all isometries of (V, f), is a subgroup of GL(V). 

Proof. We prove that Isom(V, /) is a subgroup; of course, ly is an isometry. Let 
tp: V V be an isometry. If it <= V and <pu — 0, then, for all v e V, we have 

0 = f(<pu, tpv) = f(u, v ). 

Since / is nondegenerate, u = 0 and is an injection. Hence, dim(im <p) — dim(V), so 
that im tp = V, by Corollary 3.90(ii). Therefore, every isometry is nonsingular. 

The inverse of an isometry tp is also an isometry: For all it, v e V, 

f(cp~ l u, <p~ l v ) = f {ypip — 1 u , (pip~ l v) 

= f(u, v). 

Finally, the composite of two isometries <p and 6 is also an isometry: 

f(u,v) = f(<pu,cpv) = f(O(pu,0(pv). • 

Computing the inverse of a general nonsingular matrix is quite time-consuming, but it 
is easier for isometries. 
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Definition. Let (V, /) be an inner product space whose bilinear form / is nondegenerate. 
The adjoint of a linear transformation T: V -» V is a linear transformation T*: V -* V 
such that, for all u, v £ V, 

f{Tu, v) = f(u, T*v). 

Let us see that adjoints exist. 

Proposition 9.87. If (V, /) is an inner product space whose bilinear form f is nonde¬ 
generate, then every linear transformation T : V —> V has an adjoint. 

Proof Let e\ ,..., e n be a basis of V. For each j, the function <pj : V —r k, defined by 

<Pj(v) = f(Tv,ej), 

is easily seen to be a linear functional. By Corollary 9.75, there exists u: £ V with 
q>j(v) = f(v, Uj) for all v £ V. Define T*: V -> V by T*(ej) — u j, and note that 

f(Tei,ej) = (pjiei) = /(e,-,n ; ) = f{e t , T*ef). • 

Proposition 9.88. Let (V, f) be an inner product space whose bilinear form f is nonde¬ 
generate. IfT: V V is a linear transformation with adjoint T*, then T is an isometry 
if and only ifT*T — 1 y, in which case T* = T~ l . 

Proof. If T*T = 1 y , then, for all u, v £ V, we have 

f(Tu, Tv) — f (u, T*Tv) — f (u, v), 


so that T is an isometry. 

Conversely, assume that T is an isometry. Choose v £ V ; for all u £ V, we have 

f (u , T*Tv -v) = f (u , T*Tv) - f(u, v ) 

= f(Tu , Tv) - f(u , v) 

= 0 . 

Since / is nondegenerate, T*Tv — v — 0; that is, T* Tv = v. As this is true for all v £ V, 
we have T*T — 1 y. • 


Definition. Let (V, f) be an inner product space, where V is a finite-dimensional vector 
space over a field k and / is a nondegenerate bilinear form. 

(i) If / is alternating, then Isom( V, /) is called the symplectic group , and it is denoted 
by Sp(V, /). 

(ii) If f is symmetric, then Isom( V, /) is called the orthogonal group , and it is denoted 
by 0(V, /). 
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As always, a choice of basis E of an n -dimensional vector space V over a field k gives 
an isomorphism p.: GL(V) —> GL(n, k), the group of all nonsingular n x n matrices 
over k. In particular, let (V, /) be an alternating space with / nondegenerate, and let 
E — xi,yi,, x m , y m be a symplectic basis of V (which exists, by Corollary 9.80); 
recall that n = dim(V) is even; say, n — 2m. Denote the image of Sp(V, /) by Sp(2m, k). 
Similarly, if (V, /) is a symmetric space with / nondegenerate, and E is an orthogonal 
basis (which exists when k does not have characteristic 2, by Theorem 9.81), denote the 
image of OiV , /) by 0(n, /). 

Let us find adjoints when the bilinear form is symmetric or alternating. 

Proposition 9.89. Let (V, f) be a symmetric space, where V is an n -dimensional vector 
space over a field k and f is nondegenerate, and let E — e\, ... ,e n be an orthogonal 
basis with f(ej, e, ) = c,-. 

If B — [bij] is a matrix relative to E, then its adjoint B* is its “weighted” transpose 
[c~ l cjbji]. In particular, if E is an orthonormal basis, then B* — B', the transpose of B. 

Remark. It follows that B is orthogonal if and only if B' B = I. -4 


Proof. We have 

fiBei.ej ) = fishnet, 

t 

= ^2hif(ee,ej) 

i 

= bjiCj. 

If B* — [/?*.], then a similar calculation gives 


fiet, B*ef) = ^ ~^b* tj f(ei,ei ) = Cjbfj. 

Since f(Bej, ej ) = /(e,-, B*ej ), we have 

bjiCj = Cib*j 

for all i, j. Since / is nondegenerate, all c, f 0, and so 

b*j= c 7 l cjbn. 

The last remark follows, for if E is an orthonormal basis, then c, = 1 for all i. • 


How can we recognize a symplectic matrix? 
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Proposition 9.90. Let (V, /) be an alternating space, where V is a 2m-dimensional 
vector space over afield k and f is nondegenerate, and let E be a symplectic basis ordered 


asx i,...,x m ,yi - ,y m . 

The adjoint of a matrix B 


P 

S 


Q 

T 


relative to E, partitioned into m x m blocks, is 


B* = 



-Q' 

P' 


Remark. It follows that B e Sp(2 m,k) if and only if B*B — I. a 


Proof. We have 


f{Bxi,Xj ) = f(^2paxt + s u y i ,xj'j 

t 

= ^Pei fixe, xj ) + ^s ti f(ye,Xj) 


because f(x(, xj) — 0 and f(yt, xj) — —Sy for all i, j. Let us partition the adjoint B* 
into m x m blocks: 


Hence, 

f(xi, B*xj ) = f(xi,'y]nijxe + a^yf) 

t 

l t 

— a ij > 

because f(xi,xf) — 0 and f(xi,yf) — 8u. Since f(Bxi,Xj ) = f(Xi, B*xy), we have 
<7jj — —Sji. Hence, E = —S'. Computation of the other blocks of B* is similar. • 

The next question is whether Tsom( V. f) depends on the choice of nondegenerate 
alternating bilinear form /. Observe that GL(V) acts on k VxV , the set of all functions 

V x V —»■ k: If /: V xf->l and <p e GL(V), then define tpf = f v , where 

f v (b,c) = f ((p~ l * * * V b, tp~ l c). 

This formula does yield an action: If 0 e GL( V), then (<p9)f — f v9 , where 

(cp0)f(b,c) = r e (b,c) 

= f((cpd)- l b, ( cp9r l c) 

= f(0~ l (p~ l b,e~ l (p^c). 
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On the other hand, ip(9f ) is defined by 

(f e r(b, C ) = f{<p- i b,<p- x c) 

so that (<p6)f = (P(9f). 

Definition. Let V and W be finite-dimensional vector spaces over a field k. and let 
/: V x V —*■ k and g: W x W —» k he bilinear forms. Then / and g are equivalent 
if there is an isometry qr. V -> W. 

Proposition 9.91. If V is a finite-dimensional vector space over a field k and if 
f,g'. V x V -> k are bilinear forms, then the following statements are equivalent. 

(i) / and g are equivalent. 

(ii) If E — e\, .... e n is a basis ofV, then the inner product matrices of f and g with 
respect to E are congruent. 

(iii) There is q> e GL(V) with g — f v . 

Proof, (i) => (ii) If q>\ V -> V is an isometry, then gfpib), q>(c)) = f(b,c) for all 
b, c e V. If £ = e\, ..., e n is a basis of V, then E' = <p(e i),..., q>ie n ) is also a basis, 
because is an isomorphism. Hence, A' = [gispiei), (piej))~\ = [f (e/, ej)] = A for all 
i, j\ that is, the inner product matrix A' of g with respect to E' is equal to the inner product 
matrix A of / with respect to E. By Proposition 9.70(iii), the inner product matrix A" of 
g with respect to E is congruent to A. 

(ii) => (iii) If A = [ f(ei,ej )] and A' — [g(ei, e ; )], then there exists a nonsingular 
matrix Q = [qij] with A' = Q'AQ. Define 0: V —> V to he the linear transformation 
with 0(ej) = Y. v 9vjey Finally, g = f e ~'\ 

[. g(ei,ej)] = A! = Q'AQ = [f(^q V ie v , ^qxjex)] 

v X 

= [f(Q(e i ),0(e j ))] = [f e ~\e i ,e j )l 

(iii) => (i) It is obvious from the definition that (p ~ l : (V, g) -> (V, /) is an isometry: 

gib, c) = f v {b, c) = f(cp~'b, (p~ l c ). 

Therefore, g is equivalent to /. • 

Proposition 9.92. 

(i) Let (V, /) be an inner product space, where V is a finite-dimensional vector space 
over a field k and f is a nondegenerate bilinear form. The stabilizer GL(V) f of f 
under the action on k VxV is Isom(V, /). 

(ii) If g". V x V —> k lies in the same orbit as f, then Isom(V, /) and Isom(V, g) are 
isomorphic, in fact, they are conjugate subgroups of GL(V). 
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Proof, (i) By definition of stabilizer, tp e GL( V ) f if and only if f v = /; that is, for all 
b, c e V, we have f(cp~ l b, (p~ l c ) = f(b, c ). Thus, (p~ l , and hence <p, is an isometry. 

(ii) By Exercise 2.99 on page 114, we have GL(V) g = r(GL (V)f)r~^ for some r e 
GL(V); that is, Isom(V, g) = rIsom(V, f)r~ l . • 

It follows from Proposition 9.92 that equivalent bilinear forms have isomorphic iso¬ 
metry groups. We can now show that the symplectic group is, to isomorphism, independent 
of the choice of nondegenerate alternating form. 

Theorem 9.93. If (V, f) and (V, g) are alternating spaces, where f and g are nonde¬ 
generate, then f and g are equivalent and 

Sp(V, /) = Sp(V, g). 

Proof By Corollary 9.80(ii), the inner product matrix of any nondegenerate alternating 
bilinear form is congruent to [ _ r q], where I is the identity matrix. The result now follows 
from Proposition 9.91. • 

Symplectic groups give rise to simple groups. If k is a field, define PSp(2 m,k) = 
Sp(2m, k)/Z(2m, k), where Z(2m, k) is the subgroup of all scalar matrices in Sp(2m, k). 
The groups PSp(2m, k) are simple for all m > 1 and all fields k with only three exceptions: 
PSp(2, F 2 ) = S 3 , PSp(2, F 3 ) = A 4 , and PSp(4, F 2 ) = S 6 . 

The orthogonal groups, that is, isometry groups of a symmetric space (V, f ) when / is 
nondegenerate, also give rise to simple groups. In contrast to symplectic groups, however, 
they depend on properties of the field k. We restrict our attention to finite fields k. The 
cases when k has odd characteristic and when k has characteristic 2 must be considered 
separately, and we must further consider the subcases when dim( V ) is odd or even. When 
k has odd characteristic p, there is only one orthogonal group 0(n , p m ) 15 when n is odd, 
but there are two, 0 + (n, and 0~(n, p m ), when n is even. The simple groups are 
defined from these groups as follows: First form SO € (n, //") (where e = + or e — —) 
as all orthogonal matrices having determinant 1 ; next, form PSO € (n, p m ) by dividing 
by all scalar matrices in SO € (n , p m ). Finally, we can define a subgroup (n , p m ) of 
PSO € (n , p m ) (essentially the commutator subgroup), and these groups are simple with 
only a finite number of exceptions (which can be explicitly listed). 

When k has characteristic 2, we usually begin with a quadratic form rather than a sym¬ 
metric bilinear form. In this case, there is also only one orthogonal group 0(n, 2 m ) when 
n is odd, but there are two, which are also denoted by 0 + (n, 2 m ) and 0~(n, 2 " ! ), when n 
is even. If n is odd, say, n — 21+ 1, then 0(21 + 1, 2'") = Sp(2f, 2" ! ), so that we consider 
only orthogonal groups 0 € (2i,2' n ) arising from symmetric spaces of even dimension. 
Each of these groups gives rise to a simple group in a manner analogous to the odd char¬ 
acteristic case. For more details, we refer the reader to the books of E. Artin, Geometric 
Algebra ; Conway et al. Atlas of Finite Groups', J; Dieudonne, La Geometrie des groupes 

15 When k is a finite field, say, k = Fq for some prime power q, we often denote GL(n, k) by GL(n, q). A 
similar notational change is used for any of the matrix groups arising from GL(«, k). 
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classiques ; M. Suzuki, Group Theory I; and the article by Carter in Kostrikin-Shafarevich, 
Algebra IX. 

Quadratic forms are of great importance in number theory. For an introduction to this 
aspect of the subject, see Hahn, Quadratic Algebras, Clifford Algebras, and Arithmetic 
Witt Groups', Lam, The Algebraic Theory of Quadratic Forms', and O’Meara, Introduction 
to Quadratic Forms. 

Exercises 

9.51 It is shown in analytic geometry that if l\ and O are lines with slopes m\ and m 2 , respectively, 
then l j and I 2 are perpendicular if and only if mi m 2 = — 1. If 

li — {an,- + itj : a e R), 

for i = 1,2, prove that mi in 2 = — 1 if and only if the dot product v\ ■ V 2 = 0. (Since both 
lines have slopes, neither of them is vertical.) 

Hint. The slope of a vector v = (a, b) is m = b/a. 

9.52 (i) In calculus, a line in space passing through a point u is defined as 

{u + aw : a e R) C R 3 , 

where w is a fixed nonzero vector. Show that every line through the origin is a one¬ 
dimensional subspace of R 3 . 

(ii) In calculus, a plane in space passing through a point u is defined as the subset 

{i> e R 3 : (v — u ) • n — 0} C R 3 , 

where n 7 ^ 0 is a fixed normal vector. Prove that a plane through the origin is a two- 
dimensional subspace of R 3 . 

Hint. To determine the dimension of a plane through the origin, find an orthogonal 
basis of R 3 containing n. 

9.53 If k is a field of characteristic not 2, prove that for every n x n matrix A with entries in k, 
there are unique matrices B and C with B symmetric, C skew-symmetric (i.e., C r = — C), 
and A = B + C. 

9.54 Let (P, /) be an inner product space, where P is a vector space over a field k of characteristic 
not 2. Prove that if / is both symmetric and alternating, then / = 0. 

9.55 If (P, /) is an inner product space, define u _L v to mean f(u, r) = 0. Prove that ± is a 
symmetric relation if and only if / is either symmetric or alternating. 

9.56 Let (V, /) be an inner product space with / nondegenerate. If W is a proper subspace and 
P = W © IP 3- , prove that f\(W x W "*“) is nondegenerate. 

9.57 (i) Let (P, /) be an inner product space, where V is a vector space over a field k of char¬ 

acteristic not 2. Prove that if / is symmetric, then there is a basis e\ . e n of V and 

scalars c\, ..., c n such that f(x, y) = c i x iyi> where x — f^Xjej and y — y/e,\ 

Moreover, if / is nondegenerate and k has square roots, then the basis e\,... ,e n can 
be chosen so that fix, y) = JT .Xfy,-. 

(ii) If k is a field of characteristic not 2, then every symmetric matrix A with entries in k is 
congruent to a diagonal matrix. Moreover, if A is nonsingular and k has square roots, 
then A = P 1 P for some nonsingular matrix P. 



714 


Advanced Linear Algebra Ch. 9 


9.58 Give an example of two real symmetric m x m matrices having the same rank and the same 
discriminant but that are not congruent. 


9.59 For every field k, prove that Sp(2, k) = SL(2, k). 

Hint. By Corollary 9.80(h), we know that if P e Sp(2m, k), then det(P) = ±1. However, 
Proposition 9.89 shows that det(P) = 1 for P e Sp(2, k) [it is true, for all m > 1, that 
Sp(2w, k) < SL(2 m,k)]. 


9.60 


If A is an m x m matrix with A 1 A = /, prove that 


A 

0 


0 

A 


is a symplectic matrix. Conclude, 


if k is a finite field of odd characteristic, that 0{m, k) < Sp(2m, k). 


9.61 Let (V, f ) be an alternating space with f nondegenerate. Prove that T e GL(V) is an isome¬ 
try [i.e., T e Sp( V , /)] if and only if, whenever E = x \, y \, ..., x m , y m is a symplectic basis 
of V , then T(E) = Tx\, Ty \,..., Tx m , Ty m is also a symplectic basis of V. 


9.6 Graded Algebras 

We are now going to use tensor products of many modules in order to construct some 
useful rings. This topic is often called multilinear algebra. 

Throughout this section, R will denote a commutative ring. 

Definition. An //-algebra A is a graded R-algebra if there are /^-submodules A p , for 
p > 0, such that 

® A = Z P > 0 AP; 

(ii) For all p , q > 0, if x e A p and y e A q , then xy € A p+q ; that is, 

A p A q C A p+q . 

An element x e A p is called homogeneous of degree p. 

Notice that 0 is homogeneous of any degree, but that most elements in a graded ring are 
not homogeneous and, hence, have no degree. Note also that any product of homogeneous 
elements is itself homogeneous. 

Example 9.94. 

(i) The polynomial ring A = R[x] is a graded //-algebra if we define 

A p = {rx p : r e R}. 

The homogeneous elements are the monomials and, in contrast to ordinary usage, only 
monomials (including 0) have degrees. On the other hand, x p has degree p in both usages 
of the term degree. 

(ii) The polynomial ring A = R [vi, xj,... , x n ] is a graded //-algebra if we define 

A p = [rxI'x ^ 2 ■ ■ ■xf l n : r e R and y^e,- = /;>}; 
that is, A p consists of all monomials of total degree p. 
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(iii) In algebraic topology, we assign a sequence of (abelian) cohomology groups U P (X. R ) 
to a space X, where R is a commutative ring and p > 0, and we define a multiplication on 
Y2 P >o II p (X, R), called cup product, making it a graded R -algebra. < 

Just as the degree of a polynomial is often useful, so, too, is the degree of a homoge¬ 
neous element in a graded algebra. 

Definition. If A and B are graded /^-algebras, then a graded map 16 is an /^-algebra map 
f: A —> B with f(A p ) C B p for all p > 0. 

It is easy to see that all graded R -algebras and graded maps form a category, which we 
denote by Gr^Alg. 

Definition. If A is a graded /^-algebra, then a graded ideal (or homogeneous ideal ) is a 
two-sided ideal I in A with I = Y2 P >o I p > where I p = I fl A p . 

In contrast to the affine varieties that we have considered in Chapter 6, projective vari¬ 
eties are studied more intensely in algebraic geometry. The algebraic way to study these 
geometric objects involves homogeneous ideals in graded algebras. 

Proposition 9.95. Let A and B be graded R-algebras. 

(i) If f : A —> B is a graded map, then ker / is a graded ideal. 

(ii) If I is a graded ideal in A, then A/1 is a graded R-algebra if we define 

(. A/I) p = (AP + D/I. 

Moreover, A/1 = E,M/I) P = E P A P /(I n A p ) = J2 p (A p /I p ). 

(iii) A two-sided ideal I in A is graded if and only if it is generated by homogeneous 
elements. 

(iv) The identity element 1 in A is homogeneous of degree 0. 

Proof. The proofs of (i) and (ii) are left as (routine) exercises. 

(iii) If I is graded, then I = Y2 p I p , so that I is generated by [_J /; I p . But [J ;; I p 
consists of homogeneous elements because I p = I fl A p C A p for all p. 

Conversely, suppose that I is generated by a set X of homogeneous elements. We must 
show that I — £ (/ fl A p ), and it is only necessary to prove I c ^f p (I fl A p ), for the 
reverse inclusion always holds. Since I is the two-sided ideal generated by X, a typical 
element u e I has the form u — JT a/Xibj, where at, bj e A and Xj e X. Now u = 
u p , where u p e A p , and it suffices to show that each u p lies in I. Indeed, it suffices 
to prove this for a single term a,v,77,, and so we drop the subscript i. Since a = ffaj and 
b = b(, where each a ; - and bi are homogeneous, we have u — ffi lOjxbf, but each 

*®There is a more general definition of a graded map /: A B. Given d 6 Z, then a /.-algebra map / is 
graded map of degree d if f(A p ) C B p+l1 for all p > 0. 
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term in this sum is homogeneous, being the product of the homogeneous elements aj, x, 
and b(. Thus, it p is the sum of those Ujxbt having degree p, and so it p G I. 

(iv) Write 1 = eo + e\ + ■ ■ ■ + e t , where e; G A'. If a p G A p , then 

cip — cq cip — c\Qp + • • • + e,a p G A p n (A p+1 © • • • ® A p+t ) = {0}. 

It follows that a p — eod p for all homogeneous elements a p , and so d = end for all a g A. 
A similar argument, examining d p = d p 1 (instead of ci p = 1 a p ), shows that d — aeo for 
all a G A. Therefore, 1 = eo, by the uniqueness of the identity element in a ring. • 

Example 9.96. 

The quotient /?[x]/(x 13 ) is a graded /^-algebra. However, there is no obvious grading on 
the algebra 7?[x]/(x 13 + 1). After all, what degree should be assigned to the coset of x 13 , 
which is the same as the coset of — 1 ? ◄ 

We now consider generalized associativity of tensor product. 

Definition. Let R be a commutative ring and let M \,..., M p be /^-modules. An R- 
multilinear function f : M\ x • • • x M p —> N, where N is an /^-module, is a function 
that is additive in each of the p variables (when we fix the other p — 1 variables) and if 
1 < i < p, then 


f{m\, rnii, ...,m p ) = rf(m i,- m t , .. ,,m p ), 

where r G R and m f g Mi for all l. 

Proposition 9.97. Let R be a commutative ring and let M \, ..., M p be R-modules. 

(i) There exists an R-module U[M\ . M p ] that is a solution to the universal map¬ 

ping problem posed by multilinearity. 

M\ x • • • x M p ---^ t/[Mi, ..., M p ] 


There is a R-multilinear h such that, if f is R-multilinear, then there exists a unique 
R-homomorphism f making the diagram commute. 

(ii) If f : Mj — > Mj are R-maps, then there is a unique R-map 

u[fx ,••• ,f P ]: U[Mx,...,M p ]^ U[M[,...,M' p ] 

taking h(m i, ... , m p ) h'(f\ (mi), ..., f p (m p )), where h' : M[ x • • • x M' p -» 
U[M[ . M’ p \. 
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Proof, (i) This is a straightforward generalization of Theorem 8.74, the existence of 
tensor products, using multilinear functions instead of bilinear ones. Let F be the free 
//-module with basis M\ x • • • x M p , and let S be the submodule of F generated by all 
elements of the following two types: 

(mi.m; + m'j ,..., m p ) — (mi,..., m;,..., m p ) — (in i,..., mf ..., m p ); 

(mi,..., rnii ,..., m p ) — r(m i,..., m/,..., m p ), 
where m,, m ■ e M,-, re//, and 1 < i < p. 

Define //[Mi, • • • , Mp] = F/5 and define h: M\ x • • • x M p —> U[M \, • • • , M p ] by 
h : (mi, ..., m p ) i—>• (mi,..., m p ) + S. 

The reader should check that h is //-multilinear. The remainder of the proof is merely an 
adaptation of the proof of Proposition 8.74, and it is also left to the reader. 

(ii) The function Mi x • • • x M p -> JJ[M[ . M' p ], given by 

(mi-- m p ) h* //(/i(mj), ..., f„(m p )), 

is easily seen to be //-multilinear, and hence there exists a unique //-homomorphism as 
described in the statement. • 

Observe that there are no parentheses needed in the generator h(m\, ..., m p )\ that is, 
h(m \,..., nip) depends only on the /?-tuple (mi,..., m p ) and not on any association of 
its coordinates. The next proposition relates this construction to iterated tensor products. 
Once this is done, we will change the notation U[M\, .... M p \. 

Proposition 9.98 (Generalized Associativity). Let R be a commutative ring and let 
Mi, ..., M p be R-modules. If M\ M p is an iterated tensor product in some 

association, then there is an R-isomorphism U[M \, ..., M p ] —> M\ - - M p taking 
h(m \, ..., m p ) i —y m\ ® • • • <g> m p . 

Remark. We are tempted to quote Theorem 2.20: Associativity for three factors implies 
associativity for many factors, for we have proved the associative law for three factors in 
Proposition 8.84. However, we did not prove equality, A <8>r (B <S)r C ) = (A B) <S)r C; 
we only constructed an isomorphism. There is an extra condition, due, independently, to 
Mac Lane and Stashefl': If the associative law holds up to isomorphism and if a certain 
“pentagonal” diagram commutes, then generalized associativity holds up to isomorphism 
(see Mac Lane, Categories for the Working Mathematician, pages 157-161). ◄ 

Proof. The proof is by induction on p > 2. The base step is true, for U\ M\. Mi\ = 
Mi ® r Mi- For the inductive step, let us assume that 

Mi M p = //[Mi, ..., M;] ®r Z/[M,-_|_i,..., M p ], 
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We have indicated the final factors in the association; for example, 

((Mi <S)r M2) <S>r M3) (M4 <S)r M 5 ) = U[M\, M2 , M3] l/[M 4 , M5]. 

By induction, there are multilinear functions /;'; Mi x • • • x M, -* M1 <S)r ■ ■ ■ < 8 >r M, and 
h": Mi -|_i x • • • x M p -> M,-+i <g>^ • • • ®r M p with h'{m \,..., m ( ) = mi <g> • • • <g> ra/ 
associated as in Mi ■ ■ ■ ®r M,-, and with /t"(m,+i, ..., mp) = m^i <g> • • • <g> m p 
associated as in M,+i®z?- • -(&rM p . Induction gives isomorphisms ip ': {/[Mi,..., M,] -» 
Mi M; and ip ": {/[M (+ i,..., M p ] ->• M,-+i • • • ®r M p with ip'h' = 

h\(M\ x • • • x Mi) and <p"h" — /z|(M, + i x • • • x M p ). By Corollary 8 . 78 , <p' (g> ip" is an 
isomorphism U[M\, ..., M,] ®r {/[M, + 1 , ..., M p ] —>■ Mi <g>u ■ • • 0 # Mp. 

We now show that U[M\, ..., M ; ] 0 ^ U[Mj + 1 ,..., M p ] is a solution to the universal 
problem for multilinear functions. Consider the diagram 

Mj x • • • x Mp--->■ {/[Mi, ..., Mi] ® R U[M i+ 1,..., Mp] 



where r)(mi ,..., mp) = h'(m 1 ,..., m ; ) 0 /i"(m,-+ 1 , ..., m p ), is an /{-module, and / 

is multilinear. We must find a homomorphism / making the diagram commute. 

If (mi,..., nij) e Mi x • • • x M;, the function /( mi . m; ): M, + i x • • • x M p —> /V, 

defined by (m,+i, ..., mp) i-> /(mi,..., m,-, /z"(m, + i, ..., mp)), is multilinear; hence, 
there is a unique homomorphism /( mimj ): {/[M;+i,..., Mp] —>• /V with 

■ h"(m i+ i, ... ,m p ) h* /(mi- ,m p ). 

If r <= R and 1 < j < i , then 

7( mi ,...,/m ; ,..., m ,)(/!"(»h+i- • • •, m p)) = f(m\, ■■■, rmj -- m p ) 

= rf(m 1 ,..., mj,, nij ) 

= r (m i+ 1 ,..., m p )). 

Similarly, if mj, m'. e Mj, where 1 < j < L then 

f (rn\,...,mj+m'j,...,mi) f{m\,...,mj,...,mi) A f( m j.. ...m'j.....mi)- 

The function of i + 1 variables Mi x • • • x M; x U\M l+ \, ..., M p \ —> N, defined 
by (mi, ...,mi,u") i-> /( mil ...is multilinear, and so it gives a bilinear func¬ 
tion U[M\,..., Mi] x {/[M,-+i,..., Mp] —»■ /V, namely, (u',u”) i-> (h'(u'),h"(u")). 
Thus, there is a unique homomorphism /: {/[Mi, ..., M ( ] 0« {/[M,-+i,..., M p ] —>• /V 
which takes //(mi,..., m,-)^0 h"(m i+ i ,..., m p ) h* f(m u ...,mi)(h" {m i+ u ..., m p )) 

= /(mi. m p )\ that is, ft] = f. Therefore, {/[Mi.M,] 0# {/[M; + i,..., M p ] 

is a solution to the universal mapping problem. By uniqueness of such solutions, there 

is an isomorphism 0 : JJ[M \, ..., M p ] —»■ {/[Mi.M,] 0^ {/[M; + i,..., M p ] with 

Oh(m\,... , m p ) = h'(mi,..., nii) <S> h" (nii + i,..., m p ) — ..., m p ). Finally, 

](p' 0 q>")9 is the desired isomorphism {/[Mi, ..., M p ] = Mi ®r ■ ■ ■ ®r M p . • 
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We now abandon the notation in Proposition 9.97; from now on, we shall write 

U[M \,..., M p ] — M\ ® R ■ ■ ■ (gift M p , 
h(in i, ..., nip) — m\ ® • • • ® m p , 
u[fi,...,f p ] = fi ® • • • <g> f p . 

Proposition 9.99. If R is a commutative ring and A and B are R-algebras, then the 
tensor product A ® R B is an R-algebra if we define (a ® b)(a' ® b') — aa' ® bb'. 

Proof. First, A ® R B is an R -module, by Corollary 8.81. Let p: A x A —> A and 
v: B x B -> B be the given multiplications on the algebras A and B. respectively. We 
must show there is a multiplication on A ® R B as in the statement; that is, there is an R- 
bilinear function X: (A®r B) x (A®r B) -> A<8 >r.B with A: (a®b, a'®b f ) i->- aa'®bb'. 
Such a function X exists because it is the composite 

(A ® R B) x (A ® R B) -» (A ® R B) <g> (A ® R B ) 

-> (A ® R A) x {B ® R B ) 

-* A® r B : 

the first function is (a ® b, a' ® b r ) i-^ a ® b ® a' ® b' (which is the bilinear function in 
Proposition 8.82); the second is l®r®l, where r ; B® R A -» A® R B takes b®a i-> a®b 
(which exists by Propositions 8.83 and 9.98); the third is p.® v. It is now routine to check 
that the /^-module A ® R B is an /^-algebra. • 

Example 9.100. 

In Exercise 8.48 on page 604, we saw that there is an isomorphism of abelian groups: 
I m ® I„ = Id, where d = ( m, n). It follows that if (m, n) — 1, then I„, ® I„ = {0}. Of 
course, this tensor product is still {0} if we regard I,„ and I„ as Z-algebras. Thus, in this 
case, the tensor product is the zero ring. Had we insisted, in the definition of ring, that 
1 f 0, then the tensor product of rings would not always be defined. < 

We now show that the tensor product of algebras is an "honest” construction. 

Proposition 9.101. If R is a commutative ring and A and B are commutative R-algebras, 
then A ® R B is the coproduct in the category of commutative R-algebras. 

Proof. Define p: A —> A ® R B by p: a i-> a®l, and define o : B -> A ® R B by 
o : b i —y 1 ® b. Let X be a commutative /^-algebra, and consider the diagram 

A 
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where / and g are /^-algebra maps. The function qr. A x B —> X, given by (a, b) 
f(a)g(b), is easily seen to be R -bilinear, and so there is a unique map of R -modules 
<J>: A B X with <I>(a ® b) = f(a)g(b). It remains to prove that <t> is an /^-algebra 
map, for which it suffices to prove that <l>((a ® b)(a' <8> b'f) = T>(a <g> b)®{a' ® b'). Now 

<J>((a ® b)(a' ® b'f) — ‘t>(aa' ® bb') 

= f(a)f(a')g(b)g(b'). 

On the other hand, <t>(a ® b)<t>(a' ® b') — f (a)g(b) f (a')g(b'). Since X is commutative, 
<t> does preserve multiplication. • 

Bimodules can be viewed as left modules over a suitable ring. 

Corollary 9.102. Let R and S be k-algebras, where k is a commutative ring. Every 
( R , S)-bimodule M is a left R <S>k S op -module, where 

(r ® s)m — rms. 

Proof. The function R x ,S' op x M —> M, given by (r, s, m) i-> rms, is A-trilinear, and 
this can be used to prove that (r ® s)m — rms is well-defined. Let us write s * s' for the 
product in .S -op ; that is, s * s' = s's. The only axiom that is not obvious is axiom (iii) in 
the definition of module: If a, a' e R 0* S op - then (aa')m = a(a'm), and it is enough to 
check that this is true for generators a — r ® s and a' = r' <S) s' of R <S)k ■S’ op . But 

[(r ® s)(r' ® s')]m — [rr' ® s * s']m 

— (rr')m(s * s') 

— (rr')m(s's) 

= r(r'ms')s. 


On the other hand, 

(r <8> s)[(r / <S> s )m ] = (r ® s)[r , (»^.s , )] = r(r'ms')s. • 


Definition. 

is 


If k is a commutative ring and A is a k-algebra, then its enveloping algebra 


A e = A ® k A op . 


Corollary 9.103. If k is a commutative ring and A is a k-algebra, then A is a left A e - 
module whose submodules are the two-sided ideals. If A is a simple k-algebra, then A is a 
simple A e -module. 

Proof. Since a k-algebra A is an (A, A)-bimodule, it is a left A^-module. • 
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Proposition 9.104. Ifk is a commutative ring and A is a k-algebra, then 

End^e (A) = Z(A). 

Proof. If /: A -> A is an ,4''-map, then it is a map of A viewed only as a left A- 
module. Proposition 8.12 applies to say that / is determined by z — /(1), because f(a) = 
f (a 1) = af( 1) = az for all a e A. On the other hand, since / is also a map of A viewed as 
aright A-module, we have f (a ) = /(la) = /(l)a = za. Therefore, z = /(1) e Z(A); 
that is, the map <p: f \—r /(I) is a map End a? (A) —> Z (A). The map <p is surjective, for if 
z € Z(A), then f (a) = za is an A e -endomorphism with p(f) = z; the map tp is injective, 
for if / e EndA^A) and /( 1 ) = 0, then f — 0. • 

We now construct the tensor algebra on an /^-module M. When M is a free R -module 
with basis X, then the tensor algebra will be seen to be the free R- algebra with basis X ; 
that is, it is the polynomial ring over R in noncommuting variables X. 

Definition. Let R be a commutative ring, and let M be an R -module. Define 

T°(M) = R, 

T l (M ) = M, 

T P (M) = M <Z)r ■ ■ ■ ®r M (p times) if p > 2. 

Remark. Many authors denote T P (M ) by (ft) 1 ’ M. In Proposition 9.97, T P (M) was orig¬ 
inally denoted by U\ M \ . M p ] (here, all M, = M), and we later replaced this notation 

by M\ (%)•••(%) M p , for this is easier to remember. We remind the reader that T P (M), 
however it is denoted, is generated by symbols m \ ® • • • <g> m p in which no parentheses 
occur. 

Proposition 9.105. If M is an R-module, then there is a graded R-algebra 

T(M) = J2T p (M) 

p > o 

with the action ofr e R on T q ( M ) given by 


r Oh ®---®y q ) = (ry i) ® y 2 ® • • • <S> y q — (y\ ® ® y q )r , 

and with the multiplication T P (M) x T q ( M ) —>■ T p+q (M), for p, q > 1, given by 


(xi ® ® Ap, yi ® ® y ? ) Vi ® ® Vp ® yi ® ® y q . 

Proof. First, define the product of two homogeneous elements by the formulas in the 
statement. Multiplication p : T ( M) x T (M) —>■ T ( M) must now be 

^ Y. x p®y q > 

p i p-q 
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where x p e T P (M) and y q e T q (M). Multiplication is associative because no parentheses 
are needed in describing generators m \ ® ■ ■ ■ ® m p of T P (M), and the distributive laws 
hold because multiplication is R -bilinear. Finally, 1 e k — T°(M) is the identity, each 
element of R commutes with every element of T (M), and T p (M)T q ( M ) C T p+q (M), so 
that T (M) is a graded /^-algebra. • 

The reader may check that if M — R, then T ( M ) = R[x]. 


Definition. If R is a commutative ring and M is an R -module, then T ( M ) is called the 
tensor algebra on M. 

If R is a commutative ring and A and B are R -modules, define a word of length p > 0 
on A and B to be an R -module of the form 

W(A , B) p = T ei (A ) ® R T fl (B ) ®r ■ ■ ■ (g) S T e AA) ® R T fr (B), 

where + /, ) = p, all e,-, f) are integers, e\ > 0, f r > 0, and all the other exponents 

are positive. 


Proposition 9.106. If A and B are R-modules, then for all p > 0, 

P 

T p {A © B) = J2 B h ®R 
j =0 

where W (A , B)j, W\A , B) p _j range over all words of length j and p — j, respectively. 
Proof The proof is by induction on p > 0. For the base step, 

T°(A ©fl) = R = R ® r R = T°(A ) ® R T°(B). 

For the inductive step, 

T p+ \A © B) = T p (A © B) ® R (A © B) 

= (T p (A © B) ® R A) © ( T p (A © B) ® R B) 
p 

= W(A ’ B \i ®R W>(A ' B h~j ®R 
7=0 

where X = A or X = B. This completes the proof, for every word of length p — j + 1 has 
the form W'(A, B ) ®r X. • 



Sec. 9.6 Graded Algebras 


723 


Proposition 9.107. Tensor algebra defines a functor T : r Mod —>• Gr^Alg. Moreover, 
T presents surjections. 

Proof. We have already defined T on every A 1 -module M: it is the tensor algebra T(M). 
If /: M -> N is an R -homomorphism, then Proposition 9.97 provides maps 

f® ••• ® f: T p (M) -» T p (N), 

for each p, which give an .R-algebra map T(M) —>• T (. N ). It is a simple matter to check 
that T preserves identity maps and composites. 

Assume that f: M —> N is a surjective R-map. If n\ ® ■ ■ ■ ® n p e T P (N), then 
suijectivity of / provides m, e M, for all i, with /(m,j = n,, and so 

T (/) \ m\® ■ ■ ■ ®m p n\® • ■ ■ ®n p . • 


We now generalize the notion of free module to free algebra. 

Definition. If A is a subset of an /^-algebra F, then F is a free R-algebra with basis X 
if, for every /^-algebra A and every function <p\ X —> A, there exists a unique /^-algebra 
map <p with fix) = (fix) for all x e X. In other words, the following diagram commutes, 
where i: X —»■ F is the inclusion. 


In the next proposition, we regard the graded /^-algebra T ( V) merely as an /^-algebra. 

Proposition 9.108. If V is a free R-module with basis X, where R is a commutative ring, 
then T ( V ) is a free R-algebra with basis X. 

Proof. Consider the diagram 


T(V) 


j | 
V 


X 


T(<p) 

A 



T{A) 



A, 


where i: X —> V and j: V —> T(V) are inclusions, and A is an A 1 -algebra. Viewing A 
only as an /^-module gives an A 1 -module map f: V -x A, for V is a free A 1 -module 
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with basis X. Applying the functor T gives an R -algebra map T if): T(V) —> T(A). 
For existence of an R-algebra map T(V) -> A , it suffices to define an R-algebra map 
/a: T(A) -» A such that the composite p o T ifp) is an R-algebra map extending <p. For 
each p, consider the diagram 


h p 

Ax--- x A — 


TP (A) 



: ^p 

y 

A, 


where h p : (a\, ..., a p ) i-> a\ ® ■ ■ ■ ® a p and m p : (ai, ..., a p ) a\ ■ ■ ■ a p , the latter 
being the product of the elements a\,... ,a p in the R-algebra A. Of course, m p is R- 
multilinear, and so it induces an R-map p p making the diagram commute. Now define 
p : T (A) —»■ A by p — p p . To see that p is multiplicative, it suffices to show 

p p+q {fa\ ® - ■ ■ ® a p ) ® (a[ ® ■ ■ ■ ® a ' q )) = p p (a\ ® ■ ■■ ® a p )p q {a[ ® ■ ■■ ® a’ q ). 

But this equation follows from the associative law in A: 

(«!••• a p )(a j • • • a' q ) = cn ■ ■ ■ a p a[ ■ ■ ■ a! q . 

Finally, uniqueness of this /^-algebra map follows from V generating T (V) as an /^-algebra 
[after all, every homogeneous element in T (V) is a product of elements of degree 1], • 


Corollary 9.109. Let R be a commutative ring. 

(i) If A is an R-algebra, then there is a surjective R-algebra map T (A) —>■ A. 

(ii) Every R-algebra A is a quotient of a free R-algebra. 

Proof, (i) Regard A only as an R-module. For each p > 2, multiplication A p —> A is 
R -multilinear, and so there is a unique /Amodule map T P (A ) —> A. But these maps may 
be assembled to give an W-module map T ( A) = T p (A ) —> A. This map is surjective, 
because A has a unit 1, and it is easily seen to be a map of R-algebras; that is, it preserves 
multiplication. 

(ii) Let V be a free R-module for which there exists a suijective /Amap V —*■ A. By 
Proposition 9.107, the induced map T((p): T(V) —> T{A) is suijective. Now T(V ) is 
a free R-algebra, and if we compose T(cp) with the surjection T(A) A, then A is a 
quotient of T(V). • 

Definition. If R is a commutative ring and V is a free R-module with basis X, then T (V) 
is called the ring of polynomials over R in noncommuting variables X, and it is denoted 
by R{X). 
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If V is the free R-module with basis X , then each element u in T ( V) has a unique 
expression 

M = X! r 'l,-,ip X 'l ® ® 

p>0 

i\....dp 

where r (1 .e W and x; • e X. We obtain the usual notation for such a polynomial by 

erasing the tensor product symbols. For example, if X = {x, y 1, then 

u — ro + r\x + r 2 y + r^x 1 + r 4 y 2 + r^xy + r^yx H-. 

Example 9.110. 

Just as for modules, we can now construct rings (Z-algebras) by generators and relations. 
The first example of a ring that is left noetherian but not right noetherian was given by J. 
Dieudonne; it is the ring R generated by elements x and y satisfying the relations yx = 0 
and y 2 = 0. The existence of the ring R is now easy: Let V be the free abelian group 
with basis m, u, let R = ( Hp>o T p (V ) j //, where I is the two-sided ideal generated by 

vu and v 2 , and set x = u + I and y = v + I. Note that since the ideal I is generated by 
homogeneous elements of degree 2, we have T l (V) — V fl / = {0}, and so x ^ 0 and 
y 7^0. ◄ 

We now mention a class of rings generalizing commutative rings. 

Definition. If k is a field, 17 then a polynomial identity on a ^-algebra A is an element 
f(X) e k(X) (the ring of polynomials over k in noncommuting variables X) all of whose 
substitutions in A are 0. 

For example, if f(x,y) — xy — yx e k(x, y), then / is a polynomial identity on a 
^-algebra A if ab — ha = 0 for all a,b e A; that is, A is commutative. 

Here is a precise definition. Every function (p: X -* A extends to a ^-algebra map 
i p: k(X) —»■ A, and f(X ) is a polynomial identity on A if and only if f (X) e p| ker^ 
for all functions (p\ X —> A. 

Definition. A k-algebra A is a Pl-algebra (an algebra satisfying a polynomial identity) 
if A satisfies some identity at least one of whose coefficients is 1. 

Every k-algebra generated by n elements satisfies the standard identity 
Sn+ l(*l,- X n +i) — ^2 Sgn(cr )X(j(i) • • • X a (n+l). 

rt +1 

We can prove that the matrix algebra Mat,, (k) satisfies the standard identity s n 2 +1 , and 
S. A. Amitsur and J. Levitzki proved that Mat,, (k) satisfies sjn ; moreover, 2 n is the lowest 
possible degree of such a polynomial identity. There is a short proof of this due to S. 
Rosset, “A New Proof of the Amitsur-Levitski Identity,” Israel Journal of Mathematics 23, 
1976, pages 187-188. 

17 We could, of course, extend these definitions by allowing k to be a commutative ring. 
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Definition. A central polynomial identity on a k-algebra A is a polynomial identity 
f(X) e k{X) on A all of whose values f{a\, 02, ■ ■ ■) (as the a, vary over all elements of 
A) lie in Z(A). 

It was proved, independently, by E. Formanek and Yu. P. Razmyslov, that Mat,, (k) 
satisfies a central polynomial identity. 

There are theorems showing, in several respects, that Pi-algebras behave like commu¬ 
tative algebras. For example, recall that a ring R is primitive if it has a faithful simple left 
/Amodule; if R is commutative, then R is a field. I. Kaplansky proved that every primi¬ 
tive quotient of a Pi-algebra is simple and finite-dimensional over its center. The reader is 
referred to Procesi, Rings with Polynomial Identities. 

Another interesting area of current research involves noncommutative algebraic geom¬ 
etry. In essence, this involves the study of varieties now defined as zeros of ideals in 
k{x i , ..., x n ) instead of in k[x\,..., x n ]. 

Exercises 

9.62 (i) If k is a subfield of a field K. prove that the ring K k[x) is isomorphic to Ai[x], 

(ii) Suppose that k is a field, p(x ) e k[x) is irreducible, and K = k(a), where a is a root of 
p{x). Prove that, as rings, K ® k X — L\x]/(p(x)), where ( p{x )) is the principal ideal 
in K[x) generated by pix). 

(iii) The polynomial pix), though irreducible in k[x\, may factor in K[x], Give an example 
showing that the ring K K need not be semisimple. 

(iv) Prove that if K/k is a finite separable extension, then K ® k K is semisimple. (The 
converse is also true.) 

9.63 Let m and n be positive integers, and let d = gcd(wt,n). Prove that I m ®j j I n = as 
commutative rings. 

Hint. See Exercise 8.48 on page 604. 

9.64 If A = A' and B = B' are fc-algebras, where k is a commutative ring, prove that A ® k B = 
A! B 1 as k-algebras. 

9.65 If k is a commutative ring and A and B are k-algebras, prove that 

(A <8>a- 5)° p = A°P ®a B°P. 

9.66 If R is a commutative k-algebra, where k is a field, and if G is a group, prove that R ®a kG = 
RG. 

9.67 (i) If k is a subring of a commutative ring R, prove that R k[x] = f?[x] as 

^-algebras. 

(ii) If f(x) € k[x] and (/) is the principal ideal in k[x] generated by fix), prove that 
R (/) is the principal ideal in f?[x] generated by fix). More precisely, there is a 
commutative diagram 


0->■ E ® k (f) -s- E ®a k[x ] 



0 
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(iii) Let k be a field and E = ![*]/(/), where fix) e k\x] is irreducible. Prove that 
E <2>k E = E[x\/if)p, where {J)e is the principal ideal in E[x\ generated by fix). 

(iv) Give an example of a field extension E/k with E E not a field. 

Hint. If f(x) e k\x) factors into g(x)/t(x) in E[x ], where (g, h) = 1, then the Chinese 
remainder theorem applies. 

9.68 Let k be a field and let fix) e k\x | be irreducible. If K /k is a field extension, then fix) = 
Pi )x) ei ■ ■ ■ p n ix) e " e K [jr], where the p, ix) are distinct irreducible polynomials in K \x ] and 
G > 1. 

(i) Prove that fix) is separable if and only if all e/ = 1. 

(ii) Prove that a finite field extension K/kis separable if and only if K K is a semisimple 
ring. 

Hint. First, observe that K/k is a simple extension, so there is an exact sequence 
0 —>■ if) —»■ k[x\ -4- K —> 0. Second, use the Chinese remainder theorem. 

9.69 Prove that the ring R in Example 9.110 is left noetherian but not right noetherian. 

Hint. See Cartan and Eilenberg, Homological Algebra, p. 16. 

9.70 If G is a group, then a L-algebra A is called G-graded if there are L-submodules A g , for all 
jeG, such that 

Ci) A = ^ geG AS-, 

(ii) For all g, h e G, A ? A 1 ’ C A gh . 

An lb-graded algebra is called a superalgebra. If A is a G-graded algebra and e is the identity 
element of G, prove that 1 e A e . 

9.71 If A is a 1-algebra generated by n elements, prove that A satisfies the standard identity defined 
on page 725. 


9.7 Division Algebras 

That the tensor product of algebras is, again, an algebra, is used in the study of division 
rings. 

Definition. A division algebra over a field k is a division ring regarded as an algebra over 
its center k. 

Let us begin by considering the wider class of simple algebras. 

Definition. A 1-algebra A over a field k is central simple if it is finite-dimensional, 18 
simple (no two-sided ideals other than A and {0}), and its center Z(A) — k. 

Notation. If A is an algebra over a field k, then we write 

[A : k] = dim*(A). 

18 Some authors do not assume finite-dimensionality. 
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Example 9.111. 

(i) Every division algebra A that is finite-dimensional over its center k is a central simple 
k-algebra. The quaternions H is a central simple R-algebra, and every field is a central 
simple algebra over itself. Hilbert gave an example of an infinite-dimensional division 
algebra (see Drozd-Kirichenko, Finite Dimensional Algebras, page 81). 

(ii) If k is a field, then Mat,, (k) is a central simple k-algebra. 

(iii) If A is a central simple k-algebra, then its opposite algebra A op is also a central simple 
k-algebra. < 


Theorem 9.112. Let A be a central simple k-algebra. If B is a simple k-algebra, then 
A B is a central simple Z(B)-algebra. In particular, if B is a central simple k-algebra, 
then A <S>k B is a central simple k-algebra. 

Proof Each x e A B has an expression of the form 


x=a\®b\-\ -b a n <S>b n , (1) 

where a, e A and bj e B. For nonzero x, define the length of x to be n if there is no 
such expression having fewer than n terms. We claim that if x has length n, that is, if 
Eq. (1) is a shortest such expression, then b\, ... ,b n is a linearly independent list in B 
(viewed as a vector space over k). Otherwise, there is some j and m,- e k, not all zero, with 
bj — Ujbi. Substituting and collecting terms gives 

a - = ^^(a,- + uiaj) <g> bj, 

‘¥=j 

which is a shorter expression for x. 

Let I {0} be a two-sided ideal in A B. Choose x to be a (nonzero) element in I of 

smallest length, and assume that Eq. (1) is a shortest expression for x. Now a\ ^ 0. Since 

A«i A is a two-sided ideal in A, simplicity gives A = Aa \ A. Hence, there are elements a' p 

and a" in A with 1 = a'a\a''. Since I is a two-sided ideal, 

p ‘—‘P p p 

x = Y_ a’ p xa" p — 1 <g> /?i + C 2 ® b 2 H- \-c n ®b n (2) 

p 

lies in I, where, for i > 2 , we have q = . At this stage, we do not know 

whether x' ^ 0, but we do know, for every a e A, that ( a ® l)x' — x'(a ® 1) e I. Now 

(a ® \)x' — x'(a ® 1) = ^^(aci — cia) ® bi. (3) 

;> 2 

First, this element is 0, lest it be an element in I of length smaller than the length of x. 
Since b\ is a linearly independent list, the k-subspace it generates is [b\,..., b n ) — 
{b\) © • • • © {b n ), and so 


A (b\, - b n ) = A {bi) © • • • © A ( b n ). 
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It follows from Eq. (3) that each term (act — Cia) 0 b, must be 0. Hence, aci = c;a for all 
a e A; that is, each c, e Z(A) = k. Eq. (2) becomes 

x' — 1 ® b\ + C 2 ® b 2 H-h c„ <8> b„ 

= 1 ® + 1 ® C 2^2 H-hi® c n b n 

= 1 ® (b\ + C 2^2 H- h c n b n ). 

Now b\ + C 2^>2 + • • • + c n b n ^ 0, because bi,... ,b n is a linearly independent list, and 
so x' ^ 0. Therefore, I contains a nonzero element of the form 1 ® b. But simplicity of 
B gives BbB = B. and so there are h ' q , b" q e B with '}2 q b' q bb q = 1. Hence, I contains 
V (1 0i>')(l0Z?)(l0 b'') = 1 ® 1, which is the unit in A®kB. Therefore, I — A ®t B 
and A B is simple. 

We now seek the center of A B. Clearly, k ®a- Z(B) C Z(,4 (gik B). For the reverse 
inequality, let z e Z(A <Z>k B) be nonzero, and let 


Z = ai ® bi H-h a n ® b n 


be a shortest such expression for z. As in the preceding argument, b \, b n is a linearly 
independent list over k. For each a e A. we have 


0 = (a ® l)z — z(a ® 1) = — aid) ® bi. 


It follows, as above, that ( aa, — a/a) ® bi — 0 for each i. Hence, acii — = 0, so 

that aat = did for all d e A and each n ; - e Z(A) = k. Thus, z = 1 ® x, where 

x = d\b\ + ■ ■ ■ + d„b n e B. But if b e B , then 

0 = z(l ® b) — (1 ® b)z = (1 ® x)(l ® b) — (1 ® b)( 1 ® x) = 1 ® (xb — bx). 

Therefore, xb — bx = 0 and x e Z(B ). We conclude that z e k (gik Z(B), as desired. • 

It is not generally true that the tensor product of simple ^-algebras is again simple; we 
must pay attention to the centers. In Exercise 9.67(iv) on page 727, we saw that if E/k is 
a field extension, then E <Z)k E need not be a field. The tensor product of division algebras 
need not be a division algebra, as we see in the next example. 

Example 9.113. 

The algebra C ®k H is an eight-dimensional R-algebra, but it is also a four-dimensional 
C-algebra: A basis is 


1 = 1®1, 1 ®i, I®;, 101. 
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We let the reader prove that the vector space isomorphism C ®r H —> Mat 2 (C) with 


1 0 1 1^ 

'1 O' 

0 lj ’ 

1 0 i i-^ 

'i 0' 

0 —i 

1 0 j i-» 

' 0 f 
-1 0 

1 0 k i-^ 

o r 
/ oj ’ 


is an isomomorphism of C-algebras. ◄ 

Another way to see thatC®RHI = Mat 2 (C) arises from Example 8.71(ii). We remarked 
then that 

RQ = RxKxKxRxI; 

tensoring by C gives 

CQ = C ®r RQ = CxCxCxCxC ®r H. 

It follows from the uniqueness in Wedderburn’s theorem that C ®r H = Mat 2 (C). 

The next theorem puts the existence of the isomorphism in Example 9.113 into the 
context of central simple algebras. 

Theorem 9.114. Let k be afield and let Abe a central simple k-algebra. 

(i) Ifk is the algebraic closure ofk, then there is an integer n with 

k <S>k A = Mat„(k). 

(ii) If A is a central simple k-algebra, then there is an integer n with 

[A : k] = n 2 . 

Proof, (i) By Theorem 9.112, k®/, A is a simple k-algebra. Hence, Wedderburn’s theorem 
(actually. Corollary 8.63) gives k ®,t A = Mat,, (D) for some n > 1 and some division ring 
D. Since D is a finite-dimensional division algebra over k, the argument in Molien’s 
Corollary 8.65 shows that D = k. 

(ii) We claim that [A : k] = [k 0 ^- A : k], for if a\,... , a m is a basis of A over k, then 
1 ®ai,..., 1 0 a m is a basis of k®*- A over k (essentially because tensor product commutes 
with direct sum). Therefore, 

[A:k]= [k ®a- A : k] = [Mat„(k) : k] = n 2 . • 

The division ring of quaternions H is a central simple R-algebra, and so its dimension 
[H : R] must be a square (it is 4). Moreover, since C is algebraically closed. Theorem 9.114 
gives C ®r H = Mat 2 (C) (Example 9.113 displays an explicit isomorphism). 
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Definition. A splitting field for a central simple k-algebra A is a field extension E / k for 
which there exists an integer n such that E <g>£ A = Mat„(.E). 

Theorem 9.114 says that the algebraic closure k of a field A: is a splitting field for every 
central simple k-algebra A. We are going to see that there always exists a splitting field 
that is a finite extension of k, but we first develop some tools in order to prove it. 

Definition. If A is a k-algebra and X C ,4 is a subset, then its centralizer, Ca(X), is 
defined by 

Ca(X ) = {a e A : ax = xa for every x e X). 

It is easy to check that centralizers are always subalgebras. 

The key idea in the next proof is that a subalgebra B of A makes A into a (B, A)- 
bimodule, and that the centralizer of B can be described in terms of an endomorphism ring 
(this idea is exploited in proofs of the Morita theorems). 

Theorem 9.115 (Double Centralizer). Let A be a central simple algebra over afield 
k and let B be a simple subalgebra of A. 

(i) Ca(B) is a simple k-algebra. 

(ii) B <S>k A op = Mat s (A) and Ca(B ) = Mat,-(A) for some division algebra A, where 
r | s. 

(hi) [B : k][C A (B) :k] = [A: k]. 

(iv) C a (C a (B )) = B. 

Proof Associativity of the multiplication in A shows that A can be viewed as a [B, A)- 
bimodule. As such, it is a left (B A op (-module, where (b <g> a)x = bxa for all x e A; 
we denote this module by A*. But B <E)k A op is a simple k-algebra, by Theorem 9.112, 
so that Corollary 8.63 gives B <Eh : A op = Mat s (A) for some integer s and some division 
algebra A over k; in fact, B <E)k A op has a unique (to isomorphism) minimal left ideal L, and 
A op = Endg^j.Aopfi)- Therefore, as ( B <g>£ A op )-modules, Corollary 8.44 gives A* = L r , 
the direct sum of r copies of L, and so End^g^/ioptA*) = Mat, (A). 

We claim that 

Ca(B ) = Ends^AopfA*) = Mat,-(A); 

this will prove (i) and most of (ii). If tp e Endfl 0 t/ iop(A*), then it is, in particular, an 
endomorphism of A as a right A-module. Hence, for all a e A, we have 

< p(a ) = (p{\a) — (p(l)a = ua, 

where u — In particular, if b e B, then (p(b) = ub. On the other hand, taking the left 
action of B into account, we have cp(b) = (p(bl) = b<p( 1) = bu. Therefore, ub = bu for 
all b e B, and so u e Ca(B). Thus, tp i->- cp( 1) is a function Endfi 0 )t , / iop(A*) -> C A {B). It 
is routine to check that this function is an injective k-algebra map; it is also surjective, for 
if u e Ca(B), then the map A —»• A, defined by a i-> ua, is a (B <g>£ A op )-map. 
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We now compute dimensions. Define d = [A : k]. Since L is a minimal left ideal in 
Maty (A), we have Maty (A) = L s (concretely, L — COL(l), consisting of all first columns 
ofx s matrices over A). Therefore, [Maty(A) : k] = s 2 [A : k] and [L s : k] = s[L : k], 
so that 

[L : k] = sd. 

Also, 

[A : k] = [A* : k] = [L r : k] = rsd. 

It follows that 


[A : k][B : k] = [B <% A op : k] = [Maty(A) : k] = s 2 d. 

Therefore, [B : k] = and so r \ s. Hence, 

[B : k][C A (B ) : k] — [B : fc][Mat r (A) : k] = - ■ r 2 d = rsd = [A : k], 

r 

because we have already proved that C A (B) = Mat,-(A). 

Finally, we prove (iv). It is easy to see that B C C A (C A (B)): after all, if b e B and 
u <= C a (B ), then bu — ub , and so b commutes with every such it. But C A (B) is a simple 
subalgebra, by (i), and so the equation in (iii) holds if we replace B by C A (B ): 

[C a (B] : k][C A (C A (B)) : k] = [A : k]. 

We conclude that [B : k] = \C A (C A (B)) : k]\ together with B C C A (C,\ (B)), this 
equality gives B — C A (C A (B)). • 

Here is a minor variant of the theorem. 

Corollary 9.116. If B is a simple subalgebra of a central simple k-algebra A, where k is 
afield, then there is a division algebra D with B op 0/,- A = Maty(Z)). 

Proof. By Theorem 9.115(ii), we have A op = Maty (A) for some division algebra A. 
Hence, (B0^A op ) op = (Maty(A)) op . But (Maty(A)) op = Maty(A op ), by Proposition 8.13, 
while (Z?0^ : A op ) op = B op 0^ A, by Exercise 9.65 on page 726. Setting/) = A op completes 
the proof. • 

If A is a division algebra over a field k and if S e A, then the subdivision algebra 
generated by k and S is a field, because elements in the center k commute with S. We are 
interested in maximal sub fields of A. 

Lemma 9.117. If A is a division algebra over a field k, then a subfield E of A is a 
maximal subfield if and only if C^(E) = E. 

Proof If Zt is a maximal subfield of A, then E C Ca(E) because E is commutative. 
For the reverse inclusion, it is easy to see that if S e C A (E), then the division algebra E' 
generated by E and S is a field. Hence, if 8 <£ E, then E C E', and the maximality of E is 
contradicted. 
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Conversely, suppose that £ is a subfield with Ca(£) = E. If £ is not a maximal 
subfield of A, then there exists a subfield E' with E C E'. Now E' C Ca(£), so that 
if there is some a' e E' with a' E, then E ^ C\{E). Therefore, £ is a maximal 
subfield. • 

After proving an elementary lemma about tensor products, we will extend the next result 
from division algebras to central simple algebras (see Theorem 9.127). 

Theorem 9.118. IfD is a division algebra over a field k and E is a maximal subfield of D, 
then E is a splitting field for D\ that is, Ef&kD = Mafifit), where s = [D :£] = [£ : k]. 

Proof Let us specialize the algebras in Theorem 9.115. Here, A — D, B = E, and 
Ca(E) — E, by Lemma 9.117. Now the condition Ca(B ) = Mat, (A) becomes E = 
Mat,.(A); since E is commutative, r — 1 and A — E. Thus, Corollary 9.116 says that 
E® k D = E °P ® k D = MaL(£). 

The equality in Theorem 9.115(iii) is now [D : k\ = [E : k][E : k] = [E : k ] 2 . But 
[E <g> k D : k] = [MaL(£) : k] = s 2 [E : k], so that s 2 — [D : k] = [E : k] 2 and 
$ = [£:&]. • 

Corollary 9.119. If D is a division algebra over afield k, then all maximal subfields have 
the same degree over k. 

Proof For every maximal subfield E, we have [E : k] = [D : E] = Vi D : k], • 

This corollary can be illustrated by Example 9.113. The quaternions El is a four¬ 
dimensional E-algebra, and so a maximal subfield must have degree 2 over R. And so 
it is, for C is a maximal subfield. 

We now prove a technical theorem that will yield wonderful results. Recall that a unit 
in a noncommutative ring A is an element having a two-sided inverse in A. 

Theorem 9.120. Let k be a field, let B be a simple k-algebra, and let A be a central 
simple k-algebra. If there are algebra maps f,g:B—^A, then there exists a unit u e A 
with 

gib) = uf(b)u~ l 


for all b e B. 

Proof. The map / makes A into a left B -module if we define the action of b e B on an 
element a e A as f(b)a. This action makes A into a (B. A )-bimodule, for the associative 
law in A gives ( f{b)x)a — f(b)(xa) for all x e A. As usual, this ( B, A)-bimodule is a 
left A op )-module, where (b®a')a = baa’ for all a e A; denote it by fA. Similarly, g 
can be used to make A into a left (B ® k A op )-module we denote by K A. By Theorem 9.112, 
B A op is a simple k-algebra. Now 


[ f A : A] = [A : A] = [*A : A], 
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so that fA = g A as (£<g)£A op )-modules, by Corollary 8.63. If tp: fA g A is a (B®*. A op )- 
isomorphism, then 


<P(f(b)aa') = g(b)<p(a)a (4) 

for all b e B and a, a 1 e A. Since <p is an automorphism of A as a right module over itself, 
(p(a) — cp(la ) = ua , where u — tp( 1) e A. To see that u is a unit, note that (p~ l (a) — u'a 
for all a e A. Now a — tp(p~ l (a ) = <p(u'a ) = uu'a for all a e A; in particular, when 
a = 1, we have 1 = uu'. The equation <p~ l (p — 1 a gives 1 = u'u, as desired. Substituting 
into Eq. (4), we have 


uf(b)a = <p(f (b)a) = g(b)cp(a) = g{b)ua 
for all a e A. In particular, if a = 1, then uf(b ) = gib)u and g(b) — uf(b)u~ l . • 

Corollary 9.121 (Skolem-Noether). Let A be a central simple k-algebra over a field 
k, and let B and B' be isomorphic simple k-subalgebras of A. If \[r: B —> B' is an 
isomorphism, then there exists a unit u e A with fib) = ubu~ l for all b e B. 

Proof In the theorem, take /: B —»■ A to be the inclusion, define B' — im i//, and define 
g — ixjr, where i : 11' —> A is the inclusion. • 

There is an analog of the Skolem-Noether theorem in group theory. A theorem of G. 
Higman, B. H. Neumann, and H. Neumann says that if B and B' are isomorphic subgroups 
of a group G, say, cp: B —*■ B' is an isomorphism, then there exists a group G* containing 
G and an element it e G* with q>(b) — ubu~ l for every b e B. There is a proof in 
Rotman, An Introduction to the Theory of Groups, page 404. 

Corollary 9.122. Let k be a field. If is an automorphism o/Mat„(k), then there exists 
a nonsingular matrix P e Mat„ ik) with 

f(T) = PTP~ l 


for every matrix T in Mat„ ( k ). 

Proof The matrix ring A — Mat„(k) is a central simple k-algebra. Set B = B' — A in 
the Skolem-Noether theorem. • 

The following proof of Wedderburn’s theorem is due to B. L. van der Waerden. 

Theorem 9.123 (Wedderburn). Every finite division ring D is a field. 

Proof. Let Z = Z(D), and let £ be a maximal subfield of I). If t/ e D, then Z(d) is 
a subfield of D, and hence there is a maximal subfield E f j containing Z(d). By Corol¬ 
lary 9.119, all maximal subfields have the same degree, hence have the same order. By 
Corollary 3.132, all maximal subfields here are isomorphic. 19 For every d e D, the 

19 It is not true that maximal subfields in arbitrary division algebras are isomorphic; see Exercise 9.80. 
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Skolem-Noether theorem says there is x,i e I) with Ed — XdEx d 1 . Therefore, D = 
u v xEx~ l , and so 

D* = \JxE*x~\ 

X 

If £ is a proper subfield of D, then E x is a proper subgroup of D x , and this equation 
contradicts Exercise 5.32 on page 278. Therefore, D — E is commutative. • 


Theorem 9.124 (Frobenius). If D is a noncommutative finite-dimensional real division 
algebra, then D = H. 

Proof. If E is a maximal subfield of D , then [D :£’] = [£’: R] < 2. If [£ : R] = 1, 
then [D : R] = l 2 = 1 and D = R. Hence, [E : R] = 2 and [D : R] = 4. Let us identify 
E with C (we know they are isomorphic). Now complex conjugation is an automorphism 
of E, so that the Skolem-Noether theorem gives x e D with z — xzx~ l for all z e E. In 
particular, —i — xix “ 1 . Hence, 

x 2 ix~ 2 — x(—i)x~ l — —xix~ l — i, 

and so x 2 commutes with i. Therefore, x 2 e Cd(E ) = E, by Lemma 9.117, and so 
x 2 — a + bi for a, b e R. But 


a + bi — x 2 = XX 2 x 1 = x(a + bi)x 1 = a — bi, 

so that/? = Oandv 2 e R. Ifv 2 > 0, then there is t e Rwithx 2 = t 2 . Now (x+t)(x—t) = 
0 gives x = ±f e R, contradicting —i = xix~ l . Therefore, x 2 = —r 2 for some real r. 
The element j, defined by j — x/r, satisfies j 2 = —1 and ji — —ij. The list 1, i, j , ij is 
linearly independent over R: if a+bi+cj +dij =0, then (—di—c)j — a + ib e C. Since 
j f. C (lest x e C), we must have — di —c — 0 — a + bi. Hence, a = b — 0 = c = d. 
Since [D : R] = 4, the list 1, i, j, i j is a basis of D. It is now routine to see that if we 
define k — ij, then ki — j — —ik, jk — i — —kj, and k 2 — — 1, and so D = H. • 

In 1929, R. Brauer introduced the Brauer group to study division rings. Since construc¬ 
tion of division rings was notoriously difficult, he considered the wider class of central 
simple algebras. Brauer introduced the following relation on central simple /'-algebras. 

Definition. Two central simple /-algebras A and B are similar, denoted by A ~ B, if 
there are integers n and m with 

A Mat„(Z) = B ®k Mat,„(/). 

By the Wedderburn theorem, A = Mat,, (A) for a unique division algebra A over k, and 
we shall see that A ~ B if and only if they determine the same division algebra. 
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Lemma 9.125. Let A be a finite-dimensional algebra over a field k. If S and T are 
k-subalgebras of A such that 

(i) st — ts for all s e S and t e T; 

(ii) A = ST; 

(iii) [A:k] = [S: k][T : k], 
then A = S ®k T. 

Proof. There is a A:-linear transformation /: S <S)k T -> A with s®f h> st, because 
(,s , t) i— st is a A;-bilinear function S x T A. Condition (i) implies that / is an algebra 
map, for 

f((s <g> t)(s' ® t')) = f(ss' ® tt') — ss'tf = sts't' = f(s <8> t ) f (s' <g> t'). 

Since A — ST, by condition (ii), the A:-linear transformation / is a surjection; since 
dim^(.S' ®it T) — dimife(A), by condition (iii), / is a A:-algebra isomorphism. • 

Lemma 9.126. Let k be afield. 

(i) If A is a k-algebra, then 


A Mat„(A;) = Mat,,(A). 

(ii) Mat,, (A:) ® k Mat,,, (A:) = Mat„„, (A:). 

(iii) A ~ B is an equivalence relation. 

(iv) If A is a central simple algebra, then 

A ® k A op = Mat,,(A;), 


where n = [A : A:]. 

Proof, (i) Define Ar-subalgebras of Mat,, (A) by 

S — Mat,,(A:) and T — {aI ; a e A}. 

If s e S and t e T, then st = ts (for the entries of matrices in S commute with elements 
a e A). Now S contains every matrix unit Ejj (whose ij entry is 1 and whose other entries 
are 0), so that ST contains all matrices of the form ajjEij for all ij, where a; ; - e A; hence, 
ST = Mat,,(A). Finally, [S : k][T : k] = n 2 [A : k] = [Mat„(A) ; k]. Therefore, 
Lemma 9.125 gives the desired isomorphism. 

(ii) If V and W are vector spaces over k of dimensions n and m, respectively, it suffices to 
prove that Endi(V) End,t(W) = End,t(V ®k W). Define S to be all / <g> 1 w- where 
/ e Liufi-tV 7 ), and define T to be all ly ® g, where g e Eiid/TW). It is routine to check 
that the three conditions in Lemma 9.125 hold. 
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(iii) Since k = Mati(Ar), we have A = A 0k k = A 0k Mati(Ar), so that ~ is reflexive. 
Symmetry is obvious; for transitivity, suppose that A ~ B and B ~ C; that is, 

A 0k Mat,, (A;) = B 0k Mat m (k) and B 0k Mat r (^) = C 0k Mat s (Ar). 

Then A 0k Mat„(Ar) 0k Mat,-(A:) = A 0k Mat,,,-( A), by part (ii). On the other hand, 

A 0k Mat,, (A:) 0k Mat,- (k) = B 0k Mat„,(A:) 0k Mat r (Ar) 

= C 0k Mat m (A:) 0k Mafl (A) 

— C 0k Mat w ,^(A). 

Therefore, A ~ C, and so ~ is an equivalence relation. 

(iv) Define/: AxA op -> Endj.(A) by f(a, c) — X a op c , where X a : x ax and p c : x t-> 

xc\ it is routine to check that X u and p c are Ar-majrs (so their composite is also a A:-map), and 
that / is A:-biadditive. Hence, there is a A>map f ; A 0k A op —>■ End^ (A) with /{a 0 c) = 
X a o p c . Associativity a (xc) — (ax)c in A says that X a o p c = p c oX a , from which it easily 
follows that / is a A-algehra map. As A 0k A op is a simple Ar-algebra and ker / is a proper 
two-sided ideal, we have / injective. Now dim^(Endt-(A)) = dimi.(Hom^-(A, A)) = n 2 , 
where n — [A : k ]. Since dim^fim/) = dim<.-(A 0k A op ) = n 2 , it follows that / is a 
A:-algebra isomorphism: A 0k A op = End*,(A). • 

We now extend Theorem 9.118 from division algebras to central simple algebras. 

Theorem 9.127. Let Abe a central simple k-algebra over afield k, so that A = Mat,. (A), 
where A is a division algebra over k. If E is a maximal subfield of A, then E splits A; that 
is, there is an integer n and an isomorphism 

E 0k A = MatnCE). 

More precisely, if [A : E] = s, then n — rs and [A : k] = (rs) 2 . 

Proof. By Theorem 9.118, A is split by a maximal subfield E (which is, of course, a 
finite extension of k ): E 0k A = Mat v (£), where s — [A : E] — [£ : A:]. Hence, 

E 0k A = E 0k Mat r (A) = E 0k (A 0k Mat,-(A:)) 

= (E 0k A) 0k Mat, (A:) = Mat. s (£') 0k Mat, (A) = Mafi-sfE). 

Therefore, A = Mat r (A) gives [A \ k] = r 2 [A : k] = r 2 s 2 . • 

Definition. If [A] denotes the equivalence class of a central simple A:-algebra A under 
similarity, define the Bremer group Br (k) to be the set 

Br(A:) = {[A] : A is a central simple Ar-algebra} 

with binary operation 

[A][B] = [A 0 k B], 
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Theorem 9.128. Br (k) is an abelian group for every field k. Moreover, if A = Mat,, (A) 
fora division algebra A, then A is central simple and [A] = [A] in Br(£). 

Proof We show that the operation is well-defined: If A, A', B, B' are ^-algebras with 
A ~ A' and B ~ B\ then A <S>k B ~ A' <g>k B'. The isomorphisms 

A ®/. Mat„(£) = A' ®k Mat„, (k) and B (E)k Mat,-(£) = B' <S>k Mat f (fc) 

give A ®£ B <3)k Mat,, (k) ®£ Mat,-(A:) = A' ®k B' ®k Mat,,, (A:) ®£ Mat s (Ar) (we are using 
commutativity and associativity of tensor product), so that Lemma 9.126(h) gives A <E)k 
B ®£ Mat,,,-(A:) = A' ®* B' ®k Mat,„. s (A:). Therefore, A ®£ B ~ A' ®*- B'. 

That [A:] is the identity follows from A:®* A = A, associativity and commutativity follow 
from associativity and commutativity of tensor product, and Lemma 9.126(iv) shows that 
[A] -1 = [A op ]. Therefore, Br(£) is an abelian group. 

If A is a central simple Ar-algebra, then A = Mat,-(A) for some finite-dimensional 
division algebra A over k. Hence, k — Z(A) = Z(Mat,-(A)) = Z(A), by Theo¬ 
rem 9.112. Thus, A is a central simple ^-algebra, [A] e Br(£), and [A] = [A] (because 
A Mat,-(A:) = Mat,-(A) = A = A <g>k k = A <&k Mati(A:)). • 

The next proposition shows the significance of the Brauer group. 

Proposition 9.129. Ifk is afield, then there is a bijection from Br(A ) to the family T> of all 
isomorphism classes of finite-dimensional division algebras over k, and so \ Br(A:)| = \D\. 
Therefore, there exists a noncommutative division ring, finite-dimensional over its center 
k, if and only ifBr(k ) f {0}. 

Proof Define a function <p: Br(A:) —> 27 by setting <p([A]) to be the isomorphism class 
of A if A = Mat„(A). Note that Theorem 9.128 shows that [A] = [A] in Br(Ar). Let us 
see that (p is well-defined. If [A] = [A'], then A ~ A', so there are integers n and m with 
A ®jt Mat,, ( k ) = A' ®k Mat„, (k). Hence, Mat,, (A) = Mat,,, (A'). By the uniqueness in the 
Wedderburn-Artin theorems, A = A' (and n — in). Therefore, ^([A]) = ^([A']). 

Clearly, q> is surjective, for if A is a finite-dimensional division algebra over k , then 
the isomorphism class of A is equal to ^([A]). To see that <p is injective, suppose that 
(p([ A]) = <^>([ A']). Then, A = A', which implies A ~ A'. • 

Example 9.130. 

(i) If k is an algebraically closed field, then Theorem 9.114 shows that Br (k) — {0}. 

(ii) If k is a finite field, then Wedderburn’s Theorem 9.123 (= Theorem 8.23) shows that 
Br(*) = {0}. 

(iii) If k = M, then Frobenius’s Theorem 9.124 shows that Br(R) = I 2 . 

(iv) It is proved, using class field theory, that Br(Q p ) = Q/Z, where Q p is the field of 
p-adic numbers. Moreover, there is an exact sequence 

0 -> Br(Q) Br(R) © Br (Qp) Q/Z 0. 

p 
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If we write Br(R) = + Z) c Q/Z, then (p is the “sum of coordinates” map. 

In a series of deep papers, Br(A') was computed for the most interesting fields k aris¬ 
ing in algebraic number theory ( local fields, one of which is Q p , and global fields) by 
A. A. Albert, R. Brauer, H. Hasse, and E. Noether. ■* 


Proposition 9.131. If E / k is afield extension, then there is a homomorphism 

f E/ k : Br(fc)-*Br(£) 


given by [A] i-> [£ (E)k A], 

Proof. If A and B are central simple ^-algebras, then E <Sn : -4 and E ®k B are central 
simple E-algebras, by Theorem 9.112. If A ~ B, then E A ~ E ®£ B as E-algebras, 
by Exercise 9.77 on page 740. It follows that the function fE/k is well-defined. Finally, 
fE/k is a homomorphism, because 

(£ ®k A) ®e (E ®k B) = (E ®£ E) ®£ (A ®k B) = E ®k (A B), 
by Proposition 8.84, associativity of tensor product. • 


Definition. If E/k is a field extension, then the relative Brauer group, Br( E/k), is the 
kernel of homomorphism fE/k'- Br (k) -> Br(E): 

Br (E/k) = kerf E/k = {[A] e Br(ifc) : A is split by £}. 


Corollary 9.132. For every field k, we have 

Br (k) = IJ Br (E/k). 
E/k finite 


Proof. This follows at once from Theorem 9.127. • 

In a word, the Brauer group arose as a way to study division rings. It is an interesting 
object, but we have not really used it seriously. For example, we still know no noncommu- 
tative division rings other than the real division algebra H and its variants for subfields k 
of R. We will remedy this when we introduce crossed product algebras in Chapter 10. For 
example, we will see, in Corollary 10.133, that there exists a division ring whose center is 
a field of characteristic p > 0. For further developments, we refer the reader to Jacobson, 
Finite-Dimensional Division Algebras over Fields, and Reiner, Maximal Orders. 
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Exercises 

9.72 Prove that HI HI = Mat 4 (R) as R-algebras. 

Hint. Use Corollary 8.60 for the central simple R-algebra HI EL 

9.73 We have given one isomorphism C ®r HI = Mat 2 (C) in Example 9.113. Describe all possible 
isomorphisms between these two algebras. 

Hint. Use the Skolem-Noether theorem. 

9.74 Prove that C C = C x C as R-algebras. 

9.75 (i) Let C(y) and C(y) be function fields. Prove that R — C(x) ®c C(y) is isomorphic to a 

subring of <C(y, y). Conclude that R has no zero divisors. 

(ii) Prove that C(y) ®c C(y) is not a field. 

Hint. Show that R is isomorphic to the subring of C(v, y) consisting of polynomials 
of the form f{x, y)/g(x)h(y). 

(iii) Use Exercise 8.39 on page 573 to prove that the tensor product of artinian algebras need 
not be artinian. 

9.76 Let A be a central simple L-algebra. If A is split by a field E , prove that A is split by any field 
extension E' of E. 

9.77 Let E/k be a field extension. If A and B are central simple L-algebras with A ~ B, prove that 
E A ~ E B as central simple £-algebras. 

9.78 If D is a finite-dimensional division algebra over R, prove that D is isomorphic to either R, 
C, or EL 

9.79 Prove that Mat 2 (HI) = HI Mat 2 (R) as R-algebras. 

9.80 (i) Let A be a four-dimensional vector space over Q, and let 1, /, j, k be a basis. Show that 

A is a division algebra if we define 1 to be the identity and 

j 2 = -2 k 2 = -2 

jk = 2i ki = j 

kj — —2 i ik = —j 

Prove that A is a division algebra over Q. 

(ii) Prove that Q(i) and Q(j) are nonisomorphic maximal subfields of A. 

9.81 Let D be the Q-subalgebra of HI having basis 1, i, j , k. 

(i) Prove that D is a division algebra over Q. 

Hint. Compute the center Z(D). 

(ii) For any pair of nonzero rationals p and q, prove that D has a maximal subfield isomor¬ 
phic to Q(\/— p 2 — q 2 )- 

Hint. Compute (pi + qj) 2 . 

9.82 (Dickson) If D is a division algebra over a field k, then each d e D is algebraic over k. Prove 
that d, d' € D are conjugate in D if and only if irr(<7, k) = irrlrf', k). 

Hint. Use the Skolem-Noether theorem. 

9.83 Prove that if A is a central simple k-algebra with A ~ Mat,, (k), then A = Mat m (k) for some 
integer m. 


ij = k 
ji = -k 
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9.84 Prove that if A is a central simple k-algebra with [A] of finite order m in Br(k), then 

A A = Mat,- ( k ) 

(there are m factors equal to A) for some integer r. (In Chapter 10, we shall see that every 
element in Br(fc) has finite order.) 


9.8 Exterior Algebra 

In calculus, the differential df of a differentiable function fix. y ) at a point P — (xq , yo) 
is defined by 

df\p = §£|p(*-*o) + ^\p(y-yo)- 

If ( x, y ) is a point near P, then df\p approximates the difference between the true value 
f(x. v) and fix o, yo)- The quantity df is considered “small,” and so its square, a second- 
order approximation, is regarded as negligible. For the moment, let us take being negligible 
seriously: Suppose that 

(df) 2 « 0 

for all differentials df. There is a curious consequence: if du and dv arc differentials, then 
so is du + dv = diu + v). But idu + dv) 2 ss 0 gives 

0 idu + dv) 2 

~ idu) 2 + du dv + dv du + idv) 2 
Ri du dv + dv du, 

and so du and dv anticommute: 

dv du ss —du dv. 

Now consider a double integral ff D fix, y)dx dy, where D is some region in the plane. 
Equations 


x = Fiu, v) 
y — Giu, v) 


lead to the change of variables formula: 


JJ fix,y)dxdy — JJ fiFiu, v), 

where A is some new region and J is the Jacobian: 


Giu, v))Jdu dv, 


J = 
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A key idea in the proof of this formula is that the graph of a differentiable function f (x . y) 
looks, locally, like a real vector space—its tangent plane. Let us denote a basis of the 
tangent plane at a point by dx,dy. If du, civ is another basis of this tangent plane, then the 
chain rule defines a linear transformation by the following linear equations: 

dx = F u du + F v dv 
dy = G u du + G v dv. 

The Jacobian J now arises in a natural way. 

dx dy = ( F n du + F v dv)(G u du + G v dv ) 

= F u duG u du + F u duG v d v + F v dvG„du + F v dvG v dv 
= F u G u (du) 2 + F u G v dudv + F v G n dvdu + F v G v (dv) 2 
F u G v du dv + F v G u dv du 
(F U G V F v G„)du dv 

,. r f u f v i 

= det dudv. 

Gu G v _ 

Analytic considerations, involving orientation, force us to use the absolute value of the 
determinant when proving the change of variables formula. 

In the preceding equations, we used the distributive and associative laws, together with 
anticommutativity; that is, we assumed that the differentials form a ring in which all 
squares are 0. The following construction puts this kind of reasoning on a firm basis. 

Definition. If M is a A-module, where A; is a commutative ring, then its exterior algebra 20 
is f\ ( M) — T(M)/J, pronounced “wedge M” where J is the two-sided ideal generated 
by all m <g> m with m e M. The image of m\ ® • • • ® m p in /\(M) is denoted by 

m i A • • • A nip. 

Notice that J is generated by homogeneous elements (of degree 2), and so it is a graded 
ideal, by Proposition 9.95. Hence, /\{M) is a graded A-algebra, 

/\(M) = k ® M © /\'{M) © /\\m) © • • • , 

where, for p > 2, we have f\ p (M) = T P (M)/J P and J p = J C 1 T P (M). Finally, /\(M) 
is generated, as a ^-algebra, by f\ l (M) = M. 

Definition. We call /\ P (M) the pth exterior power of a ^-module M. 

20 The original adjective in this context—the German ausserer, meaning “outer”—was introduced by Grass- 
mann in 1844. Grassmann used it in contrast to inner product. The first usage of the translation exterior can be 
found in work of E. Cartan in 1945, who wrote that he was using terminology of Kaehler. The wedge notation 
seems to have been introduced by Bourbaki. 
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Lemma 9.133. Let k be a commutative ring, and let M be a k-module. 

(i) Ifhi, m' e M, then in f\ 2 (M), we have 

m A m' = —m A m. 

(ii) If p > 2 and m; = m j for some i j, then m\ A • • • A m p — 0 in f\ P (M ). 

Proof, (i) Recall that /\ 2 (M) — (M®^M)/7 2 , where J 2 — J If m,m' e M, 
then 

(m + m') ® (m + m) = m ® m + m ® m + m ® m + wr <g> m'. 

Therefore, 

m <g> m r + / 2 = —® m + J 2 , 

because J 2 contains (m + m') <g> (m + m'), m <g> m, and m' <g> m'. It follows that 

m A m — —m’ A m 


for all m, m’ e M. 

(ii) As we saw in the proof of Proposition 9.95, /\ P (M) — T p (M)/J p , where J p = 
J fl T P (M) consists of all elements of degree p in the ideal J generated by all elements 
in T 2 (M ) of the form m <8> m. In more detail, J p consists of all sums of homogeneous 
elements a ® m ® m <g> ft, where m e M, a e T q (M), f e T r (M), and q + r + 2 = p; 
it follows that m i A • • • A m p = 0 if there are two equal adjacent factors, say, m, — 

Since multiplication in / \(M ) is associative, however, we can (anti)commute a factor m; 
of mi A • • • A m p several steps away at the possible cost of a change in sign, and so we can 
force any pair of factors to be adjacent. • 

One of our goals is to give a “basis-free” construction of determinants, and the idea is 
to focus on some properties that such a function has. If we regard an n x n matrix A as 
consisting of its n columns, then its determinant, det(A), is a function of n variables (each 
ranging over u-tuples). One property of determinants is that det(A) = 0 if two columns 
of A are equal, and another property is that it is multilinear. It will be seen that these 
properties almost characterize the determinant. 

Definition. If M and N are ^-modules, a ^-multilinear function /: x p M N (where 
x. p M is the cartesian product of M with itself p times) is alternating if 

f(mi,...,m p ) = 0 
whenever m; = mj for some i j. 

An alternating R-bilinear function arises naturally when considering (signed) areas in 
the plane R 2 . If Vi, V 2 e M 2 , define A( tq, V 2 ) to be the area of the parallelogram having 
sides v\ and V 2 ■ It is clear that 


A(r i>i, sv 2 ) — rsA(vi, V 2 ) 
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for all r,sel (but we must say what this means when these numbers are negative), and a 
geometric argument can be given to show that 

A(w i + vi, V 2 ) — A(w i, V 2 ) + A(vi, U 2 ); 

that is, A is R-bilinear. Now A is alternating, for A(v\, iq) = 0 because the degenerate 
“parallelogram” having sides v\ and tq has zero area. A similar argument shows that 
volume is an alternating R-multilinear function on R , as we see in vector calculus using 
the cross product. 

Theorem 9.134. For all p > 0 and all k-modules M, the pth exterior power f\^ (M) 
solves the universal mapping problem posed by alternating multilinear functions. 

x p M --->■ A P (M) 

y f 

N 

If h : x p M —> A P (M) is defined by h(m i, ..., m p ) = m i A • • • A m p , then for every 
alternating multilinear function f, there exists a unique k-homomorphism f making the 
diagram commute. 

Proof Consider the diagram 

x p M -->- A P (M) 



where h ’(mi. m p ) — m\ ® • • • <g> m p and v(mi ® ® m p ) = m\ A • • • A m p . Since 

/ is multilinear, there is a £-map f : T P (M) —>• N with f'h' = /; since / is alternating, 
J fl T P (M) C ker f', and so /' induces a map 

/: T p (M)/(J n T p (M)) -> N 


with fv = /'. Hence, 

fh = fvh' = f'h' = f. 

But T P (M)/(J fl T P {M )) = A P (M), as desired. Finally, / is the unique such map 
because im/z generates /\ P (M). • 
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Proposition 9.135. For each p > 0, the pth exterior power is a functor 

/\ P : * Mod —> k Mod. 

Proof. Now /\ P (M) has been defined on modules; it remains to define it on morphisms. 
Suppose that g: M —> M' is a ^-homomorphism. Consider the diagram 



— A p m, 

A p (g) 


where /(mi,..., m p ) = gm\ A • • • A gm p . It is easy to see that / is an alternating 
multilinear function, and so universality yields a unique map 

f\ P {g): /\”(M) —> f\ P {M') 
with mi A • • • A m p i—>• gm\ A • • • A gm p . 

If g is the identity map on a module M. then /\ p (g) is also the identity map, for it fixes 
a set of generators. Finally, suppose that g' : M' —> M" is a k- map. It is routine to check 
that both /\ p (g'g) and f\ p (g r ) f\ p (g ) make the following diagram commute 


x p M 



A P (M ") 


- A P (M), 


where F{m\, ..., m p ) = g'gm\ A • • • A g'gm p . Uniqueness of such a dashed arrow gives 
A P (g'g) = A P is') A P te)’ as desired. • 

We will soon see that f\ P i s not as n i ce as Horn or tensor, for it is not an additive functor. 

Theorem 9.136 (Anticommutativity). If M is a k-module, x e ff’(M ), and y e 
A 9 (37), then 

x A y = (-1 ) pq y A x. 

Remark. This identity holds only for products of homogeneous elements. ■* 

Proof. If x e A °(M) = k, then A (37) being a ^-algebra implies x A y = y A x for all 
y e A(37), and so the identity holds, in particular, for y e A 9 (37) for any q. A similar 
argument holds if y is homogeneous of degree 0. Therefore, we may assume that p,q > 1; 
we do a double induction. 
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Base Step: p — 1 and q = 1. Suppose that x, y e /\* (M) = M. Now 

0 = (x + y) A (x + y) 

=xAx+xAy+yAx+yAy 
= x Ay + y Ax. 

It follows that x Ay — —y A x, as desired. 

Inductive step: (p, 1) =>■ (p + 1, 1). The inductive hypothesis gives 

(x i A • • • A x p ) Ay — (— \) p y A (xi A • • • A x p ). 

Using associativity, we have 

(xi A • • • A X p+ 1) A y = XI A [(X2 A • • • A X p+ \) A y] 

— X\ A (-l) P [y A (x 2 A • • • A Xp+i)] 

= [x\ A (-l) P y] A (x 2 A • • • A X p+ 1 ) 

= (-l) P+ 1 (yAxi) A (X 2 A • • • AXp+i). 

Inductive Step: ( p , < 7 ) =>■ (/?, q + 1). Assume that 

(xi A ■ • • A x p ) A (yi A ■ ■ • A y q ) = 

(-1 ) pq (yi A •• • Ay q ) A (xi A ■■■ AXp). 

We let the reader prove, using associativity, that 

(xt A • • • A Xp) A (yi A • • • A y q+l ) = 

(-l) p ( 9+I) (yi A • • • Aj, + i) A (xi A • • • A Xp). • 

Definition. Let n be a positive integer and let 1 < P < n. An increasing p < n-list is a 
list 

H = i 1 ,..., i p 

for which 1 < i\ < i 2 < • • • < i p < n. 

If H = i'i , ..., i p is an increasing p < rz -list, we write 

eH = ei ! Ae i2 A ■■■ A e ip . 

Of course, the number of increasing p < u-lists is the same as the number of p-subsets 
of a set with n elements, namely, ("). 

Proposition 9.137. Let M be finitely generated, say, M = (e \, ..., e n ). If p > 1, then 
f\ p (M) is generated by all elements of the form en, where H — i\,... ,i p is an increasing 
p < n-list. 
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Proof. Every element of M has some expression of the form a,g,-, where a; e k. We 
prove the proposition by induction on p > 1. Let mi A ■ • • A m p +\ be a typical generator 
of /\ P+1 (M). By induction, 

m i A • • • A trip — 

H 

where cih e k and // is an increasing p < n-list. If m p+ \ = bje j, then 

m t A • • • A mp +1 = (^2a H e H ) A (^2 b j e j)- 
h j 

Each gj in ^ can be moved to any position in en = g (1 A • • • A g,- (with a possible 
change in sign) by (anti)commuting it from right to left. Of course, if g j — ei, for any 
l, then this term is 0, and so we can assume that all the factors in surviving wedges are 
distinct and are arranged with indices in ascending order. • 

Corollary 9.138. If M can be generated by n elements, then f\ p (M) = {0} for all p > n. 

Proof. Any wedge of p factors must be 0, for it must contain a repetition of one of the 
generators. • 

Definition. If V is a free A-module of rank n, then a Grassmann algebra on V is a 
A'-algebra G(V) with identity element, denoted by go, such that 

(a) G(V ) contains (go) © V as a submodule, where (go) = k\ 

(b) G(V) is generated, as a k-algebra, by (go) © V ; 

(c) v 2 = 0 for all v e V; 

(d) G(V ) is a free A'-module of rank 2". 


The computation on page 741 shows that the condition v 2 = 0 for all v e V implies 
vu = —uv for all u, v e V. A candidate for G(V ) is /\(V) but, at this stage, it is not clear 
how to show that /\(V) is free and of the desired rank. 

Grassmann algebras carry a generalization of complex conjugation, and this fact is the 
key to proving their existence. If A is a A-algebra, then an algebra automorphism is a 
A-algebra isomorphism of A with itself. 

Theorem 9.139. Let V be a free k-module with basis e\,. .., e n , where n > 1. 

(i) There exists a Grassmann algebra G ( V ) with an algebra automorphism u (—> u, 
called conjugation, such that 

u — u\ 

eo = e 0 ; 

v — — v for all v e V. 
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(ii) The Grassmann algebra G(V ) is a graded k-algebra 

G(V) = J2 gP (V)’ 


where 


G P (V) = (en : where H is an increasing p-list) 


[we have extended the notation en = e (1 A • • • A e/ in f\ p (V) to en = eq • • • ei p in 
G p (V)]. Moreover, G p (V) is a free k-module with 


rank(G^(V)) = 


Proof, (i) The proof is by induction on n > 1. The base step is clear: If V — (ei) = k, set 
G(V) = (eo) © (<?i); note that G ( V) is a free k-module of rank 2. Define a multiplication 
on G(V) by 

eoeo = eo; <?o<?i — e\ — ejeo; e\e\ = 0. 

It is routine to check that G(V) is a k-algebra that satisfies the axioms of a Grassmann 
algebra. There is no choice in defining the automorphism; we must have 

ae o + be i = ae o — be\. 

Finally, it is easy to see that u i-^- u is the automorphism we seek. 

For the inductive step, let V be a free k-module of rank/! + 1 and let e\, ..., e„+i be 
a basis of V. If W = {e\,..., e n ), then the inductive hypothesis provides a Grassmann 
algebra G(W), free of rank 2", and an automorphism u i->- u for all u e G(W ). Define 
G( V ) = G(W) © GiW), so that GiV) is a free module of rank 2" + 2" = 2" +1 . We make 
G(V ) into a k-algebra by defining 

(x\, X2)(>'| , >'2) = (x\y\, X2fi +Xiy2). 

We now verify the four parts in the definition of Grassmann algebra. 

(a) At the moment, V is not a submodule of G(V). Each v e V has a unique expression of 
the form v — w + ae n+ \, where w e VV and a e k. The k-map V -* G(V), given by 

v — w + ae n +1 i—► (w, ae o), 

is an isomorphism of k-modules, and we identify V with its image in G(V). In particular, 
e n+ \ is identified with (0, eo)- Note that the identity element eo e G(W) in G(W) has been 
identified with (eo, 0) in G(V), and that the definition of multiplication in G (V) shows that 
(eo, 0) is the identity in G(V). 

(b) By induction, we know that the elements of (eo) © W generate G(W) as a k-algebra; 
that is, all (xi, 0) e G (W ) arise from elements of W. Next, by our identification, e„+i = 
(0, e 0 ), 

(xi,0)e„ + i = (xi, 0)(0, e 0 ) = (0,xi), 
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and so the elements of V generate all pairs of the form (0, xf). Since addition is coordi- 
natewise, all (x\ , * 2 ) = (xi, 0) + (0, xf) arise from V using algebra operations. 

(c) If v e V, then v = w + ae n + 1 , where w e IT, and v is identified with ( w, ae 0 ) in 
G(V). Hence, 

2 2 _ 

v = (w, aef)(w, ae 0 ) = (w , aeow + aeow). 

Now w 2 — 0, and w = —w, so that v 2 = 0. 

(d) rankG(V) = 2" +1 because G(V ) = G(W) © G(W). 

We have shown that GiV) is a Grassmann algebra. Finally, define conjugation by 


(Xl,X 2 ) = O'l, -X 2 ). 

The reader may check that this defines a function with the desired properties. 

(ii) We prove, by induction on n > 1, that G p (V) = (e h : where H is an increasing /7-list) 
is a free A:-module with the displayed products as a basis. The base step is obvious: If 
rank(V) = 1, say, with basis e\, then GiV) — (eo, e\)\ moreover, both G°(V) and G l (V) 
are free of rank 1. 

For the inductive step, assume that V is free with basis ei, ..., e n+ \. As in the proof 
of part (i), let W = (e\ ..., e n ). By induction, G P (W ) is a free ^-module of rank ('') 
with basis all en, where H is an increasing p < n-list. Here are two types of ele¬ 
ments of G P (V ): elements e h € G(W ), where H is an increasing p < n-list; elements 
eu = e n +\, where H is an increasing p < (n + l)-list that involves e n +\. We 

know that the elements of the first type comprise a basis of G(W). The definition of mul¬ 
tiplication in G(V ) gives ene n +1 = (en , 0)(0, eo) = (0, en)- Thus, the number of such 
products is ( " j). As GiV) = G(W) © G(W), we see that the union of these two types of 

products form a basis for G p (V), and so ranklG^V)) = (") + ( " j) = ("p 1 )- 

It remains to prove that G p (V)G q (V) C G p+q (V). Consider ■ ■ ■ e, p ej 1 ■ ■ ■ ej late. 
If some subscript i r is the same as a subscript j s , then this product is 0 because it has 
a repeated factor; if all the subscripts are distinct, then this product lies in G p+q (V), as 
desired. Therefore, GiV) is a graded /.-algebra whose graded part of degree p is a free 
k -module of rank ("). • 


Theorem 9.140 (Binomial Theorem). If V is a free k-module of rank n, then there is 
an isomorphism of graded k-algebras, 

/\{V) = G(V). 

Thus, /\ P (V) isafreek-module,foraIl p > 1, with basis all increasing p < n-lists, hence 

rank(/\V)) = ('')• 
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Proof. For any p > 2, consider the diagram 

x p V --- f\ p (V), 

Sp 

G P (V) 

where g p ( tq ,..., v p ) — tq ■ ■ ■ v p . Since v 2 — 0 in G P (V ) for all v e V, the function g p 
is alternating multilinear. By the universal property of exterior power, there is a (unique) 
k-homomorphism 'gp: f\ p (V) —>• G P (V) making the diagram commute; that is, 

gp(ft A • • • A v p ) = i>i • • • v p . 

If e \,..., e n is a basis of V, then we have just seen that G P (V) is a free k-module with 
basis all eq • • • e ip , and so 'gp is surjective. But /\ P (V) is generated by all e h A ■■■ A 
e, . by Proposition 9.137. If some /.'-linear combination Zh a h(’h lies in kerg p , then 
Z a H^p( e n) = 0 in G P (V). But the list of images 'g p {eu) forms a basis of the free k- 
module G P {V), so that all the coefficients an — 0. Therefore, ker 'gp = {0}, and so g p is 
a k-isomorphism. 

Define y: /\(V) -> G(V) by y(Z p =o“p) = Z" P =o S P (u P ), so that y(A P (V)) c 
G P (V). We are done if we can show that y is an algebra map: y(u A v) — y(u)y(v). But 
this is clear for homogeneous elements of f\(V), and hence it is true for all elements. • 


Corollary 9.141. If V is a free k-module with basis e \, e n , then 

[\ (V) = (e\ A ■■ ■ Ae n ) = k. 

Proof. By Proposition 9.137, we know that /\ n (V) is a cyclic module generated by 
e\ A ••• A e n , but we cannot conclude from this proposition whether or not this element 
is zero. However, the binomial theorem not only says that this element is nonzero; it also 
says that it generates a cyclic module isomorphic to k. • 

Proposition 7.43 says that if T ; / Mod -> /. Mod is an additive functor, then T(V © V) 
= T (V) © T (V'). It follows, for p > 2, that /\ p is not an additive functor: if V is a free 
k-module of rank n, then /\ P (V © V) is free of rank ( 2 "), whereas /\ P (V) © f\ p (V ) is 
free of rank 2(''). 

An astute reader will have noticed that our construction of a Grassmann algebra GiV ) 
depends not only on the free k-module V but also on a choice of basis of V. Had we chosen 
a second basis of V, would the second Grassmann algebra be isomorphic to the first one? 

Corollary 9.142. Let V be a free k-module, and let B and B’ be bases of V. If G(V) 
is the Grassmann algebra defined using B and ifG'( V) is the Grassmann algebra defined 
using B’, then G(V) = G'(V ) as graded k-algebras. 
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Proof. Both G(V ) and G'(V) are isomorphic to f\(V), and the latter has been defined 
without any choice of basis. • 

A second proof of the binomial theorem follows from the next result. 

Theorem 9.143. For all p > 0 and all k-modules A and B, where k is a commutative 
ring , 

/\ P (A ®B) = J2 {A (A) ®k A P ~\b))- 

i =0 

Sketch of Proof. Let A be the category of all alternating anticommutative graded k- 
algebras R — Yl p >o R p ( r 2 — 0 f° r all r e R of odd degree and rs — (—1 ) pq sr if r e R p 
and .s' e S q ); by Theorem 9.136, the exterior algebra /\(A) e obj(_4) for every ^-module 
A. If R, S e obj(„4), then one verifies that 7?®* S — J2 p>0 (X!f=o ® k S p ~‘) e obj(.4); 
using anticommutativity, a modest generalization of Proposition 9.101 shows that A has 
coproducts. 

We claim that (/\, D) is an adjoint pair of functors, where f \: / Mod —> A sends A 
/\(A), and D : A -> j-Mod sends f2 p> Q R p \-+ R l , the terms of degree 1. If R = ff p R p , 
then there is a map jtr : /\(R [ ) —> R\ define r a,r : Hom_ 4 (/\(A), R) —> Hom^-(A, R l ) 
by <p i-a- tcr((p\A). It follows from Theorem 7.105 that /\ preserves coproducts; that is, 

A(A © B) = A (A) ®k A (B), and so /\ P (A ® B) = £f =0 (A f (A) ®* A • 

Here is an explicit formula for an isomorphism. In /\ 3 (A ® B), we have 

(a[ + b i) A («2 + ^ 2 ) A (03 + bf) — a\ A 02 A <33 + a\ A £>2 A ^3 

+ Z ?1 A fl 2 A fl 3 + fci A a fl 3 + fl! A fl 2 A 1?3 

+ a 1 A b 2 A £>3 + b[ A 02 A £>3 + b\ A £>2 A bj. 

By anticommutativity, this can be rewritten so that each a precedes all the /As: 

(ai + b 1 ) A («2 + bf) A (03 + bfl — ai A 02 A <33 — flj A 03 A £>2 

+ fl 2 A 03 A fo] + 03 A l?i A 62 + At A fl 2 A i >3 

+ flj A £>2 A £>3 — 02 A A £>3 + A £>2 A £> 3 . 

An i-shuffle is a partition of {1,2, ..., 72 } into two disjoint subsets /xi < ... < /Xj and 
u| < ... < v p _j\ it gives the permutation o e S p with cr(j) — ptj for j < i and 
o(i + l) — V( for j — i + l > i. Each “mixed” term in ( a\ + b 1 ) A (02 + bf) A (03 + bs) 
gives a shuffle, with the a’s giving the /x and the b’s giving the v; for example, a\ A £>2 A <33 
is a 2-shuffle and b\Aa 2 Ab 3 is a 1-shuffle. Now sgn(a) counts the total number of leftward 
moves of a’s so that they precede all the b’s, and the reader may check that the signs in the 
rewritten expansion are sgn(er). Define /: f\ p (A ® B) —> ^/\‘(A) 0^ /\ p ~'(B)^j 

by 

f(ai + b \,..., fl p + b p ) = Y. I Y sgn(er )a IM A ■ ■ ■ A a w 0 b n A • • • A b Vp _ t J . 

i =0 \i -shuffles o ! 
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Corollary 9.144. If k is a commutative ring and V is a free k-module of rank n, then 
/\ P (V) is a free k-module of rank (”). 

Proof Write V — k © B and use induction on rank(V). • 

We will use exterior algebra in the next section to prove theorems about determinants, 
but let us first note a nice result when k is a field and, hence, ^-modules are vector spaces. 

Proposition 9.145. Let k be a field, let V be a vector space over k, and let i>i,.. .,v p 
be vectors in V. Then Vi A • • • A v p — 0 in f\(V) if and only if v\, ... ,v p is a linearly 
dependent list. 

Proof. Since k is a field, a linearly independent list v\, ,..,v p can be extended to a basis 
v \,..., v p ,..., v n of V. By Corollary 9.141, t>i A • • • A v„ 0. But v\ A • • • A v p is a 
factor of i>i A • • • A v n , so that i>i A • • • A v p ^ 0. 

Conversely, if ui,..., v p is linearly dependent, there is some i with = Ej^ajvj, 
where aj e k. Hence, 

t>l A • • ■ A Vi A • • • A Vp = 1>1 A • • • A aj Vj A ■ • • A V p 

j+i 

— aj VI A • • • A Vj A • • • A Vp. 
i+i 

After expanding, each term has a repeated factor vj, and so this is 0. • 

We introduced exterior algebra, at the beginning of this section, by looking at Jacobians; 
we now end this section by applying exterior algebra to differential forms. Let X be an 
open connected 21 subset of euclidean space IR”. A function /: X -» R is called a C°°- 
function if, for all p > 1, the pth partials d p f/d p xj exist for all i = 1 ,...,«, as do all the 
mixed partials. 

Definition. If A is a connected open subset of K", define 

A (A) = {/:A->K:/isa C°°-function}. 

The condition that A be a connected open subset of R" is present so that C°°-functions 
are defined. It is easy to see that A (A) is a commutative ring under pointwise operations: 

/ + g: x h* f(x) + g(x ); fg: x h* f(x)g(x). 

In the free A(A)-module A(A)" of all n-tuples, rename the standard basis 

dx i,..., dx„. 

21 A subset X is open if. for each x e X, there is some r > 0 so that all points v with distance \y — x\ < r also 
lie in X. An open subset X of R n is connected if we can join any pair of points in A by a path lying wholly in X. 
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By the binomial theorem, each element co e /\ p (A(X) n ) has a unique expression 

°>= ^2 fh -i P dx h A * * * A dx ip , 
ii-i P 

where / (| j p e A(X) is a C°°-function on X and i\ ... i p is an increasing p < «-list. We 
write 

QP(X) = /\ P (A(X) n ), 

and we call its elements differential p-forms on X. 

Definition. The exterior derivative d p : Q P (X ) —> C /,+ l (X) is defined as follows: 

(i) If / e S2°(X) = A(X), then d°f = £" =l ^jdxy, 

(ii) If p > 1 and co e £l p {X), then co = ,• fi l ...i p dxi l A • • • A dxi p , and we define 

d p co = ^2 d°{fi i .j p ) A dxi t A • • • A dxi p . 

i\ ...ip 


If X is an open connected subset of R", the exterior derivatives give a sequence of 
A(X)-maps, called the de Rham complex: 

0 £2°(X) £2‘(X) 4 • • • -» S2"(X) 0. 

Proposition 9.146. IfX is a connected open subset ofW, then 

d p+l d p : Q P (X) Q. p+2 (X ) 


is the zero map for all p > 0. 

Proof. It suffices to prove that ddco — 0, where co = fdxi (we are using an earlier 
abbreviation: dxj — afxy, A • • • A dxi p . where / = i\, ..., i p is an increasing p < rz-list). 
Now 


ddco — d(d { ’f A x/) 


— d ( T dxi A dxj | 
YT dxi I 


-EE 


‘ j 


9 2 / 

dxj dxj 


dxj A dxj A dxj. 


Compare the j and j, i terms in this double sum: The first is 

3 2 / 

- dxj A dxi A dxp. 


3 Xi 3 xj 
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the second is 

3 2 / 

- dxj A dxj A dxi. 

dXjdXi 1 

But these cancel, for the mixed second partials are equal: 


dxi A dxj = —dxj A dxj. • 


Example 9.147. 

Consider the special case of the de Rham complex for n — 3. 


£2°(X) 


£2 J (X) 


Q 2 (X) 


Q 3 (X) 


If co e £2 0 (X), then co = f(x, y, z ) e A(X), and 

d°f = —dx + —dy + — dz, 

J dx dy 7 dz 

a 1-form resembling grad(/). 

If co e (X), then co — fdx + gdy + hdz, and a simple calculation gives d l co = 


dg 

dx 



dx A dy + 




dy Adz + 




dz A dx, 


a 2-form resembling curl(oj). 

If cu e £2 2 (20, then co = Fdy A dz + Gdz A dx + Hdx A dy. Now 

3 F 3 G 3 H 

d co — — 4- — -(--, 

dx dy dz 

a 3-form resembling div(&>). 

These are not mere resemblances. Since ^(X) is a free A(X)-module with basis 
dx, dy, dz, we see that d°co is grad(&>) when co is a 0-form. Now f2 2 (X) is a free A(X)- 
module, but we now choose a basis 


dx A dy, dy A dz, dz A dx 

instead of the usual basis dx A dy, dx A dz, dy A dz', it follows that d l co is curl(&>) in this 
case. Finally, f2 3 (X) has a basis dx A dy A dz, and so d 3 co is div(&>) when co is a 2-form. 
We have shown that the de Rham complex is 

0 £2°(X) Q\ X ) fi 2 (X) ^ 3 (X) 0. 

Proposition 9.146 now gives the familiar identities from advanced calculus: 

curl • grad = 0 and div • curl = 0. 
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We call a 1-form co closed if dco = 0, and we call it exact if co = grad/ for some 
C°°-function /. More generally, call a p -form co closed if d p co — 0, and call it exact 
if co = d p ~ l co' for some (p — l)-form co'. Thus, co e £2 P (X) is closed if and only if 
co e ker d p and co is exact if and only if a> e im d p ~ l . Therefore, the de Rham complex 
is an exact sequence of A(X )-modules if and only if every closed form is exact; this is 
the etymology of the adjective exact in “exact sequence.” It can be proved that the de 
Rham complex is an exact sequence whenever X is a simply connected open subset of 
R". For any (not necessarily simply connected) space X, we have imgrad c ker curl and 
imcurl C kerdiv, and the R-vector spaces ker curl/ imgrad and kerdiv/ imcurl are called 
the cohomology groups of X. ◄ 


Exercises 

9.85 Let G(V) be the Grassmann algebra of a free fc-module V, and let u — Yl p 11 p e G(V), 
where u p G G P (V) is homogeneous of degree p. If 17 is the conjugate of u in Theorem 9.139, 
prove that u — Ylpi~^) Pu p- 

9.86 (i) Let p be a prime. Show that /\~(I p © Ip) ^ 0, where I p © Ip is viewed as a Z-module 

(i.e., as an abelian group). 

(ii) Let D = Q/Z © Q/Z. Prove that /\ 2 (D) = 0, and conclude that if i: Ip © Ip —*■ D is 
an inclusion, then /\ 2 (i) is not an injection. 

9.87 (i) If k is a commutative ring and IV is a direct summand of a k-module M, prove that 

/\ P (N) is a direct summand of / \ P (M ) for all p > 0. 

Hint. Use Corollary 7.17 on page 434. 

(ii) If k is a field and i : W —>■ V is an injection of vector spaces over k, prove that /\ p (i) is 
an injection for all p > 0. 

9.88 Prove, for all p, that the functor /\ p preserves surjections. 

9.89 If P is a projective k-module, where k is a commutative ring, prove that /\ ? (P) is a projective 
k-module for all q. 

9.90 Let k be a field, and let V be a vector space over k. Prove that two linearly independent lists 
h 1 , ... ,u p and v\,... ,v p span the same subspace of V if and only if there is a nonzero c G k 
with a ■ ■ ■ A Up = ciq A ■ ■ ■ A v p . 

9.91 If U and V are R-modules over a commutative ring R and if U' C U and V' C V are 
submodules, prove that 

(U ®r V)/(U' ® R V + U® R V ') = (U/U') ®rV®U®r (V/V). 

Hint. Compute the kernel and image of (p: U ®r V —> ( U/U') ®r V © U ®r (V/V 1 ) 
defined by <p: u ® v i->- (u + U') ® v + u ® (v + V'). 

9.92 Define the symmetric algebra on a k-module M to be Si M) = T{M)/1, where I is the 
two-sided ideal generated by all m ® m' — m! ® m, where m, m' G M. 

(i) Prove that / is a graded ideal, so that S(M) is a graded k-algebra. 

(ii) Prove that S{M) is commutative. 
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9.93 


(i) Define a free commutative k-algebra , and prove that if M is the free k-module with 
basis X, then S(M) is the free commutative k-algebra on X. Conclude that S(M ) is 
independent of the choice of basis of the free k-module M. 

(ii) Define k[X\ to be the polynomial ring in commuting variables X if every u e k[X\ has a 
unique expression as a polynomial in finitely many elements of X. Prove that if M is the 
free k-module with basis X, then S(M) is the polynomial ring in commuting variables 
X 22 

(iii) Prove that if M is a free k-module of finite rank n, then S P (M) is a free k-module of 

rank C^Y 

Hint. Use the combinatorial fact that there are (” + p _1 ) ways to distribute p identical 
objects among n boxes. 

(iv) Prove that every commutative k-algebra is a quotient of a free commutative k-algebra. 


9.94 Let V be a finite-dimensional vector space over a field k. and let q : V -*■ k be a quadratic 
form on V. Define the Clifford algebra C(V,q) as the quotient C(V,q) — T(V)/J, where 
J is the two-sided ideal generated by all elements of the form v ® v — q(v)l (note that J is 
not a graded ideal). For v € V, denote the coset v + J by [it], and define h: V —>■ C(V,q ) by 
h(v) = [i>], Prove that C(V, q ) is a solution to the following universal problem: 


V ->C(V, ? ), 



A 


where A is a k-algebra and /: V —>■ A is a k-module map with f{v)~ = q(v) for all v e V. 

If dim(F) = n and q is nondegenerate, then it can be proved that dimfCfV, q)) = 2". In 
particular, if k = R and n — 2, then the Clifford algebra has dimension 4 and C(v,q) = H, 
the division ring of quaternions. Clifford algebras are used in the study of quadratic forms, 
hence of orthogonal groups; see lacobson, Basic Algebra II, pp. 228-245. 


9.9 Determinants 


We have been using familiar properties of determinants, even though the reader may have 
seen their verifications only over a field and not over a general commutative ring. Since 
determinants of matrices whose values lie in a commutative ring are of interest, the time 
has come to establish these properties in general, for exterior algebra is now available to 
help us. 

If k is a commutative ring, we claim that every k-module map y: k —> k is just multi¬ 
plication by some d e k: If y( 1) = d, then 

y(a ) = y(a 1 ) = ay(l) — ad — da 

“In the fourth section of Chapter 6, we assumed the existence of this big polynomial ring in order to construct 
the algebraic closure of a field. 

Our earlier definition of k[x, >’J as R\ y ], where R = k[A'], was careless. For example, it does not follow that 
k[x , y] = k[y, x], although these two rings are isomorphic. However, if M is the free k-module with basis x , y, 
then y, x is also a basis of k-algebra M, and so k[x, y] = k[y, x]. 
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for all a e k, because y is a A:-module map. Here is a slight generalization. If V — (v) = k, 
then every A: -map y : V —> V has the form y : av i-a- dav, where y(v) — dv. Suppose now 
that V is a free A:-module with basis e\, ..., e n \ Corollary 9.141 shows that /\"(V) is free 
of rank 1 with generator e\ A • • • A e n . It follows that every Ar-map y : f\"(V) —»• /\" (V) 
has the form y(a(e\ A • • • A e n )) — d(a(e\ A • • • A e n )). 

Definition. If V is a free A-module with basis e\ , .... e„. and if /: V —> V is a k- 
homomorphism, then the determinant of /, denoted by det(/), is the element det(/) e k 
for which 

f\ (/): e\ A • • • A e n f(e i) A • • • A f(e n ) 

= det(/)(ei A • • ■Ac,). 

If A = [a !; ] is an n x n matrix with entries in k , then A defines a A;-map /: A:" —>• k n 
by f(x) — Ax, where x e k n is a column vector. If e\, ..., e n is the standard basis of 
k n , then /(e,) = aj/ej, and the matrix A = [a,j] associated to / has /th column the 
coordinates of /(e;) = Ae We define det(A) = det(/): 

Aei A • • • A Ae„ = det(A)(ei A • • • A e„). 

Thus, the wedge of the columns of A in f\" ( k n ) is a constant multiple of e\ A • • • A e n , and 
det(A) is that constant. 

Example 9.148. 

If 



then the wedge product of the columns of A is 

(ae i + A><?2) A (ce i + c/e2) = acej A e\ + ade i A e2 + bce 2 Ae i + A>c/e2 A e2 

= ade i A g2 + ^ce2 A ei 
= ac/ei A e2 — bee i A e2 
= (ac/ — A>c)(ei A ej). 

Therefore, det(A) = ad — be. ■* 

The reader has probably noticed that this calculation is a repetition of the calculation on 

page 742 where we computed the Jacobian of a change of variables in a double integral. 

The next example considers triple integrals. 

Example 9.149. 

Let us change variables in a triple integral fff D f(x, y, z)dxdydz using equations: 

x = F(u , v, w ); 
y — G(u, v , w)\ 

Z — H(u, v, w). 
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Denote a basis of the tangent space T of f{x, y, z) at a point P — (xo, yo> zo) by dx, dy, 
dz. T \ f du, dv, dw is another basis of T, then the chain rule defines a linear transformation 
on T by the equations: 


dx — F u du + F v dv + F w dw 
dy — G„du + G v dv + G w dw 
dz — H u du + H v dv + H w dw. 


If we write the differential dxdydz in the integrand as dx A dy A dz, then the change of 
variables gives the new differential 


dx A dy A dz = det 


F u 

F v 

F w 

G u 

G v 

G w 

H u 

H v 

H w . 


du Adv A dw: 


expand 

(. F„du + F v dv + F w dw) A ( G u du + G v dv + G w dw ) A ( H u du + H v dv + H w dw ) 

to obtain nine terms, three of which involve (du) 2 , (dv) 2 , or (dw) 2 , and hence are 0. Of 
the remaining six terms, three have a minus sign, and it is now easy to see that this sum is 
the determinant. ■* 

Proposition 9.150. Let k be a commutative ring. 

(i) If I is the identity matrix, then det(7) = 1. 

(ii) If A and B are n x n matrices with entries in k, then 

det(Afi) = det(A) det(B). 

Proof Both results follow from f\" being a functor on * Mod. 

(i) The linear transformation corresponding to the identity matrix is 1^, and every functor 
takes identities to identities. 

(ii) If / and g are the linear transformations on k" arising from A and B, respectively, then 
fg is the linear transformation arising from AB. If we denote e\ A • • ■ A e n by e,v, then 

det (fg)e N = /\ ( fg)(eN ) 

= ig){e N )) 

= /\"(/)(det(g)ejv) 

= det (g)f\(f)(e N ) 

= det(f)det(g)e N : 



Sec. 9.9 Determinants 


759 


the next to last equation uses the fact that /\" (f) is a A'-map. Therefore, 
det(Afi) = det (fg) — det(/) det(g) = det(A) det(B). • 

Corollary 9.151. If k is a commutative ring, then det: Mat„(A) -» k is the unique 
alternating multilinear function with det(7) = 1. 

Proof. The definition of determinant (as the wedge of the columns) shows that it is an 
alternating multilinear function det: x" V —> k, where V — k n , and the proposition 
shows that det(7) = 1. The uniqueness of such a function follows from the universal 
property of f\". 

x"V--- /\ n (V) 



k 


If det' is multilinear, then there exists a A-map /: /\"(V) k with fh = det'; if 
det'(ei, ...,e n ) = 1, then f(e i A ••• A e„) = 1. Since f\ n (V) = k, every A-map 
/: /\"(V) -> A is determined by f(e\ A • • • A e„). Thus, the map / is the same for 
det' as it is for det, and so det' = fh — det. • 

We now show that the determinant just defined coincides with the familiar determinant 
function. 

Lemma 9.152. Let e\, ..., e n be a basis of a free k-module, where A is a commutative 
ring. If a is a permutation of 1,2, ... ,n, then 

e< 7 (i) A • • • A e a(n) — sgn(cr)(ei A • • • A e n ). 

Proof. Since m A m' = —m' A m, it follows that interchanging adjacent factors in the 
product e\ A • • • A e„ gives 

e\ A • • • A ei A e,' + i A • • • A e n = — e\ A • • • A e,+i A e,- A • • • A e n . 

More generally, if i < j, then we can interchange e, and ej by a sequence of interchanges 
of adjacent factors, each of which causes a sign change. By Exercise 2.7 on page 50, this 
can be accomplished with an odd number of interchanges of adjacent factors. Hence, for 
any transposition r e S n , we have 

e T (t) A • • • A e T ( n ) = e\ A • • • A ej A • • • A e,- A • • • A e„ 

— — [e\ A • • • A g; A • • • A ej A • • • A e n \ 

— sgn(r)(ei A • • • Ae„). 

We prove the general statement by induction on m, where a is a product of m transpo¬ 
sitions. The base step having just been proven, we proceed to the inductive step. Write 
o — X\X 2 ■ ■ ■ r m +i, and denote t 2 • • • x m+ \ by o'. By the inductive hypothesis. 


<V(t) A • • • A e a ’ {n) = sgn(er')et A • ■ ■ A e n . 
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and so 


Lr(l) A • • • A e a (n) — A • • • A e TlC r'( n ) 

= -e<j'(i) A • • • A e a f (n) (base step) 

= — sgn(cr , )(ei A • • • A e„) (inductive step) 
= sgn(n) sgn(ff , )(ei A • • • A e„) 

— sgn(cr)(ei A • • • Ae n ). • 


Remark. There is a simpler proof of this lemma in the special case when A: is a field. 
If k has characteristic 2, then Lemma 9.152 is obviously true, and so we may assume that 
characteristic k is not 2. Let e\,... ,e n be the standard basis of k". If cr <= S „, define a 
linear transformation rp a \ k" —> k" by <p a : e ; - i-> e a ^y Since (p aT = tp a (p r , as is easily 
verified, there is a group homomorphism d : S n —> k x given by d: a i-> det(<^o-). If a is a 
transposition, then cr 2 = (1) and d(a) 2 = 1 in k x . Since A: is a field, d(a) — ±1. As every 
permutation is a product of transpositions, it follows that d(a) — ±1 for every permutation 
a , and so im (d) < {±1}. Now there are only two homomorphisms S n —> {±1}: the 
trivial homomorphism with kernel S„ and sgn. To show that d = sgn, it suffices to show 
d(( 1 2)) jL 1. But d((l 2)) = det(^ ( i 2 )); that is. 


det(^ a 2))(ei A • • • Ae„) = <p ( i2)(et) A • • • A ^ ( i 2 )(e«) 

= e2 e\ A e?, A ■ ■ ■ A e„ 

= -(ei A • • • A e„). 


Therefore, d({ 1 2)) = —1^1, because k does not have characteristic 2, and so, for all 
cr e S n , d(a) = detl^o-) = sgn(a); that is, e a (\) A ■ ■ -Ae a („) — sgn(cr)(e\ A ■ ■ ■ Ae n ). ◄ 

Proposition 9.153 (Complete Expansion). Let e\,... ,e n bea basis of a free k-module, 
where k is a commutative ring. If A = [ajj] is an n x n matrix with entries in k, then 

det(A) = ^ ' sgn(cr)a <T (i) ! ia (T (2),2 • • • a<j(n),n- 

(7ES n 

Proof. Expand the wedge of the columns of A: 

a JA e h A J2 a hZ e h A • • • A a i" ne in 

h h in 

— a jA e ji A a j2^ e h A ‘ ‘ ‘ A a j n n e jn ■ 

jl.jl.~-Jn 

Any summand in which ej p — ej q must be 0, for it has a repeated factor, and so we may 
assume, in any surviving term, that j\ , ji. ..., j„ are all distinct; that is, there is some 
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permutation er e S n with j r — o(r) when 1 < r < n. The original product now has the 
form 

^ ' Cl a (\_)\a a (2)2 * * * tZcr(/i)/i^cr(l) A Ccf{2) A • • • A e 0 -(n). 
o&S n 

By the lemma, e a (i) A e a {2) A • • • A e a ( n ) = sgn(cr)(ei A • • • A e n ). Therefore, the wedge 
of the columns is equal to ',aeS„ s §, n ( a ) a (r(i)i a a{2)2 • • • a a ( n )n (<?i A • • • A e„), and this 
completes the proof. • 

Quite often, the complete expansion is taken as the definition of the determinant. 

Corollary 9.154. Let k be a commutative ring, and let A be an n x n matrix with entries 
in k. If u € k, then det (ul — A) = f(u), where fix) e k[x] is a monic polynomial of 
degree n. Moreover, the coefficient ofx n ~ l in fix) is — tr(A). 

Proof. Let A — [a/ 7 ] and let B — [ bij ], where bjj — uSjj—aij (where Sjj is the Kronecker 
delta). By the proposition, 

det(B) = ^ s & n(a ) b a0),tb< 7 (2),2 ■ ■ ■ ba(n),n- 
(JES n 

If a = (1), then the corresponding term in the complete expansion is 

b n b 2 2 ■ ■ -bnn = fl(M ~ «('(■) = 

i 

where g(x) — n, Q — a ii ) is a monic polynomial in k[x] of degree n. If a f (1), then the 
(7th term in the complete expansion cannot have exactly n — 1 factors from the diagonal 
of u I — A, for if a fixes n — 1 indices, then a = (1). Therefore, the sum of the terms 
over all a f (1) is either 0 or a polynomial in k[x] of degree at most n — 2. It follows that 
deg(/) = n and that the coefficient of x” -1 is — an — — tr(A). • 

Proposition 9.155. If A is an n x n matrix with entries in a commutative ring k, then 

det(A') = det(A), 

where A’ is the transpose of A. 

Proof. If A = [cijj], write the complete expansion of det(A) more compactly: 

det(A) = J2 sgn((7) ]~[ a a (i),i- 

(7 (zSp i 

For any permutation r e S n , we have i = x{j) for all /, and so 

[ a cr(r(j)),T(j)i 

i j 
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for this merely rearranges the factors in the product. Choosing x = o 1 gives 

W a (T(T(j)),T(j) = W a j,o~ l (j)- 
j J 

Therefore, 

det(A) = £ sgn^n^a-'O)- 
cs S n j 

Now sgn(cr) = sgn(a -1 ) [if a — x\ • • • x q , where the r are transpositions, then ct -1 = 
x q • ■ ■ ti]; moreover, as er varies over S„ , so does a~ l . Hence, writing a~ l — p gives 

det(A) = E sgnCp) ]~"[ a j,p(j)- 

peS„ j 

Now write A' = [bij], where bjj — ajj. Then 

det(A') = E sgn(p) ]~[ b p (j)j = sgn(p)n«T,0) = det(A). • 

peS„ j peS„ j 

We have already seen that the eigenvalues a\ . a n of an n x n matrix A, with entries 

in a field k, are the roots of the characteristic polynomial 

\//a(x) — det(v/ — A) e k[x]. 

We have seen that det(A) = ]”[, a i> anc l we are now going to see that tr(A) = a, . 
Proposition 9.156. If A — [a ( / ] is an n x n matrix with entries in a field k, then 


tr(A) = o'! + a 2 H-ha;,,. 

Proof In the complete expansion of det(x/ — A), the diagonal corresponds to the term 
(x — ao-(i),i)(x — a ( j( 2 ), 2 ) ■ • ■ (x — a a ( n ) M ) with a the identity permutation. If o ^ 1, then 
there are at least two terms off the diagonal, and so the degree of this term is at most n — 2. 
Therefore, the coefficient b n -\ of x"~ l in the diagonal term, namely, — JT an, coincides 
with the coefficient of x n ~ l in i/taW, namely, — JT a,- = — tr(A), where a\,... ,a n are 
the eigenvalues of A. On the other hand, 

iAaCO = ]~[(x - oti), 

i 

and so the coefficient of x" -1 is — a,-, as desired. • 

We know that similar matrices have the same determinant and the same trace. The next 
corollary generalizes these facts, for all the coefficients of their characteristic polynomials 
coincide. 
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Corollary 9.157. If A and B are similar n x n matrices with entries in a field k, then A 
and B have the same characteristic polynomial. 

Proof. There is a nonsingular matrix P with B — P A P~ l , and 

fnix) — det(x7 — B) 

= det(x7 - PAP _1 ) 

= det (^P(xl — A)P~ l ^j 

= det(P) det(x7 — A) det(P) -1 
= det(x7 — A) 

— ^a(x). • 


Definition. Let A be an n x n matrix with entries in a commutative ring k. If 77 = 
i i, ... ,i p and L — j[, ..., j p are increasing p < u-lists, then A ul is the p x p submatrix 
[a. sf ], where (s, t) e 77 x L. A minor of order p is the determinant of a p x p submatrix. 


For example, every entry aij is a minor of A — [a,j ] of order 1. If 


A = 


then some minors of order 2 are 


det 


an 

021 


a\\ 

012 

013 

021 

022 

023 

031 

032 

033 


012 

and det 

012 

013 

022 


032 

033 


In particular, if 1 < i < n, let i' denote the increasing n — 1 < «-list in which i is omitted; 
thus, an (n — 1) x (n — 1) submatrix has the form Apy, and its determinant is a minor of 
order n — 1. Note that Apr is the submatrix obtained from A by deleting its ith row and 
/'th column. 


Lemma 9.158. Let k be a commutative ring, and let x,,,..., x,- p e k n be regarded as 
columns of an n x p matrix A, where H = i\, ... ,i p is an increasing p < n-list. Then 

x il A • • • A x ip — ^2 det (A L ' H )eL, 

L 

where L varies over all increasing p < n-lists. 

Proof. For t= 1,2 ,.p, write x (< = a tl i e e te , so that 

x h A • • • a x ip = J2 a nh e h a • • • a ^ a tpip e tp 

h ip 

= a tih--- a tpip e h A ’" A V 

h-tp 
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All terms involving a repeated index are 0, so that we may assume that the sum is over all 
fi ... t p having no repetitions; that is, over all p-sublists T = {t\ ... t p } of {1, 2, ..., n}. 
Collecting terms, we may rewrite the sum as 

X X a hh ■ ■ ■ atpiptti A-AC, P . 

T T=[t u ...,t p } 

For any fixed p-sublist T = {/j,..., t p }, let L — l\,lo, ..., l p be the increasing p- 
list consisting of the integers in T ; thus, there is a permutation a e S p with i a (\) = 
t\, ..., t a (p) = tp- With this notation, 

X a hh • • • a ‘pip ( e >i A • • • A e ‘p ) = X a Mi)h • • • a t<,(p)ip ( e >l A ' ' ' A 

= X s g n ( ff ) a WV" a W'j> ei 

cr^Sp 

= det (A LyH )e L - • 

Multiplication in the exterior algebra /\(V) is determined by the products e/f A ex 
of pairs of basis elements. Let us introduce the following notation: If H = t\ ... t p and 
K — t\ ... i q are disjoint increasing lists, then define th.k to be the permutation that 
rearranges the list t\ .. .t p , l \ ■ ■ -l q into an increasing list, denoted by H * K. Define 

Ph,k = sgnfTtf,*:). 

With this notation. Lemma 9.152 says that 

| 0 if H n K £ 0 

CH AeK ~ \pH,Ke H *K if HHK = 0. 


Example 9.159. 

If H = 1,3,4 and K — 2, 6 are increasing lists, then 

H*K = 1,2,3,4,6, 


and 

Therefore, 

and 


th.k 


(1 3 4 2 6\ 
V 2 3 4 6) 


(2 4 3). 


Ph.k = sgn r H K = +1, 


e H A eic = (<?i A ej, A e/s) A (e2 A e (,) = e\ A ei A e?, A A e(, — en*K- 
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Proposition 9.160. Let A = [atj] be an n x n matrix with entries in a commutative ring k. 

(i ) If I = i i, ..., ip is an increasing p-list and x;,, ..., x,- p are the corresponding 
columns of A, then denote x/, A • • • A x; by X/. If J — j\,..., j q is an increasing 
q-list, then 

x/ A xj = E Ph.k det(Atfj) det(AK.j)eH*K, 

H.K 

where H * K is the increasing (p + q)-list formed from H U K when H fi K — 0. 

(ii) Laplace 23 expansion down the jth column '. For each fixed j, 

det(A) = (—l) 1+ - / ai ; det(A 1 y) 4-b (-l) n+j a n j det(A„y), 

where Aji a is the (n — 1) x (n — 1) submatrix obtained from A by deleting its ith 
row and jth column. 

(iii) Laplace expansion across the ith row: For each fixed i, 

det(A) = det(A rr ) + • • • + (-1 ) i+n a in det(A iV ). 

Proof, (i) By the lemma, 

xj A xj = E det (A H .i)e H A ^ det (A KJ )e K 

H K 

— ^2 det (A H j)e H A det (A KJ )e K 

H.K 

= ^2 det (A h .i) det (A K j)e H A e K 

H.K 

— Ph.k det(Agj) det (Ak.j)ch*k- 
H.K 


(ii) If I — j has only one element, and if / = j' — \.... ,j, .... n is its complement, 
then 


Xj A Xj’ = Xj A X\ A • • • A Xj A • • • A X„ 

= (-l) /_1 Xl A • • • A X n 
= (— I) 7-1 det(A)ei A • • • A e n , 

because j. \.... , j..... n can be put in increasing order by j — 1 transpositions. On the 
other hand, we can evaluate xj A xy using part (i): 

xj a xj’ — Y, ph.k det (Ahj) det(A K jfeH*K■ 

H.K 

23 After P. S. Laplace. 
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In this sum, H has just one element, say, H = i, while K has n — 1 elements; thus, K — (: 
for some element i. Since ei, Atf =0 if {/} fl I' ^ 0, we may assume that i ^ i'\ 
that is, we may assume that i' = i'. Now, det(A;j) = a, 7 (this is a 1 x 1 minor), while 
det(A^ ;/) = det(A,v,/); that is, A^y is the submatrix obtained from A by deleting its /'th 
column and its ;th row. Hence, if — e\ A • • • A e n , 

x j Ax j' = X Ph.k det(A H j)det(A K j')e H *K 

H.K 

= X Pi,i’ det(A,y) det{Aity)e N 
i 

= AedA/fj')e N . 

i 

Therefore, equating both values for Xj A xy gives 

det(A) = X ( '- | )' + ' ,a u det(A,■/;•/), 


as desired. 

(iii) Laplace expansion across the / th row of A is Laplace expansion down the ;th column 
of A f , and so the result follows because det(A ? ) = det(A). • 

Notice that we have just proved that Laplace expansion across any row or down any 
column always has the same value; that is, the determinant is independent of the row or 
column used for the expansion. The Laplace expansions resemble the sums arising in 
matrix multiplication, and the following matrix was invented to make this resemblance a 
reality. 

Definition. If A = [c/, 7 ] is an n x n matrix with entries in a commutative ring k, then the 
adjoint 24 of A is the matrix 

adj(A) = [Cij], 

where 

Cij = (-D i+j det {A yv ). 

The reversing of indices is deliberate. In words, adj(A) is the transpose of the matrix whose 
/j entry is (—1) ,+ - / det(A,v,/). We often call Cij the / j-cofactor of A. 


Corollary 9.161. If A is an n x n matrix with entries in a commutative ring k, then 

A adj(A) = det(A)/ = adj(A)A. 

“^There is no connection between the adjoint of a matrix as just defined and the adjoint of a matrix defined on 
an inner product space. 
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Proof. Denote the ij entry of A adj(A) by bij. The definition of matrix multiplication 
gives 

n n 

bij — y ' QjpCpj — y dip ( iy~*~^ dct( Aj>p>) . 

p =i p=t 

If j = i, then Proposition 9.160 gives 

bu = det(A). 

If j f i , consider the matrix M obtained from A by replacing row j with row i. Of course, 
det(M) = 0, for it has two identical rows. On the other hand, we may compute det(M) 
using Laplace expansion across its “new” row j. All the submatrices Mj> p > — A y p ’, and 
so all the corresponding cofactors of M and A are equal. The matrix entries of the new 
row j are ui p , so that 

0 = det(M) = (—l) !+1 a,i det(A /r ) + • • • + (-1 ) i+n a in det (A r „>). 

We have shown that A adj (A) is a diagonal matrix having each diagonal entry equal to 
det(A). 

The proof that det(A)/ = adj(A)A is similar and it is left to the reader. [We could also 
adapt the proof of Corollary 3.107, replacing vector spaces by free A;-modules, or we could 
show that adj(A') = adj(A) f .] • 

Definition. An n x n matrix A with entries in a commutative ring k is invertible over k 
if there is a matrix B with entries in k such that 


AB = 1 = BA. 


If k is a field, then invertible matrices are usually called nonsingular , and they are 
characterized by having a nonzero determinant. Consider the matrix with entries in Z: 


3 

1 1 


Now det(A) — 2 f 0, but it is not invertible over Z. Suppose 


'3 

f 

a c 


3 a + b 3 c + d 

1 

1 

b d 


a + b c + d 


If this product is /, then 

3a+b=\=c + d 
3 c + d — 0 — a + b. 

Hence, b — —a and 1 = 3a + b — 2a; as there is no solution to 1 = 2a in Z, the matrix A 
is not invertible over Z. Of course, A is invertible over Q. 
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Proposition 9.162. If k is a commutative ring and A e Mat,, (k), then A is invertible if 
and only ifdet(A) is a unit in k. 

Proof If A is invertible, then there is a matrix B with AB — I. Hence, 

1 = det(7) = det(A7?) = det(A) det(TJ); 
this says that det(A) is a unit in k. 

Conversely, assume that det(A) is a unit in k, so there is an element u e k with 
u det(A) = 1. Define 

B — wadj(A). 

By Corollary 9.161, 

AB = Af/adj(A) = u det(A)/ = I — nadj(A)A = BA. 

Thus, A is invertible. • 

Here is a proof by exterior algebra of the computation of the determinant of a matrix in 
block form. 

Proposition 9.163. Let k be a commutative ring, and let 



be an (m + n) x (m + n ) matrix with entries in k, where A is an m x m submatrix, and B 
is an n x n submatrix. Then 

det(X) = det(A) det(B). 

Proof. Let e\, ..., e m + n be the standard basis of k m+n , let a\ ,..., a m be the columns of 
A (which are also the first m columns of X ), and let y,- + f, be the (m + z')th column of X, 
where /,• is the zth column of C and ff is the zth column of B. 

Now Yi e (e\,..., e m ), so that /,• = Y^j= l c ji e j- Therefore, if H = 1,2, .... n, 

m 

eH A yi — e H A 'Y^Cjiej = 0, 

7=1 

because each term has a repeated e Using associativity, we see that 

eH A (yi + P \) A (Y2 + f>2) A • • • A (y„ + p„) 

— eH A /Si A (Y2 + fl) A • • • A (y„ + Pn) 

= e H A /Si A p 2 A • • • A (Yn + p„) 

— e H a /Si A f 2 A • • • A P„. 
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Hence, if J = m + 1, m + 2,. .., m + n, 

&et(X)e H A ej = on A • • • A a m A + ft) A • • • A (y n + ft) 

= det (A)e H A (yi + ft) A • • • A (y n + ft) 

= det (A)e H A ft A • • • A ft 
= det(A)e# A det(B)e 7 
= det(A) det(B)e# A ej. 

Therefore, dct( X) = det(A) det(B). • 

Corollary 9.164. If A — [c/ ;/ ] is a triangular n x n matrix, that is, aij — 0 for all i < j 
(lower triangular) or aij — 0 for alii > j (upper triangular), then 

n 

det(A) = ]~[a n ; 

i=i 

that is, det(A) is the product of the diagonal entries. 

Proof. An easy induction on n > 1, using Laplace expansion down the first column (for 
upper triangular matrices) and the proposition for the inductive step. • 

Although the definition of determinant of a matrix A in terms of the wedge of its 
columns gives an obvious algorithm for computing it, there is a more efficient means of 
calculating det(A) when its entries lie in a field. Using Gaussian elimination, there are 
elementary row operations changing A into an upper triangular matrix T : 

A Ai -* A r = T. 

Keep a record of the operations used. For example, if A —»• A 1 is an operation of Type I, 
which multiplies a row by a unit c, then c det(A) = det(Ai) and so det(A) = c -1 det(Ai); 
if A -» A 1 is an operation of Type II, which adds a multiple of some row to another one, 
then det(A) = det(Ai); if A -> Ai is an operation of Type III, which interchanges two 
rows, then det(A) = — det(Aj). Thus, the record allows us, eventually, to write det(A) 
in terms of det(T). But since T is upper triangular, det(T) is the product of its diagonal 
entries. 

Another application of exterior algebra constructs the trace of a map. 

Definition. Let A: be a commutative ring and let A be a Ar-algebra. A derivation of A is a 
homomorphism d : A —>• A of ^-modules for which 

d(ab) — (da)b + a (db). 

In words, a derivation acts like ordinary differentiation in calculus, for we are saying 
that the product rule, (fg)' — f'g + fg', holds. 
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Lemma 9.165. Let k be a commutative ring, and let M be a k-module. 

(i ) If tp: M —> M is a k-map, then there exists a unique derivation D v : T ( M ) —> 
T {M), where T ( M ) is the tensor algebra on M, which is a graded map {of degree 
0) with D V \M — tp\ that is, for all p > 0, 

D (p (T p (Mj) C T P {M). 

(ii) If <p: M —> M is a k-map, then there exists a unique derivation d v : f\(M) -> 
/\{M), which is a graded map {of degree 0) with d v \M = (p; that is, for all p > 0, 

P P 

d v (/\{M)) c /\(M). 


Proof (i) Define D^\k = 1* (recall that T°(M ) = k ), and define (recall 

that T l {M) = M). If p > 2, define D p : T P {M) T^(M) by 

p 

D p {m i ® • • • <g> OT p ) = ^2 m i ® ® (p{nij) <g> • • • <g> 

! = 1 

For each i, the ;th summand in the sum is well-defined, because it arises from the k- 
multilinear function (mi,..., m p ) i->- mi®"-® tp(nij) ® ® m p ; it follows that D<p is 
well-defined. 

It is clear that D v is a map of A:-modules. To check that D, p is a derivation, it suffices to 
consider its action on homogeneous elements u = u\® ■ ■ ■ ® u p and v — i>i ® ® v q . 

Dpluv) — Dylu i ® • • • <g> u p ® ® ® v q ) 

p 

= ^ Ml ® • • • <g> ^(m ; ) ® ® U p ® V 

i=i 

q 

+ ^2 U®V 1 ® • • • ® <p(vy) ® ■ ■ ■ ® Vq 

i =i 

= Dq,{u)v + uD v {v ) 

We leave the proof of uniqueness to the reader. 

(ii) Define d v : /\{M) —»■ /\(M) using the same formula as that for after replacing ® 
by A. To see that this is well-defined, we must show that Dq,{J) C ./, where ./ is the two- 
sided ideal generated by all elements of the form m ® m. It suffices to prove, by induction 
on p > 2, that D V {J P ) C /, where JP = J t T T P {M). The base step p = 2 follows from 
the identity, for a,b e M, 

a®b + b®a — {a + b) ® {a + b) — a® a — b ® b e J. 
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The inductive step follows from the identity, for a , c e M and b e J p 1 

a®b®c + J — —a ® c ® b + J 
= c<8>a®fo+/ 

= — c <8> b ® a + J • 


Proposition 9.166. Let k be a commutative ring, and let M be a finitely generated free 
k-module with basis e\, ..., e n . If cp: M —>• M is a k-map and d v : f\(M) —> /\(M) is 
the derivation it determines, then 


n 

d v \/\(M) = tr(cp)e L , 


where ez, = e\ A • • • A e n . 

Proof. By Lemma 9.165(ii), we have d v : /\ n (M) —>■ f\ n (M). Since M is a free k- 
module of rank n, the binomial theorem gives /\"(M) = k. Hence, d v (ei ) — ce / for 
some c e k\ we now show that c — tr (tp). Now <p{e{) = ^ a ji e j- 

dyleifi — ^ e\ A • • • A (p(e r ) A • • • A e n 
r 

= e\ A • • • A a j r e j A • • • A e n 

r 

= e\ A • • • A A • • • A c, 

r 

= a rr e L 

r 

= tr(ip)e L . • 


Exercises 

9.95 Let k be a commutative ring, and let V and IT be free k-modules of ranks m and n, respectively. 

(i) Prove that if /: V —»■ V is a fc-map, then 

det(/®lw) = [det(/)] n . 

(ii) Prove that if /: V —> V and g: W —> W are k-maps. then 

det(/ ® g) = [det(/)]”[det(g)] m . 
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9.96 


(i) 


Let z\, ■ ■ ■, z n be elements in a commutative ring k , and consider the Vandermonde 
matrix 

r i i ... i i 


V(zi,...,z„) = 


Zi 

A 


Z2 

*2 



Ur 1 4- 1 ••• zr‘J 

Prove that det(V(zi,.... z n )) = fl/c/Z; “ Zi)- 

(ii) If /(.v) = n,(-v — Z;') has discriminant D , prove that /) = det(V(zi, • • • > Zn)). 

(iii) Prove that if zi ,..., Zn are distinct elements of a field k, then V(zi , ..., Zn) is nonsin¬ 
gular. 

9.97 Define a tridiagonal matrix to be an n x n matrix of the form 


"*1 

1 

0 

0 

0 

0 

0 

0" 

-1 

*2 

1 

0 

0 

0 

0 

0 

0 

-1 

x 3 

1 

0 

0 

0 

0 

0 

0 

-1 

X 4 

0 

0 

0 

0 

0 

0 

0 

0 

X/i—3 

1 

0 

0 

0 

0 

0 

0 


Xn —2 

1 

0 

0 

0 

0 

0 

0 

-1 

x n— 1 

1 

_ 0 

0 

0 

0 

0 

0 

-1 

x n _ 


(i) If D n = det(r[jci, ..., x „]), prove that D\ = vj. ZL = v 11'2 + 1, and, for all n > 2, 

Dfi = x n D n _i -f D„_ 2 - 

(ii) Prove that if all jc,- = 1, then D n — F n _ |_j, the nth Fibonacci number. (Recall that 

Fq = 0, F\ = 1, and F n = 2 for all n > 2.) 

9.98 If a matrix A is a direct sum of square blocks, 

A = B x © ••• © B t , 

prove that det(A) = ]”[, det(6, ). 

9.99 If A and B are n x n matrices with entries in a commutative ring R, prove that AB and BA 
have the same characteristic polynomial. 

Hint. (Goodwillie) 


7 

s' 

'0 

0 " 

7 

-s' 


'BA 

o' 

_0 

I 

A 

AS 

_0 

I 


A 

0 _ 


9.10 Lie Algebras 


There are interesting examples of nonassociative algebras, the most important of which 
are the Lie algebras. In the late nineteenth century, Sophus Lie (pronounced LEE) stud¬ 
ied the solution space S of a system of partial differential equations using a group G of 
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transformations of S. The underlying set of G is a differentiable manifold and the group 
operation is a C°°-function; such groups are called Lie groups. The solution space is inti¬ 
mately related to its Lie group G; in turn, G is studied using its Lie algebra, a considerably 
simpler object, which arises as the tangent space at the identity element of G. Aside from 
this fundamental reason for their study. Lie algebras turn out to be the appropriate way to 
deal with families of linear transformations on a vector space (in contrast to the study of 
canonical forms of a single linear transformation given in the first sections of this chapter). 
Moreover, the classification of the simple finite-dimensional complex Lie algebras, due to 
W. Killing and E. Cartan at the turn of the twentieth century, served as a model for the re¬ 
cent classification of all finite simple groups. It was C. Chevalley who recognized that one 
could construct analogous families of finite simple groups by imitating the construction of 
simple Lie algebras. 

Before giving the definition of a Lie algebra, let us first present an allied definition. We 
have already defined derivations of rings; let us now generalize the notion a bit. 

Definition. Let A: be a commutative ring. A not necessarily associative k-algebra A is a 
A:-module equipped with some multiplication A x A -> A, denoted by (a, b) i-> ab, such 
that 

(i) a(b + c) — ab + ac and (b + c)a = ba + ca for all a, b,c e A; 

(ii) ua — au for all u e k and a e A; 

(iii) a(ub ) = (au)b — u(ab) for all u e k and a, b e A. 

A derivation of A is a A-map d : A —> A for which 

d(ab) — ( da)b + a(db). 

Aside from ordinary differentiation in calculus, which is a derivation because the prod¬ 
uct rule holds, (fg)' — f'g + fg', another example is provided by the R-algebra A of all 
real valued functions f(xi,, x n ) of several variables. The partial derivatives d/dxi are 
derivations, for i — 1 

The composite of two derivations need not be a derivation. For example, if d : A —* A 
is a derivation, then d 2 — d o d : A -* A satisfies the equation 

d 2 (fg) = d 2 (f)g + 2 d(f)d(g) + fd 2 (g ); 

the mixed term 2d ( f)d(g) is the obstruction to d 2 being a derivation. More generally, we 
may generalize the Leibniz formula (Exercise 1.6 on page 12) from ordinary differentiation 
on the ring of all C°°-functions to a derivation on any not necessarily associative algebra 
A. If /, g e A, then 

d n (fg) = J2 

1=0 
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It is still worthwhile to compute the composite of two derivations d\ and d 2 . If A is a not 
necessarily associative algebra and /, g e ,4, then 

d\d 2 (fg) = d\ [( d 2 f)g + f(d 2 g)] 

- ( did 2 f)g + (d 2 f){d\g) + ( dif)(d 2 g ) + f{d\d 2 g). 

Of course, 

d 2 d\{fg) = (d 2 d\ f)g + (d\f)(d 2 g) + (d 2 f )(d\g) + f(d 2 d\g). 

If we denote d[d 2 — d 2 d\ by [di, d 2 \, then subtraction gives 

[di,d 2 ](fg) = ([di,d 2 \f)g + f([d u d 2 ] g y, 
that is, [c/i, d 2 \ = did 2 — d 2 d\ is a derivation. 

Example 9.167. 

If k is a commutative ring, equip Mat,, (k ) with the bracket operation: 


[A, B] = AB - BA. 

Of course, A and B commute if and only if [A, B] — 0. It is easy to find examples showing 
that the bracket operation is not associative. However, for any fixed n x n matrix M. the 
function 

ad^: Mat„(k) —> Mat„(k), 


defined by 


ad m \ A i —y [Ad , A], 


is a derivation: 

[M, [A, B]] = [[M, A], B] + [A, [M, B]]. 

The verification of this identity should be done once in one’s life. -4 

The definition of Lie algebra involves a vector space with a multiplication generalizing 
the “bracket.” 

Definition. If k is a field, then a Lie algebra over k is a vector space L over k equipped 
with a bilinear operation L x L —> L, denoted by ( a , b) i-^ [ a , b] (and called bracket ), 
such that 

(i) [a, a] =0 for all a e L\ 

(ii) For each a e L, the function ad„ : b i->- [a. b] is a derivation. 
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For all u,v e L, bilinearity gives 

[u + v, u + i>] = [u, u] + [u, n] + [d, u] + [i>, i>], 
which, when coupled with the first axiom [a, a] — 0, gives 

[u,v] — —[v, «]; 

that is, bracket is anticommutative . The second axiom is often written out in more detail. 
If b, c e L. then their product in L is denoted by [b, c]; that ad„ is a derivation is to say 

[a, [/?, c]] = [[a,b],c] + [b, [, a,c ]]; 


rewriting, 


[a. [b , c]] - [ b , [a, c]] - [[a, b], c] = 0 . 

The anticommutativity from the first axiom now gives the Jacobi identity : 

[a, [ b , c]] + [ b , [c, a]] + [c, [a, k]] = 0 for all a, b, c e L. 


Thus, a vector space L is a Lie algebra if and only if [a, a] = 0 for all a e L and the Jacobi 
identity holds. 

Here are some examples of Lie algebras. 


Example 9.168. 

(i) If V is a vector space over a field k, define [a. b\ = 0 for all a, b e V. It is obvious that 
V so equipped is a Lie algebra, and it is called an abelian Lie algebra. 

(ii) In R 3 , define [u, v] = u x v, the vector product (or cross product) defined in calculus. 
It is routine to check that v x v = 0 and that the Jacobi identity holds, so that R 3 is a Lie 
algebra. This example may be generalized: For every field k, cross product can be defined 
on the vector space k 3 making it a Lie algebra. 

(iii) A subalgebra S of a Lie algebra L over a field k is a subspace that is closed under 
bracket: If a, b e S, then [a. b\ <= S. It is easy to see that every subalgebra is itself a Lie 
algebra. 

(iv) If k is a field, then Mat„ (k ) is a Lie algebra if we define bracket by 

[A, B] = AB - BA. 

We usually denote this Lie algebra by gl(n, k). This example is quite general, for it is a 
theorem of I. D. Ado that every finite-dimensional Lie algebra over a field k of character¬ 
istic 0 is isomorphic to a subalgebra of gl(n, k ) for some n (see Jacobson, Lie Algebras , 
page 202 ). 

(v) An interesting subalgebra of gl (n, k) is sl(n, k), which consists of all n x n matrices 
of trace 0. In fact, if G is a Lie group whose associated Lie algebra is g, then there is 
an analog of exponentiation g -> G. In particular, if g = g[(n,C), then this map is 
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exponentiation A e A . Thus, Proposition 9.52(viii) shows that exponentiation sends 
sl(n, C) into SL(n, C). 

(vi) If A is any algebra over a field k, then 

Dct(A/k) = {all derivations d: A —*■ A], 

with bracket [d\, d 2 \ — d\dj — did\, is a Lie algebra. 

It follows from the Leibniz rule that if k has characteristic p > 0, then d p is a derivation 
for every d e X)er(,4 /k), for (?) = 0 mod p whenever 0 < i < p. (This is an example of 
what is called a restricted Lie algebra of characteristic p.) 

There is a Galois theory for certain purely inseparable extensions, due to N. Jacobson 
(see Jacobson, Basic Algebra II, pages 533-536). If k is a field of characteristic p > 0 and 
E/k is a finite purely inseparable extension of height 1, that is, a p e k, for all a e E, 
then there is a bijection between the family of all intermediate fields and the restricted Lie 
subalgebras of Der (E/k), given by 


B i-> Ser (E/B)\ 

the inverse of this function is given by 

£ i-> {e e E : D{e) — 0 for all D e £}. ◄ 


Not surprisingly, all Lie algebras over a field k form a category. 

Definition. If L and L' are Lie algebras over a field k, then a function /: L —> L' is a Lie 
homomorphism if / is a Glinear transformation that preserves bracket: For all a, b e L, 

f([a,b]) = [fa,fb]. 

Definition. An ideal of a Lie algebra L is a subspace I such that [x, a] e I for every 
x e L and a e I. 

Even though a Lie algebra need not be commutative, its anticommutativity shows that 
every left ideal (as just defined) is necessarily a right ideal; that is, every ideal is two-sided. 
A Lie algebra L is called simple if L pk {0} and L has no nonzero proper ideals. 

Definition. If I is an ideal in L, then the quotient L/I is the quotient space (considering 
L as a vector space and I as a subspace) with bracket defined by 

[a + I, b + I] — [a, b] + I. 

It is easy to check that this bracket on L/I is well-defined. If a' + I = a + I and 
b' + / = b + /, then a — a' e / and b — b' e /, and so 

[o', b'] - [a, b] = [a', b'] - [ab] + [a', b] - [a, b] 

= [a,b' -b] + [a -a,b'] e I. 
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Example 9.169. 

(i) If f: L —> L' is a Lie homomorphism, then its kernel is defined as usual: 

ker f — {a e L : f(a ) = 0}. 

It is easy to see that ker / is an ideal in L. 

Conversely, the natural map v: L —> L/I, defined by a i-^ a + /, is a Lie homomor¬ 
phism whose kernel is I. Thus, a subspace of L is an ideal if and only if it is the kernel of 
some Lie homomorphism. 

(ii) If I and J are ideals in a Lie algebra L, then 

IJ = j». j r ] : i r e I and j r e /J. 

r 

In particular, L 2 = LL is the analog for Lie algebras of the commutator subgroup of a 
group: L 2 — {0} if and only if L is abelian. 

(iii) There is an analog for Lie algebras of the derived series of a group. The derived series 
of a Lie algebra L is defined inductively: 

L<°> = L; L (n+1) = (L<">) 2 . 

A Lie algebra L is called solvable if there is some n > 0 with L { " ] = {0}. 

(iv) There is an analog for Lie algebras of the descending central series of a group. The 
descending central series is defined inductively: 

L i — L\ L n +i = LL n . 

A Lie algebra L is called nilpotent if there is some n > 0 with L„ — {0}. < 

We merely mention the first two theorems in the subject. If L is a Lie algebra and 
a e L. then ad fl : L —*■ L, given by ad„: x i-> \a, x], is a linear transformation on L 
(viewed merely as a vector space). We say that a is ad-nilpotent if ad„ is a nilpotent 
operator; that is, (ad a ) m — 0 for some m > 1 . 

Theorem (Engel’s Theorem). 

(i) If L is a finite-dimensional Lie algebra over any field k, then L is nilpotent if and 
only if every a e L is ad-nilpotent. 

(ii) If L is a Lie subalgebra of g((«, k) all of whose elements A are nilpotent matrices, 
then L can be put into strict upper triangular form (all diagonal entries are 0); that 
is, there is a nonsingular matrix P so that PAP -1 is strictly upper triangular for 
every A e L. 

Proof. See Humphreys, Introduction to Lie Algebras and Representation Theory, 
page 12. • 
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Compare Engel’s theorem with Exercise 9.43(i) on page 682, which is the much sim¬ 
pler version for a single nilpotent matrix. Nilpotent Lie algebras are so called because of 
Engel’s theorem; it is likely that nilpotent groups are so called by analogy. Corollary 5.48, 
which states that every finite p-group can be imbedded as a subgroup of unitriangular 
matrices over ¥ p , may be viewed as a group-theoretic analog of Engel’s theorem. 


Theorem (Lie’s Theorem). Every solvable subalgebra L of $\(n, k), where k is an 
algebraically closed field, can be put into (not necessarily strict) upper triangular form; 
that is, there is a nonsingular matrix P so that PAP~ l is upper triangular for every 
A e L. 

Proof See Humphreys, Introduction to Lie Algebras and Representation Theory, 
page 16. • 

Further study of Lie algebras leads to the classification of all finite-dimensional sim¬ 
ple Lie algebras, due to E. Cartan and W. Killing, over an algebraically closed field of 
characteristic 0 (recently, the classification of all finite-dimensional simple Lie algebras in 
characteristic p has been given, where p > 7). To each such algebra, they associated a 
certain geometric configuration called a root system, which is characterized by a Cartan 
matrix. Cartan matrices are, in turn, characterized by Dynkin diagrams. 


A„, n > 1 : , * 


B n , n > 2 : * 

n 1 


C n , n > 3 : * 

_ j 


n -1 « 


n -1 ' « 


ii — 1 ' « 





E(> : •-•-•-•-• 






E-j : • 
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£ 8 : •-•-•-•-•-•-• 

F4 : •-• =)= •-• 

Gi : • == • 

Every Dynkin diagram arises from a simple Lie algebra over C, and two such algebras 
are isomorphic if and only if they have the same Dynkin diagram. We refer the reader 
to Humphreys, Introduction to Lie Algebras and Representation Theory , Chapter IV, and 
Jacobson, Lie Algebras , Chapter IV. 

There are other not necessarily associative algebras of interest. Jordan algebras are 
commutative algebras A in which the Jacobi identity is replaced by 

( x 2 y)x — x 2 (yx) 

for all x, y e A. They were introduced by P. Jordan to provide an algebraic setting for 
doing quantum mechanics. An example of a Jordan algebra is a subspace of all n x n 
matrices, over a field of characteristic not 2, equipped with the binary operation A * B, 
where 

A*B = \(AB + BA). 

Another source of not necessarily associative algebras comes from combinatorics. The 
usual construction of a projective plane P{k) over a field k, as the family of all lines in 
k 3 passing through the origin, leads to descriptions of its points by "homogeneous coor¬ 
dinates” [x, y, 7 ], where x,y,z e k. Define an abstract projective plane to be an ordered 
pair (A, C), where A is a finite set and £ is a family of subsets of A, called lines , subject 
to the following axioms: 

(i) All lines have the same number of points; 

(ii) Given any two points in A, there is a unique line containing them. 

We want to introduce homogeneous coordinates to describe the points of such a projective 
plane, but there is no field k given at the outset. Instead, we look at a collection 1C of 
functions on A, called collineations, and we equip 1C with two binary operations (called 
addition and multiplication). In general, 1C is a not necessarily associative algebra, but 
certain of its algebraic properties—commutativity and associativity of multiplication— 
correspond to geometric properties of the projective plane—a theorem of Pappus and a 
theorem of Desargues, respectively. 

An interesting nonassociative algebra is the Cayley numbers (sometimes called octo- 
nions), which is an eight-dimensional real vector space containing the quaternions as a 
subalgebra (see the article by Curtis in Albert, Studies in Modern Algebra). Indeed, Cay¬ 
ley numbers form a real division not necessarily associative algebra in the sense that every 
nonzero element has a multiplicative inverse. The Cayley numbers acquire added interest 
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(as do other not necessarily associative algebras) because its automorphism group has in¬ 
teresting properties. For example, the exceptional simple Lie algebra E& is isomorphic to 
the Lie algebra of all the derivations of the Cayley numbers, while the Monster, the largest 
sporadic finite simple group, is the automorphism group of a certain nonassociative algebra 
constructed by R. Griess. 

Exercises 

9.100 Consider the de Rham complex when n = 2: 

n i d ^ i 

0 S2°(Z) ^ Q l (X) -* n 2 (X) -+ 0. 

Prove that if f(x, y) e A(X) = £2°(X), then 

d 0 f= a ±dx+Vdy, 

dx dy 

and that if Pdx + Qcly is a 1-form, then 

i /9 Q dP\ 

d l {Pdx + Qdy) = (-) dx A dy. 

V dx dy ) 

9.101 Prove that if L and L' are nonabelian two-dimensional Lie algebras, then L = L r . 

9.102 (i) Prove that the center of a Lie algebra L, defined by 

Z(L) = [a e L : [a, „y] = 0 for all x e Lj, 
is an abelian ideal in L. 

(ii) Give an example of a Lie algebra L for which Z(L) = {0}. 

(iii) If L is nilpotent and L (0}, prove that Z(L) ^ {0}. 

9.103 Prove that if L is an n -dimensional Lie algebra, then Z(L) cannot have dimension n — I. 
(Compare Exercise 2.69 on page 95.) 

9.104 Equip C 3 with a cross product (using the same formula as the cross product on R 3 ). 
Prove that 


C 3 =s 1(2,0. 
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Homology 


10.1 Introduction 


When I was a graduate student, homological algebra was an unpopular subject. The 
general attitude was that it was a grotesque formalism, boring to learn, and not very useful 
once one had learned it. Perhaps an algebraic topologist was forced to know this stuff, but 
surely no one else should waste time on it. The few true believers were viewed as workers 
at the fringe of mathematics who kept tinkering with their elaborate machine, smoothing 
out rough patches here and there. 

This attitude changed dramatically when J.-P. Serre characterized regular local rings us¬ 
ing homological algebra (they are the commutative noetherian local rings of “finite global 
dimension”), for this enabled him to prove that any localization of a regular local ring is 
itself regular (until then, only special cases of this were known). At the same time, M. 
Auslander and D. A. Buchsbaum completed work of M. Nagata by using global dimension 
to prove that every regular local ring is a UFD. 

In spite of its newfound popularity, homological algebra still “got no respect.” For 
example, the two theorems just mentioned used the notion of the global dimension of a ring 
which, in turn, is defined in terms of the homological dimension of a module. At that time, 
I. Kaplansky offered a course in homological algebra. One of his students, S. Schanuel, 
noticed that there is an elegant relation between different projective resolutions of the same 
module (see Proposition 7.60). Kaplansky seized this result, nowadays called Schanuel’s 
lemma, for it allowed him to define the homological dimension of a module without having 
first to develop the fundamental constructs Ext and Tor of homological algebra, and he 
was then able to prove the theorems of Serre and of Auslander-Buchsbaum (Kaplansky’s 
account of this course can be found in his book. Commutative Algebra). However, as more 
applications were found and as more homology and cohomology theories were invented 
to solve outstanding problems, resistance to homological algebra waned. Today, it is just 
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another standard tool in a mathematician’s kit. 

The basic idea of homology comes from Green’s theorem, where a double integral over 
a region R with holes in it is equal to a line integral on the boundary of R. H. Poincare 
recognized that whether a topological space X has different kinds of holes is a kind of 
connectivity. To illustrate, let us assume that X can be “triangulated;” that is, X can be 
partitioned into finitely many n-simplexes , where n > 0: points are 0-simplexes, edges are 
1-simplexes, triangles are 2-simplexes, tetrahedra are 3-simplexes, and there are higher¬ 
dimensional analogs. The question to ask is whether a union of n-simplexes in X that 
“ought” to be the boundary of some (n + l)-simplex actually is such a boundary. For 
example, when n — 0, two points a and b in X ought to be the boundary (endpoints) 
of a path in X; if there is a path in X joining all points a and /;, then X is called path 
connected; if there is no such path, then X has a 0-dimensional hole. For an example of 
a one-dimensional hole, let X be the punctured plane; that is, the plane with the origin 
deleted. The perimeter of a triangle A ought to be the boundary of a 2-simplex, but this is 
not so if A contains the origin in its interior; thus, X has a one-dimensional hole. If X were 
missing a line segment containing the origin, or even a small disk containing the origin, 
this hole would still be one-dimensional; we are not considering the size of the hole, but 
the size of the possible boundary. We must keep our eye on the doughnut and not upon the 
hole! 


a - b 



For example, in the rectangle drawn above, consider the triangle [a, b. c] with vertices a, 
b , c and edges [a, b], [b , c], [a, c]. Its boundary 3[a, b, c] should be [a, b] + [b, c] + [c, a]. 
But edges are oriented (think of [a, c] as a path from a to c and [c, a] as the reverse path 
from c to a), so let us write [c, a] = —[a, c]. Thus, the boundary is 

3[a, b , c] = [a, b] — [a, c] + [ b , c]. 

Similarly, let us define the boundary of [a , b] to be its endpoints: 

3[a, b] — b — a. 


We note that 


3(3 [a, b , c]) = 3([a, b] - [a, c] + [ b , c]) 
= b — a — (c — a) + c — b 
= 0 . 


The rectangle with vertices a, b, c, d is the union of two triangles [a, b, c] + [a, c, d], and 
we check that its boundary is d[a, b, c] + 3[a, c, d] (note that the diagonal [a, c] occurs 
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twice, with different signs, and so it cancels, as it should). We see that the formalism 
suggests the use of signs to describe boundaries as certain linear combinations u of edges 
or points for which 3 (u) — 0 . 

Such ideas lead to the following construction. For each n > 0, consider all formal 
linear combinations of zz-simplexes; that is, form the free abelian group C n ( X) with basis 
all zz-simplexes, and call such linear combinations n-chains. Some of these /(-chains ought 
to be boundaries of some union of (n + l)-simplexes; call them n-cycles (for example, 
adding the three edges of a triangle, with appropriate choice of signs, is a 1-cycle). Certain 
//-chains actually are boundaries, and these are called n-boundaries fif A is a triangle in the 
punctured plane X, not having the origin in its interior, then the alternating sum of the edges 
of A is a 1-boundary; on the other hand, if the origin does lie in the interior of A, then the 
alternating sum is a 1-cycle but not a 1-boundary). The family of all the //-cycles, Z n (X ), 
and the family of all the //-boundaries, B n (X ), are subgroups of C n (X). A key ingredient 
in the construction of homology groups is that the subgroups Z„ and B n can be defined in 
terms of homomorphisms: there are boundary homomorphisms 3„ : C n (X) —> C„_i(A) 
with Z„ = ker3„ and B n — im3 n+ i, and so there is a sequence of abelian groups and 
homomorphisms 


-► C 3 (X) C 2 (X) -X Ci(A) -X C 0 (X). 

It turns out, for all // > 1, that 3;,3„+i = 0, from which it follows that 

B„(X) C Z„(X). 

The interesting group is the quotient group Z n (X)/B n (X), denoted by //„( X ) and called 
the //th homology 1 group of X. What survives in this quotient group are the //-dimensional 
holes; that is, those //-cycles that are not //-boundaries. For example, Hq(X) — 0 means 
that X is path connected: if there are two points a, b e X that are not connected by a 
path, then a — b is a cycle that is not a boundary, and so the coset a — b + Bq(X) is 
a nonzero element of Hq(X). For zz > 1, these groups measure more subtle kinds of 
connectivity. Topologists modify this construction in two ways. They introduce homology 
with coefficients in an abelian group G by tensoring the sequence of chain groups by G 
and then taking homology groups; they also consider cohomology with coefficients in G 
by applying the contravariant functor Hom( , G) to the sequence of chain groups and then 
taking homology groups. Homological algebra arose in trying to compute and to find 
relations between homology groups of spaces. 

1 1 have not been able to discover the etymology of the mathematical term homology as used in this context. 
The word “homology” comes from homo + logos, and it means “corresponding.” Its first usage as a mathematical 
term occurred in projective geometry in the early 19th century, as the name of a specific type of collineation. 
The earliest occurrence I have found for its usage in the sense of cycles and boundaries is in an article of 
H. Poincare: Analysis Situs, Journal de l’Ecole Polytechnique, Series II, First issue, 1895 (and Oeuvres, vol. 5), 
but he does not explain why he chose the term. Emili Bifet has written, in a private communication, “Consider 
the projective homology, between two distinct (hyper)planes, given by projection from an exterior point. This ho¬ 
mology suggests (and provides) a natural way of deforming the boundary of a simplex contained in one plane into 
the boundary of the corresponding simplex on the other one. Moreover, it suggests a natural way of deforming a 
boundary into a point. This could be what Poincare had in mind.” 
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We have already seen, in Proposition 7.51, that every left R -module M , where R is a 
ring, has a description by generators and relations. There is an exact sequence 

0 —> ker< p ——r F —M 0, 


where F is a free left R-module and i is the inclusion. If R is a PID, then ker <p is free, 
because every submodule of a free module is itself free; if R is not a PID, then ker ip may 
not be free. Now take generators and relations of ker (p: There is a free module F\ and an 
exact sequence 

0 —»■ ker i j/ — F] —ker (p —> 0. 

If we define F\ -» F to be the composite t \!/ , then there is a second exact sequence 


III/ W 

F\ —F —» M 


and, iterating this construction, there is a long exact sequence 

-» F 3 F 2 Fi -» F -> M 0. 

We can view the submodules ker( F n -> F„- 1 ) as “relations on relations’’ (nineteenth cen¬ 
tury algebraists called these higher relations syzygies). This long exact sequence resembles 
the sequence of chain groups in topology. There are other contexts in which such exact 
sequences exist; many algebraic structures give rise to a sequence of homology groups, 
and these can be used to translate older theorems into the language of homology. Exam¬ 
ples of such theorems are Hilbert’s Theorem 90 about algebras (see Corollary 10.129), 
Whitehead’s lemmas about Lie algebras (see Jacobson, Lie Algebras, pages 77 and 89), 
and Theorem 10.22, the Schur-Zassenhaus lemma, about groups. There are methods to 
compute homology and cohomology groups, and this is the most important contribution of 
homological algebra to this circle of ideas. Although we can calculate many things without 
them, the most powerful method of computing homology groups uses spectral sequences. 
When I was a graduate student, I always wanted to be able to say, nonchalantly, that such 
and such is true “by the usual spectral sequence argument,” but I never had the nerve. 2 We 
will sketch what spectral sequences are at the end of this chapter. 


10.2 Semidirect Products 

We begin by investigating a basic problem in group theory. A group G having a normal 
subgroup K can be “factored” into K and G/K; the study of extensions involves the in¬ 
verse question: How much of G can be recovered from a normal subgroup K and the 
quotient Q — G/K ? For example, we know that \G\ — \K\\Q\'A K and Q are finite. 

“This introduction is adapted from a review I wrote that appeared in Bulletin of the American Mathematical 
Society, Vol. 33, pp. 473-475, 1996; it is reproduced by permission of the American Mathematical Society. 
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Exactness of a sequence of nonabelian groups. 


n ^n+l 
tjn+l ^ tjj 7 


Gn- 1 


is defined just as it is for abelian groups: \md n +\ = ker d n for all n. Of course, each ker d n 
is a normal subgroup of G„. 


Definition. If K and Q are groups, then an extension of K by Q is a short exact sequence 


1 



Q 


1 . 


The notation K is to remind us of kernel, and the notation Q is to remind us of quotient. 

There is an alternative usage of the term extension, which calls the (middle) group 
G (not the short exact sequence) an extension if it contains a normal subgroup K\ with 
K[ = K and G/K\ = Q. As do most people, we will use the term in both senses. 


Example 10.1. 

(i) The direct product K x Q is an extension of K by Q; it is also an extension of Q by K. 


(ii) Both Sj, and Ilf, are extensions of I3 by I2. On the other hand. If, is an extension of I2 
by I3, but S 3 is not, for S 3 contains no normal subgroup of order 2. < 

We have just seen, for any given ordered pair of groups, that there always exists an 
extension of one by the other (their direct product), but there may be other extensions as 
well. The extension problem is to classify all possible extensions of a given pair of groups 
K and Q. 

In Chapter 5, on page 283, we discussed the relation between the extension problem 
and the Jordan-Holder theorem. If a group G has a composition series 

G = K 0 > Ki > K 2 > ■ ■ ■ > K n -1 > K n = {1} 

with simple factor groups Q \, .... Q n , where Q, = K/- \ / K, for all i > 1, then G could 
be recaptured from Q n , Q n ~\, . ... Q \ by solving the extension problem n times. Now all 
finite simple groups have been classified, and so we could survey all finite groups if we 
could solve the extension problem. 

Let us begin by recalling the partition of a group into the cosets of a subgroup. We have 
already defined a transversal of a subgroup K of a group G as a subset T of G consisting 
of exactly one element from each coset 3 Kt of K. 

Definition. If 

1 -> K -» G -A Q -> 1 

is an extension, then a lifting is a function l: Q —> G, not necessarily a homomorphism, 
with pi = 1 q. 

3 We have been working with left cosets tK, but, in this chapter, the subgroup K will be a normal subgroup, 
in which case tK = Kt for all t e G. Thus, using right cosets or left cosets is only a matter of convenience. 
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Given a transversal, we can construct a lifting. For each x e Q, surjectivity of p 
provides £(x) e G with p£(x) = x; thus, the function x i-> l(x) is a lifting. Conversely, 
given a lifting, we claim that imf is a transversal of K. If Kg is a coset, then pig) e Q: 
say, p(g) = x. Then p(g£(x)~ 1 ) — 1, so that a — g£(x)~ l e K and Kg — Ki(x). 
Thus, every coset has a representative in £(Q). Finally, we must show that £(Q) does 
not contain two elements in the same coset. If Kt(x) — K£(y), then there is a e K 
with a£(x) — £(y). Apply p to this equation; since p(a) — 1, we have x = y and so 
I (x) — t < v). 

The following group will arise in our discussion of extensions. 

Definition. Recall that an automorphism of a group K is an isomorphism K —> K. The 
automorphism group , denoted by Aut( K ), is the group of all the automorphisms of K 
with composition as operation. 

Of course, extensions are defined for arbitrary groups K , but we are going to restrict 
our attention to the special case when K is abelian. If G is an extension of K by Q, it 
would be confusing to write G multiplicatively and its subgroup K additively. Hence, we 
shall use the following notational convention: Even though G may not be abelian, additive 
notation will be used for the operation in G. Corollary 10.4 gives the main reason for this 
decision. 

Proposition 10.2. Let 

o->kAg-^q->\ 

be an extension of an abelian group K by a group Q, and let £: Q —> G be a lifting. 

(i) For every x e Q, conjugation 9 X : K —> K, defined by 

6 X : a i-> £(x) + a — £(x), 

is independent of the choice of lifting £(x) ofx. [For convenience, we have assumed 
that i is an inclusion; this merely allows us to write a instead ofi(a).] 

(ii) The function B : Q —> AuRAA, defined by x i —> 9 X , is a homomorphism. 

Proof, (i) Let us now show that 9 X is independent of the choice of lifting £(x) of x. 
Suppose that £'(x) e G and p£' (x) = x. There is b e K with £’{x) — £(x) + b [for 
— £(x) + £'(x) e ker p = im i = K). Therefore, 

£'(x) + a — £'(x) — £(x) + b + a — b — £(x) 

— £(x) + a — £(x), 


because K is abelian. 

(ii) Now 9 X (a) e K because K <i G, so that each 9 X : K —> K \ also, 9 X is an automorphism 
of K , because conjugations are automorphisms. 
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It remains to prove that 9: Q —>■ Aut( K ) is a homomorphism. If x, y e Q and a e K. 
then 

0 x (8 y (a)) = 9 x (lfy) + a- i{y)) = l(x) + Ify) +a- Ify) - l(x). 


while 

d xy (a) = (-ixy) +a - i(xy). 

But tfx) + £fy) and £(xy) are both liftings of xy, so that equality 9 x 9 y — 9 xy follows from 
part (i). • 

Roughly speaking, the homomorphism 0 tells “how” K is normal in G, for isomorphic 
copies of a group can sit as normal subgroups of G in different ways. For example, let K 
be a cyclic group of order 3 and let Q — (x) be cyclic of order 2. If G — K x Q, then G 

is abelian and K lies in the center of G. In this case, £(x) + a — £(x) = a for all a e K 

and 9 X — Ik- On the other hand, if G = .S 3 , then K = Aj which does not lie in the center; 

if l(x) = (1 2), then (1 2)(1 2 3)(1 2) = (1 3 2) and 0 X is not 1*. 

The existence of a homomorphism 6 equips K with a scalar multiplication making K 
a left ZQ-module, where T,Q is the group ring whose elements are all ^T lg y m x x for 
m x e Z. 

Proposition 10.3. Let K and Q be groups with K abelian. Then a homomorphism 
6 : Q —> Aut(Zf) makes K into a left ZQ-module if scalar multiplication is defined by 

xa — 9 x (a) 

for all a e K and x e Q. Conversely, if K is a left ZQ-module, then x 1 —> 9 X defines a 
homomorphism 6 : Q —>■ Aut(A'), where 9 X : a 1 —> xa. 

Proof. Define scalar multiplication as follows. Each u e Z Q has a unique expression of 
the form u — J2xeQ m x x -> where m x e Z and almost all m x = 0; define 


(y^m x x)a — y"m x d x (a) = ^ 


m x {xa). 


We verify the module axioms. Since 9 is a homomorphism, 0(1) = I k- and so 1 a = 9\ (a) 
for all a e K. That 9 X e Aut(W) implies x(a + b) = xa + xb, from which it follows that 
u(a + b) — ua + ub for all a e Z Q. Similarly, we check easily that (u + v)a — ua + va 
for u,ve Z Q. Finally, (uv)a — u(va) will follow from (xy)a = xfya) for all x, y e Q: 
but 

(xy)a = 9 xy (a) = 9 x (9 y (a)) = 9 x (ya) = xfya). 

The proof of the converse is also routine. • 
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Corollary 10.4. If 

o-+kAg-?>q->i 

is an extension of an abelian group K by a group Q, then K is a left hQ-module if we 
define 

xa — £(x) + a — £(x), 

where l\ Q —> G is a lifting, x e Q, and a e K; moreover, the scalar multiplication is 
independent of the choice of lifting l. 

Proof Propositions 10.2 and 10.3. • 

From now on, we will abbreviate the term “left Z (7-module” to “(7-module.” 

Recall that a short exact sequence of left /(-modules 

o->a4b4c->o 

is split if there exists a homomorphism j: C B with pj — 1 c\ in this case, the middle 
module is isomorphic to the direct sum A © C. Here is the analogous definition for groups. 

Definition. An extension of groups 

O^K-^G-ig^l 

is split if there is a homomorphism / : (7 -> G with pj — 1 q. The middle group G in a 
split extension is called a semidirect product of K by Q. 

Thus, an extension is split if and only if there is a lifting, namely, /, that is also a 
homomorphism. We shall use the following notation: The elements of K shall be denoted 
by a,b,c ,and the elements of Q shall be denoted by x, y, z, . ■ ■ ■ 

Proposition 10.5. Let G be an additive group having a normal subgroup K. 

(i) If 0 -> f 4 g 4 g -> \ is a split extension, where j: Q -> G satisfies pj = 1 q, 
then i(K) fl j(Q ) = {0} and i{K) + j(Q ) = G. 

(ii) In this case, each g e G has a unique expression g — i(a) + j (x), where a e K 
and x e Q. 

(iii) Let K and Q be subgroups of a group G with K < G. Then G is a semidirect product 
of K by Q if and only if K IT Q — {0}, K + Q — G, and each g e G has a unique 
expression g = a + x, where a e K and x e Q. 

Proof (i) If g e i(K) fl j(Q), then g — i(a) — j(x) for a e K and x e Q. Now 
g = j(x) implies p(g) — pj(x) = x, while g = i(a) implies p(g) = pi(a) = 0. 
Therefore, x — 0 and g — j (x) = 0. 

If g e G, then p(g) = pjp(g) (because pj — 1 q), and so g — (jp(g)) e ker p — imi\ 
hence, there is a e K with g — ( jp(g )) = i ( a ), and so g — i (a) + j (pg) e i ( K ) + j ( Q ). 
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(ii) Every element g e G has a factorization g = i (a) + j ( pg ) because G = i (K) + j (Q). 
To prove uniqueness, suppose that i (a) + j(x) = i(b) + j(y), where b e K and y e Q. 
Then —i(b) + i(a ) = j(y) — j (x) e i(K) fi j(Q) = {0}, so that i(a) = i(b ) and 
j(x) - j(y). 

(iii) Necessity is the special case of (ii) when both i and j are inclusions. Conversely, each 
g e G has a unique factorization g = ax for a e K and x e Q\ define p: G —> Q by 
p(ax) = x. It is easy to check that p is a surjective homomorphism with ker p = K. • 

A semidirect product is so called because a direct product G of K and Q requires, in 
addition to KQ = G. and K fi Q = {1}, that both subgroups K and Q be normal; here, 
only one subgroup must be normal. 

Definition. If K < G and C < G satisfies C fi K = {1} and KC — G, then C is called 
a complement of K. 

In a semidirect product G, the subgroup K is normal; on the other hand, the image j(Q), 
which Proposition 10.5 shows to be a complement of K, may not be normal. For example, 
if G = Sj and K = A 3 = {(1 2 3)>, we may take C = (r), where r is any transposition 
in .S 3 ; this example also shows that complements need not be unique. However, any two 
complements of K are isomorphic, for any complement of K is isomorphic to G/K. 

The definition of semidirect product allows the kernel K to be nonabelian, and such 
groups arise naturally. For example, the symmetric group S n is a semidirect product of the 
alternating group A„ by I 2 . In order to keep hypotheses uniform, however, let us assume 
in the text (except in some exercises) that K is abelian, even though this assumption is not 
always needed. 

Example 10.6. 

(i) A direct product K x Q is a semidirect product of K by Q (and also of Q by K). 

(ii) An abelian group G is a semidirect product if and only if it is a direct product (usually 
called a direct sum), for every subgroup of an abelian group is normal. 

(iii) The dihedral group Di n is a semidirect product of I„ by I 2 . If Ihn = (a, b), where 
a" = 1 , b 2 = 1 , and bab — a -1 , then (a) is a normal subgroup having (b) as a comple¬ 
ment. 

(iv) Every Frobenius group is a semidirect product of its Frobenius kernel by its Frobenius 
complement. 

(v) Let G — H x , the multiplicative group of nonzero quaternions. It is easy to see that if 
R + is the multiplicative group of positive reals, then the norm N: G —> R + , given by 

A(u T bi cj dk) — a ^ b~ T c~ T d~ , 

is a homomorphism. There is a “polar decomposition” h — rs, where r > 0 and 3 e ker A, 
and G is a semidirect product of ker A by R + . (The normal subgroup ker A is the 3- 
sphere.) In Exercise 10.4, we will see that ker A = SU (2, C), the special unitary group. 
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(vi) Cyclic groups of prime power order are not semidirect products, for they cannot be a 
direct sum of two proper subgroups. < 

Definition. Let K be a (9-module. An extension G of K by Q realizes the operators if, 
for all x e Q and a e K, we have 

xa = l(x) + a — £(x); 

that is, the given scalar multiplication of 7LQ on K coincides with the scalar multiplication 
of Corollary 10.4 arising from conjugation. 

Here is the construction. 

Definition. Let Q be a group and let K he a (Lmodule. Define 

G = K xi Q 

to be the set of all ordered pairs (a, x) e K x Q with the operation 

(a, x) + ( b, y) = (a + xb, xy). 

Notice that (a, 1) + (0, x) = {a, x) in K x Q. 

Proposition 10.7. Given a group Q and a Q-module K, then G = K x Q is a semidirect 
product of K by Q that realizes the operators. 

Proof. We begin by proving that G is a group. For associativity, 

[(a, x) + ( b , y)] + (c, z) = (a + xb, xy) + (c, z) 

— (a + xb + ( xy)c , (xy)z). 

On the other hand, 

(a,x) + [(b, y) + (c, z)] = (a, x) + (b + yc, yz) 

= (a + x{b + yc), x(yz)). 

Of course, (xy)z = x(yz), because of associativity in Q. The first coordinates are also 
equal: Since K is a (Lmodule, we have 

x(b + yc) — xb + x(yc) = xb + (xy)c. 

Thus, the operation is associative. The identity element of G is (0, 1), for 

(0, 1) + (a, x) — (0 + la, lx) — (a, x), 

and the inverse of (a, x) is (—x~ [ a, x -1 ), for 

(— x~ l a, x _1 ) + (a, x) = (— x~ l a + x~ l a, x - 1 x) = ( 0 , 1 ). 
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Therefore, G is a group, by Exercise 2.22 on page 61. 

Define a function p: G -> Q by p: (a , x ) x. Since the only “twist” occurs in the 
first coordinate, p is a surjective homomorphism with ker p — {(a, 1): a e K}. If we 
define i : K —> G by i: a i-> (a, 1), then 

o^^4g4(3^i 

is an extension. Define j: Q —> G by j: x i-> (0, x ). It is easy to see that j is a 
homomorphism, for (0, x ) + (0, y) = (0, xy). Now pjx — p( 0, x) = x, so that pj — 1q, 
and the extension splits; that is, G is a semidirect product of K by Q. Finally, G realizes 
the operators: If x e Q , then every lifting of x has the form i(x) — (b, x) for some b e K, 
and 


(b, x) + (a, 1 ) — (b, x) = (b + xa, x) + (—x l b,x 

— {b + xa + x(—x _ 1 b), xx~ 1 ) 

— (b + xa — b. 1 ) 

= (xa, 1). • 

We return to the multiplicative notation for a moment. In the next proof, the reader will 
see that the operation in K x Q arises from the identity 

( ax)(by) = a(xbx~ l )xy. 

Theorem 10.8. Let K be an abelian group. If a group G is a semidirect product of K by 
a group Q, then there is a Q-module structure on K so that G = K x Q. 

Proof. Regard G as a group with normal subgroup K that has Q as a complement. We 
continue writing G additively (even though it may not be abelian), and so will now write 
its subgroup Q additively as well. If a e K and x e Q, define 

xa — x + a — x; 

that is, xa is the conjugate of a by x. By Proposition 10.5, each g e G has a unique 
expression as g — a +x, where a e K and x e Q. It follows that < p: G -* K yi Q, defined 
by <p: a + x (a, x), is a bijection. We now show that cp is an isomorphism. 

<p((a + x) + (b + y)) — (p(a + x + b + (—x + x) + y) 

= <p(a + (x + b — x) + x + y) 

= (a + xb , x + y) 

The definition of addition in K x Q now gives 

(a + xb, x + y) — (a, x) + ( b, y) 

= <p(a + x) + <p(b + y). • 

We now use semidirect products to construct some groups. 
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Example 10.9. 

If AT = (a) = I 3 , then an automorphism of K is completely determined by the image 
of the generator a; either a i-> a and the automorphism is 1/f, or a 2a. Therefore, 
Aut(W) = I 2 ; let us denote its generator by (p, so that cp(a) = 2 a and <p(2a) — a; that is, (p 
multiplies by 2 . Let Q — (x) = I 4 , and define 9 : Q — > Aut(W) by 9 X = <p\ hence 

xa — 2a and x2a = a. 


The group 

T — I 3 x I 4 

is a group of order 12. If we define s — (2a, x 2 ) and t — (0, x), then the reader may check 
that 

6 s = 0 and 2t = 3s = 2 (s + t). 

The reader knows four other groups of order 12. The fundamental theorem says there 
are two abelian groups of this order: I 42 = 13 x 14 and IL x ^6 = V x I 3 . Two nonabelian 
groups of order 12 are A 4 and S 3 x I 2 (Exercise 10.7 on page 794 asks the reader to prove 
that A 4 ^ S 3 x I 2 ). The group T just constructed is a new example, and Exercise 10.17 on 
page 812 says that every group of order 12 is isomorphic to one of these five. [Note that 
Exercise 2.85(h) on page 113 states that Z>i 2 = S 3 x I 2 .] ◄ 


Example 10.10. 

Let p be a prime and let K — I p ® I p . Hence, K is a vector space over F p , and so 
Aut(7T) = GL(7T). We choose a basis a. b of K , and this gives an isomorphism Aut(7if) = 
GL(2, p). Let 0 — lx) be a cyclic group of order p. 

Define 0: <2 —> GL(2, p) by 




0 

1 


for all n e Z. Thus, 

xa — a + b and xb — b. 

It is easy to check that the commutator x + a — x — a = xa — a — b, and so G = K x Q 
is a group of order p 3 with G — (a, b. x); these generators satisfy relations 

pa = pb — px — 0, b — [x, a], and [ b , a] = 0 = [ b , x]. 

If p is odd, then we have the nonabelian group of order p 3 and exponent p in Propo¬ 
sition 5.45. If p — 2, then |G| = 8, and the reader is asked to prove, in Exercise 10.8 on 
page 794, that G = Z>s; that is, Z>8 = V xi I 2 . In Example 10.6(iii), we saw that D& is a 
semidirect product of I4 by I 2 . Thus, V xi I 2 = I4 x I 2 , and so a group can have different 
factorizations as a semidirect product. -4 
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Example 10.11. 

Let k be a field and let k x be its multiplicative group. Now k x acts on k by multiplication 
fif a e k and a ^ 0 , then the additive homomorphism x i-> ax is an automorphism whose 
inverse is x i-a a~ l x). Therefore, the semidirect product k x k x is defined. In particular, 
if ( b , a), ( d , c) e k x k x . then 

( b , a) + {d, c) — {ad + b, ac ). 

Recall that an affine map is a function /: k -> k of the form /: x ha- ax + b, where 
a. b e k and a ^ 0 , and the collection of all affine maps under composition is the group 
Aff(l, k). Note that if g(x) = cx + d, then 

(f °g)(x) — f(cx+d) 

— a(cx + d) + b 

— ( ac)x + (ad + b ). 

It is now easy to see that the function (p: (b, a) i-a- /, where f(x) — ax + b, is an 
isomomorphism k x k x -» Aff(l,£). ◄ 


Exercises 

In the first three exercises, the group K need not be abelian; in all other exercises, it is assumed to be 
abelian. 

10.1 Kernels in this exercise may not be abelian groups. 

(i) Prove that SL(2, F 5 ) is an extension of I2 by A 5 which is not a semidirect product. 

(ii) If is a field, prove that GL(«, k) is a semidirect product of SL(n, k) by k x . 

Hint. A complement consists of all matrices diag(l, ..., 1, a) with a e k x . 

10.2 Let G be a group of order mn, where (m, n) — 1. Prove that a normal subgroup K of order m 
has a complement in G if and only if there exists a subgroup C < G of order n. (Kernels in 
this exercise may not be abelian groups.) 

10.3 {Baer) Prove that a group G is injective 4 in the category of all groups if and only if G — {1}. 
(Kernels in this exercise may not be abelian groups.) 

Hint. Let A be free with basis {x, y), and let B be the semidirect product B = A x {z), where 
z is an element of order 2 that acts on A by zxz = y and zyz — x. 

10.4 Let SU (2) be the special unitary group consisting of all complex matrices [° K\ of determi¬ 
nant 1 such that 

ab + cd = 0, aa + bb = 1, cc + dd = 1. 

If S is the subgroup of H x in Example 10.6(v), prove that S = SU{ 2). 

Hint. Use Exercise 8.2 on page 531. 

4 The term injective had not yet been coined when R. Baer, who introduced the notion of injective module, 
proved this result. After recognizing that injective groups are duals of free groups, he jokingly called such groups 
fascist, and he was pleased to note that they are trivial. 
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10.5 Give an example of a split extension of groups 



for which there does not exist a homomorphism q : G —» K with qi = \g. Compare with 
Exercise 7.17. 

10.6 Prove that Q, the group of quaternions, is not a semidirect product. 

Hint. Recall that Q has a unique element of order 2. 

10.7 (i) Prove that A 4 £ S 3 x I 2 . 

Hint. Use Proposition 2.64 saying that A 4 has no subgroup of order 6 . 

(ii) Prove that no two of the nonabelian groups of order 12: A 4 , S 3 x I 2 , and T are isomor¬ 
phic. (See Example 10.9.) 

(iii) The affine group Aff(l, F 4 ) (see Example 10.11) is a nonabelian group of order 12. Is 
it isomorphic to A 4 , S 3 x I 2 , or T = 13 x I 4 ? 

10.8 Prove that the group G of order 8 constructed in Example 10.10 is isomorphic to Dg. 

10.9 If K and Q are solvable groups, prove that a semidirect product of K by Q is also solvable. 

10.10 Let K be an abelian group, let Q be a group, and let 9 : Q — > Aut(AT be a homomorphism. 
Prove that K y Q = K x Q if and only if 9 is the trivial map (9 X = 1 k f° r all Jf € Q). 

10.11 (i) If K is cyclic of prime order p , prove that Aut(A') is cyclic of order p — 1. 

(ii) Let G be a group of order pq, where p > q are primes. If q \ [p — 1). prove that G is 
cyclic. Conclude, for example, that every group of order 15 is cyclic. 

10.12 Let G be an additive abelian p-group, where p is prime. 

(i) If (m, p) = 1, prove that the function a 1 —*■ ma is an automorphism of G. 

(ii) If p is an odd prime and G = (g) is a cyclic group of order p~, prove that <p : G —> G, 
given by <p: a k>- 2a, is the unique automorphism with <p(pg ) = 2pg. 


10.3 General Extensions and Cohomology 

We now proceed to the study of the general extension problem: Given a group Q and an 
abelian group K , find all (not necessarily split) extensions G of K by Q. In light of our 
discussion of semidirect products, that is, of split extensions, it is reasonable to refine the 
problem by assuming that A is a Q-module and then to seek all those extensions realizing 
the operators. 

One way to describe a group G is to give a multiplication table for it; that is, to list all its 
elements a\,ai,... and all products (paj. Indeed, this is how we constructed semidirect 
products: the elements are all ordered pairs (a, x ) with a e K and x e Q, and multiplica¬ 
tion (really addition, because we have chosen to write G additively) is 

(a, x) + (b, y) = (a + xb, xy). 

O. Schreier, in 1926, solved the extension problem in this way, and we present his solution 
in this section. The proof is not deep; rather, it involves manipulating and organizing a 
long series of elementary calculations. 
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We must point out, however, that Schreier’s solution does not allow us to determine 
the number of nonisomorphic middle groups G. Of course, this last question has no easy 
answer. If a group G has order n, then there are n ! different lists of its elements and hence 
at most (n!)" different multiplication tables for G (there are n \ possibilities for each of 
the n rows). Suppose now that H is another group of order n. The problem of determin¬ 
ing whether or not G and H are isomorphic is essentially the problem of comparing the 
families of multiplication tables of each to see if there is one for G and one for H that 
coincide. 

Our strategy is to extract enough properties of a given extension G that will suffice to 
reconstruct G. Thus, we may assume that K is a 2-module, that G is an extension of K 
by Q that realizes the operators, and that a transversal t\ Q -» G has been chosen. With 
this initial data, we see that each g e G has a unique expression of the form 

g — a + £(x), a e K and x e Q; 

this follows from G being the disjoint union of the cosets K + £(x). Furthermore, if 
x, y e Q , then i(x) + l(y) and l{xy) are both representatives of the same coset (we do 
not say these representatives are the same!), and so there is an element /(x, y) e K such 
that 

i(x) + l(y) = f{x, y) + l{xy). 

Definition. Given a lifting l : Q -> G, with l (1) = 0, of an extension G of K by Q, 
then a factor set 5 (or cocycle ) is a function /: Q x Q -> K such that 

£(x) + £(y) = fix, y) + t-ixy) 


for all x, y e Q. 

It is natural to choose liftings with l (1) = 0, and so we have incorporated this condition 
into the definition of factor set; our factor sets are often called normalized factor sets. 

Of course, a factor set depends on the choice of lifting t. When G is a split exten¬ 
sion, then there exists a lifting that is a homomorphism; the corresponding factor set is 
identically 0. Therefore, we can regard a factor set as the obstruction to a lifting being a 
homomorphism; that is, factor sets describe how an extension differs from being a split 
extension. 

Proposition 10.12. Let Q be a group, K a Q-module, and 0 —> K —> G —> Q —> \ an 
extension realizing the operators. If l : Q —> G is a lifting with £(1) = 0 and f: Q x Q -> 
K is the corresponding factor set, then 

(i) for all x,y e Q, 

/(l,y) = 0= /(x, 1); 

5 If we switch to multiplicative notation, we see that a factor set occurs in the factorization l(x)l(y) = 
fix, y)Uxy). 
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(ii) the cocycle identity holds: For all x, y, z € Q, we have 

f(x, y) + /(x;y, z) = x/(y, z) + /(x, yz). 

Proof. Set x = 1 in the equation that defines fix, y), 

Hx) + £iy) = fix, y) + £(xy), 

to see that £(y) = /(l, y) + £(y) [since £(1) = 0 , by our new assumption], and hence 
/(l, y) = 0. Setting y — 1 gives the other equation of (i). 

The cocycle identity follows from associativity in G. For all x, y, z e Q, we have 

m) + liy)] + Hz) = fix, y) + Hxy) + Hz) 

= fix, y) + fixy, z) + Hxyz). 


On the other hand, 

Hx) + [£{y) + Hz)i = Hx) + fiy, z) + Hyz) 

= xfiy, z) + Hx) + Hyz) 

= xfiy,z) + fix,yz) + Hxyz). • 

It is more interesting that the converse is true. The next result generalizes the construc¬ 
tion of K xi Q in Proposition 10.7. 

Theorem 10.13. Given a group Q and a Q-module K, a function f : Q x Q —> K is a 
factor set if and only if it satisfies the cocycle identity 6 

xfiy, z) - fixy, z) + fix, yz) - fix, y) = 0 

and fi 1, y) — 0 — fix, 1 )for all x, y, z e Q. 

More precisely, there is an extension G of K by Q realizing the operators, and there is 
a transversal i: Q —> G whose corresponding factor set is f. 

Proof. Necessity is Proposition 10.12. For the converse, define G to be the set of all 
ordered pairs (a, x) in K x Q equipped with the operation 

ia, x) + ib, y) = ia + xb + fix, y), xy). 

(Thus, if / is identically 0, then G — K x i Q.) The proof that G is a group is similar to the 
proof of Proposition 10.7. The cocycle identity is used to prove associativity: 

((a, x) + ib. y)) + (c, z) = (a + xb + fix, y), xy) + (c, z) 

= (a + xb + fix, y) + xyc + fixy, z), xyz) 

^Written as an alternating sum. this identity is reminiscent of the formulas describing geometric cycles as 
described in Section 10.1. 
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and 


(a,x) + (( b , y) + (c, z)) = (a, x ) + (fo + yc + /(y, z), yz) 

= (a + x(? + xyc + x/(y, z) + /(x, yz), xyz). 

The cocycle identity shows that these elements are equal. 

We let the reader prove that the identity is (0, 1) and the inverse of (a, x) is 

— ( a , x) = (—x -1 a — x - 1 /(x, x -1 ), x -1 ). 

Define p: G -> Q hy p: (a, x) i-> x. Because the only “twist” occurs in the first coordi¬ 
nate, it is easy to see that p is a surjective homomorphism with ker p — {(a, 1) : a e K}. 

If we define i: K -> G hy i : a (a, 1), then we have an extension 0 -> K -V G -4- 

Q -> l. 

To see that this extension realizes the operators, we must show, for every lifting l, that 
xa — l(x) + a — i (x) for all a e K and x e Q. Now l (x) = (b, x) for some b e K and 

£(x) + (a, 1 ) — i(x) — ( b, x) + ( a , 1 ) — ( b , x) 

= (b + xa , x) + (— x~ l b — x _1 /(x, x -1 ), x _1 ) 

= (b + xa + x[—x~ l b — x _ 1 /(x, x -1 )] + /(x, x _1 ), 1 ) 

= (xa, 1 ). 

Finally, we must show that / is the factor set determined by l. Choose the lifting 
£(x) = (0, x) for all x e Q. The factor set F determined by l is defined by 

F(x, y ) = l (x) + £(y) - £(xy) 

= (0, x) + (0, y) - (0, xy) 

= (fix, y), xy) + (-(xy) _ 1 /(xy, (xy) -1 ), (xy) -1 ) 

= (fix, y) + xy[-(xy) - 1 /(xy, (xy) -1 )] + /(xy, (xy) -1 ), xy(xy) -1 ) 

= (fix, y), l). • 

The next result shows that we have found all the extensions of a Q -module K by a 
group g. 

Definition. Given a group Q, a g-module K, and a factor set /, let G(K. Q, f) denote 
the middle group of the extension of K by Q constructed in Theorem 10.13. 


Theorem 10.14. Let Q be a group, let K be a Q-module, and let G be an extension of K 
by Q realizing the operators. Then there exists a factor set f: Q x Q -> K with 


G = G(K, Q, /). 
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Proof. Let £: Q -> G be a lifting, and let /: Q x Q -> K be the corresponding factor 
set: that is, for all x,y e Q, we have 

£ 0 ) + £(y) = /(x, y) + £(xy). 

Since G is the disjoint union of the cosets, G — [J x€ q K + £(x), each g e G has a unique 
expression g = a + t(x) for a e K and x e Q. Uniqueness implies that the function 
<p: G -> G(K, Q. /), given by 

<p: g = a + £(x) (a, x), 

is a well-defined bijection. We now show that <p is an isomorphism. 

<p(a + £(x) + b + £(y)) — q>(a + £(x) + b - £(x) + £(x) + £(y)) 

= q>{a + xb + £(x) + £(y)) 

= <p(a + xb + /(x, y) + £(xy)) 

= (a + xb + fix , y), xy) 

= (a, x) + (b, y) 

= <p(a + £(x)) + (p(b + £(y)). • 

Remark. For later use, note that if a € K, then q>(a) — (p(a + £(1)) = (a, 1) and, 
if x e Q , then (p{£{x )) = (0, x). This would not be so had we chosen a lifting t with 
£(1)^0. « 

We have now described all extensions in terms of factor sets, but factor sets are deter¬ 
mined by liftings. Any extension has many different liftings, and so our description, which 
depends on a choice of lifting, must have repetitions. 

Lemma 10.15. Given a group Q and a Q-module K, let G be an extension of K by 
Q realizing the operators. Let £ and £' be liftings that give rise to factor sets f and f, 
respectively. Then there exists a function h: Q —> K with /z(l) = 0 and, for all x,y e Q, 

/'(*> }’) - fix, y) = xh(y) - h(xy) + h(x). 

Proof. For each x e Q, both £(x) and £'(x) lie in the same coset of K in G, and so there 
exists an element /z(x) e K with 


£'(x) = h(x) + £(x). 

Since £(1) — 0 — £'(1), we have h( 1) = 0. The main formula is derived as follows: 

£'(x) + £'(y) = [h(x) + £(x)] + [h(y) + £(y)] 

= h (x) + xh(y) + £(x) + £(y). 
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because G realizes the operators. The equations continue, 

l'(x) + i\y) = h{x) + xh(y) + f(x, y ) + i(xy) 

= h(x) + xh(y) + fix, y) - h(xy) + l\xy). 

By definition, f satisfies l'{x) + t'(y) — fix, y) + i’fxy). Therefore, 

fix, y) = h(x) + xh(y) + fix, y) - h{xy). 

and so 

f'(x<y)-f(x,y)=xh(y)-h(xy) + h (. x ). • 


Definition. Given a group Q and a (7-module K, a function g: Q x Q —> K is called a 
coboundary if there exists a function h : Q —> K with h(l) — 0 such that, for all x, y e Q. 

g(x, y) = xh(y) - h(xy ) + h(x). 

The term coboundary arises because its formula is an alternating sum analogous to the 
formula for geometric boundaries that we described in Section 10.1. 

We have just shown that if / and /' are factor sets of an extension G that arise from 
different liftings, then f — / is a coboundary. 

Definition. Given a group Q and a (7-module K , define 

Z 2 (6, K) — {all factor sets /: Q x (7 —»■ K] 
and 

B 2 (Q , K) = {all coboundaries g: Q x (7 —>■ K}. 

Proposition 10.16. Given a group Q and a Q-module K, then Z 2 (Q. K) is an abelian 
group with operation pointwise addition, 

/ + /': (x,y)\-± f(x,y) + f'(x,y), 

and B 2 (Q, K) is a subgroup of Z 2 (Q, K). 

Proof. To see that Z 2 is a group, it suffices to prove that f — f satisfies the two identities 
in Proposition 10.12. This is obvious: Just subtract the equations for / and /'. 

To see that B 2 is a subgroup of Z 2 , we must first show that every coboundary g is 
a factor set; that is, that g satisfies the two identities in Proposition 10.12. This, too, is 
routine and is left to the reader. Next, we must show that B 2 is a nonempty subset; but the 
zero function, g(x, y) = 0 for all x, ye Q, is clearly a coboundary. Finally, we show that 
B 2 is closed under subtraction. If h, h': Q —>• K show that g and g' are coboundaries, that 
is, g(x, y) = xh(y) — h(xy) + h(x) and g'(x, y) = xh'(y) — h'(xy) + h!(x), then 

(g ~ g'){x, y) = x(h - h')(y) - (h - h'){xy) + (h - h')(x). • 
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A given extension has many liftings and, hence, many factor sets, but the difference 
of any two of these factor sets is a coboundary. Therefore, the following quotient group 
suggests itself. 

Definition. The second cohomology group is defined by 

H 2 (Q, K) = Z 2 (Q, K)/B 2 (Q, K). 

Definition. Given a group Q and a (l-modulc K , two extensions G and G' of K by Q 
that realize the operators are called equivalent if there is a factor set f of G and a factor 
set /' of G' so that /' — / is a coboundary. 


Proposition 10.17. Given a group Q and a Q-module K, two extensions G and G' of K 
by Q that realize the operators are equivalent if and only if there exists an isomorphism 
y : G —> G' making the following diagram commute: 


0 

0 


>■ K 


G 


p 


Q 


i 

> K 


y 


1Q 


V 

> G' 



1 


1 


Remark. A diagram chase shows that any homomorphism y making the diagram com¬ 
mute is necessarily an isomorphism. ◄ 

Proof Assume that the two extensions are equivalent. We begin by setting up notation. 
Let l\ Q —> G and £': Q —»■ G' he liftings, and let /, /' be the corresponding factor sets; 
that is, for all x, ye Q, we have 

l(x) + £(y) = f(x, y) + i(xy), 

with a similar equation for /' and t'. Equivalence means that there is a function h: Q —>• K 
with h{ 1) = 0 and 


fix, y) - fix , y) = xhiy) - hixy) + h{x) 

for all x, y e Q. Since G = (J X &Q ^ + f (x ) is a disjoint union, each g e G has a unique 
expression g — a + £(x) for a e K and x e Q: similarly, each g' e G' has a unique 
expression g' — a + i'ix). 

This part of the proof generalizes that of Theorem 10.14. Define y : G —*■ G' by 
yia + l (x)) — a + h(x) + £'(x). 

This function makes the diagram commute. If a e K, then 


yia ) = yia + £(1)) = a + h(l) + £'(1) = a: 
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furthermore, 

p'yia + £(x)) — p'ia + h(x) + i'ix)) = x = p(a + f(x)). 

Finally, y is a homomorphism: 

y([a + t(x)] + [b + l(y)]) — y(a + xb + fix, y) + £(xy)) 

= a + xb + fix, y) + h(xy) + i'ixy), 

while 

y(a + iix)) + yib + iiy)) = (a + hix) + i\x)) + (b + h(y) + i'iy )) 

— a + hix) + xb + xhiy ) + f'ix, y) + t’ixy ) 

— a + xb + (hix) + xhiy) + f'ix, y)) + t’ixy) 

— a + xb + fix, y) + hixy) + i'ixy). 

We have used the given equation for / — /' [remember that the terms other than I'ixy) all 
lie in the abelian group K, and so they may be rearranged]. 

Conversely, assume that there exists an isomorphism y making the diagram commute, 
so that yia) — a for all a e K and 

x = pilix)) = p'yitix)) 

for all x e Q. It follows that yt: Q -> G' is a lifting. Applying y to the equation 
l(x) + iiy) = fix, y) + l{xy) that defines the factor set /, we see that yf is the factor set 
determined by the lifting yi. But yfix, y) = fix, y) for all x,y e Q because fix, y) e 
K. Therefore, / is also a factor set of the second extension. On the other hand, if /' is any 
other factor set for the second extension, then Lemma 10.15 shows that / - /' e B 2 : that 
is, the extensions are equivalent. • 

We say that the isomorphism y in Proposition 10.17 implements the equivalence. The 
remark after Theorem 10.14 shows that the isomorphism y : G -> G (K. Q, f) imple¬ 
ments an equivalence of extensions. 

Example 10.18. 

If two extensions of K by Q realizing the operators are equivalent, then their middle groups 
are isomorphic. However, the converse is false: We give an example of two inequivalent 
extensions with isomorphic middle groups. Let p be an odd prime, and consider the fol¬ 
lowing diagram: 



Define K = (a), a cyclic group of order p, G = (g) = G', a cyclic group of order p 1 , and 
Q = (x), where x = g + K. In the top row, define i (a) — pg and ir to be the natural map; 
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in the bottom row define i'(a) — 2 pg and n' to be the natural map. Note that i' is injective 
because p is odd. 

Suppose there is an isomorphism y: G —*■ G' making the diagram commute. Com¬ 
mutativity of the first square implies y(pa) — 2 pa, and this forces y(g) = 2g, by Exer¬ 
cise 10.12(h) on page 794; commutativity of the second square gives g + K — 2g + K ; 
that is, g e K. We conclude that the two extensions are not equivalent. ■* 

The next theorem summarizes the calculations in this section. 

Theorem 10.19 (Schreier). Let Q be a group, let K be a Q-module, and let e(Q, K ) 
denote the family of all the equivalence classes of extensions of K by Q realizing the 
operators. There is a bijection 

cp: H 2 {Q, K) e(Q, K) 

that takes 0 to the class of the split extension. 

Proof. Denote the equivalence class of an extension 

0 -+K-^G->Q->l 
by [G]. Define <p: H 2 (Q, K) —> e(Q, K ) by 

cp: f + B 2 ^[G(K, Q, /)], 

where / is a factor set of the extension and the target extension is that constructed in 
Theorem 10.13. 

First, cp is a well-defined injection: / and g are factor sets with f + B 2 — g + B 2 if and 
only if [G(K, Q , /)] = [G(K, Q, g)], by Proposition 10.17. To see that cp is a surjection, 
let [G] e e(Q, K). By Theorem 10.14 and the remark following it, [G] = [G(K, Q, /)] 
for some factor set /, and so [G] = < p(f + B 2 ). Finally, the zero factor set corresponds to 
the semidirect product. • 

If H is a group and if there is a bijection cp: H —»• X , where X is a set, then there is a 
unique operation defined on X making X a group and cp an isomorphism: Given x, y e X , 
there are g, h e H with x = cp(g) and y — cp(h), and we define xy — cp(gh). In particular, 
there is a way to add two equivalence classes of extensions; it is called Baer sum (see 
Section 10.6). 

Corollary 10.20. If Q is a group, K is a Q-module, and ll 2 (Q, K) = {0}, then every 
extension of K by Q realizing the operators is a semidirect product. 

Proof. By the theorem, e(Q, K) has only one element; since the split extension always 
exists, this one element must be the equivalence class of the split extension. Therefore, 
every extension of K by Q realizing the operators is split, and so its middle group is a 
semidirect product. • 


We now apply Schreier’s theorem. 
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Theorem 10.21. Let G be a finite group of order mn, where (in. n) = 1. If K is an 
abelian normal subgroup of order m, then K has a complement and G is a semidirect 
product. 

Proof. Define Q = G/K. By Corollary 10.20, it suffices to prove that every factor set 
/: Q x Q -+ K is a coboundary. Define a : Q — »■ K by 

o(x) = ^2 f(x,y ); 

ye Q 

a is well-defined because Q is finite and K is abelian. Now sum the cocycle identity 
xfiy, z) - /Cry, z) + f{x, yz ) - fix, y) = 0 
over all z £ Q to obtain 


xoiy) - cr(xy) + cs(x) = nf(x, y) 

(as z varies over all of Q, so does yz). Since On. n) = 1, there are integers s and t with 
sm + tn — 1. Define h: Q -> K by 


hfx) = toix). 


Note that /z(l) = 0 and 

xhiy) - hixy) + h(x) = fix , y) - ms fix, y). 

But sfix, y) e K, and so ms fix, y) — 0. Therefore, / is a coboundary. • 

Remark. P. Hall proved that if G is a finite solvable group of order mn, where (in. n ) = 1, 
then G has a subgroup of order m and any two such are conjugate. In particular, in a solv¬ 
able group, every (not necessarily normal) Sylow subgroup has a complement. Because of 
this theorem, a (not necessarily normal) subgroup H of a finite group G is called a Hall 
subgroup if (\H\, [G : H]) = 1. Thus, Theorem 10.21 is often stated as every normal Hall 
subgroup of an arbitrary finite group has a complement. -4 

We now use some group theory to remove the hypothesis that K be abelian. 

Theorem 10.22 (Schur-Zassenhaus 7 Lemma). Let G be a finite group of order mn, 
where (m, n) — 1. If K is a normal subgroup of order m, then K has a complement and 
G is a semidirect product. 


7 I. Schur proved this theorem, in 1904, for the special case Q cyclic. H. Zassenhaus, in 1938, proved the 
theorem for arbitrary finite Q. 
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Proof. By Exercise 10.2 on page 793, it suffices to prove that G contains a subgroup of 
order n; we prove the existence of such a subgroup by induction on m > 1. Of course, the 
base step m — 1 is true. 

Suppose that there is a proper subgroup T of K with {1} < T <1 G. Then K/T <1 G/T 
and ( G/T)/(K/T ) = G/K has order n. Since T < K. we have \K/T\ < |_ST| = m, 
and so the inductive hypothesis provides a subgroup N/T < G/T with \N/T\ — n. Now 
\N\ — n\T\, where (\T\,n) — 1 [because |T| is a divisor of | K | = in], so that T is a 
normal subgroup of N whose order and index are relatively prime. Since 7' | < \K \ = m, 
the inductive hypothesis provides a subgroup C of IV (which is obviously a subgroup of G ) 
of order n. 

We may now assume that K is a minimal normal subgroup of G; that is, there is no 
normal subgroup T of G with {1) < T < K. Let p be a prime divisor of | K | and let P be 
a Sylow /^-subgroup of K. By the Frattini argument. Exercise 5.21 on page 277, we have 
G = KN g (P). Therefore, 


G/K = KN g (P)/K 

= n g (P)/(k r\N G (P)) 

= N G {P)/N K {P). 

Hence, \Nk(P)W = \Nk(P)\\G/K\ — \N G (P)\. If N G (P ) is a proper subgroup of 
G, then \Nk(P)\ < m, and induction provides a subgroup of N G (P) < G of order n. 
Therefore, we may assume that N G (P ) = G; that is, P <\G. 

Since {1} < P < K and P is normal in G, we must have P — K, because K is 
a minimal normal subgroup. But P is a /;-group, and so its center, Z(P), is nontrivial. 
By Exercise 5.19(v) on page 277, we have Z(P) < G, and so Z(P) — P, again because 
P — K is a minimal normal subgroup of G. It follows that P is abelian, and we have 
reduced the problem to Theorem 10.21. • 

Corollary 10.23. If a finite group G has a normal Sylow p-subgroup P, for some prime 
divisor p o/|G|, then G is a semidirect product ; more precisely, P has a complement. 

Proof. The order and index of a Sylow subgroup are relatively prime. • 

There is another part of the Schur-Zassenhaus lemma that we have not stated: If K is a 
normal subgroup of G whose order and index are relatively prime, then any two comple¬ 
ments of K are conjugate subgroups. We are now going to see that there is an analog of 
H 2 (K , Q) whose vanishing implies conjugacy of complements when K is abelian. This 
group, H l (K, Q), arises, as did H 2 (K , Q), from a series of elementary calculations. 

We begin with a computational lemma. Let Q be a group, let K be a Q -module, and let 
0 —> K —> G Q — >lbea split extension. Choose a lifting t : Q -> G, so that every 
element g e G has a unique expression of the form 

g — a + i x. 


where a e K and x e Q. 
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Definition. An automorphism <p of a group G stabilizes an extension 0 -> K —>• G 
Q —> 1 if the following diagram commutes: 

0- K —G — Q - ^1 



<p 

1 i : 

P 

1 S' 


0 --^1 

The set of all stabilizing automorphisms of an extension of K by Q , where K is a 
Q-module, form a group under composition, denoted by 

Stable, K). 

Note that a stabilizing automorphism is an isomorphism that implements an equivalence 
of an extension with itself. We shall see, in Proposition 10.26, that Stable, K) does not 
depend on the extension. 

Proposition 10.24. Let Q be a group, let K be a Q-module, and let 

o^kAgAq->i 

be a split extension. If l: Q —>■ G is a lifting, then every stabilizing automorphism 
(p\ G —> G has the form 

cp(a + ix) — a + d(x) + lx, 

where d(x) € K is independent of the choice of lifting l. Moreover, this formula defines a 
stabilizing automorphism if and only if, for all x, y & Q, the function d: Q —> K satisfies 

d{xy) = d{x) + xdiy). 

Proof. If (p is stabilizing, then (pi = i, where /: K —> G, and pep = p. Since we are 
assuming that i is the inclusion [which is merely a convenience to allow us to write a 
instead of i(a)], we have (p(a) = a for all a e K. To use the second constraint on <p, 
suppose that cp(lx) = d(x) + ly for some d(x) e K and y € Q. Then 

x = p(lx) 

= PV(tx) 

= p(d(x) + ly) 

= y. 


that is, x — y. Therefore, 


( p(a + lx) — ep(a) + cp(lx) — a + d(x) + lx. 
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To see that the formula for d holds, we first show that d is independent of the choice of 
lifting. Suppose that l! : Q —> G is another lifting, so that cpll'x) — d'(x ) + t'x for some 
d'(x) e K. Now there is k(x) e K with I'x = k(x ) + lx, for pl'x — x — plx. Therefore, 

d'(x) — <p(l'x) — I'x 

— cp(k(x) + lx) — I'x 
= k(x) + d (x) + lx — I'x 
= d(x), 

because k(x) + lx — I'x — 0 . 

Since d(x) is independent of the choice of lifting l, and since the extension splits, we 
may assume that £ is a homomorphism: lx + ly — l(xy). We compute <p(lx + ly) in two 
ways. On the one hand, 

<p(lx + ly) — (p(l(xy)) — d(xy) + l(xy). 

On the other hand. 


<p(lx + ly) — <p(lx) + <p(ly) 

= d(x) + lx + d(y) + ly 
= d(x) + xd(y) + l(xy). 

The proof of the converse, if (p(a+lx) = a+d(x)+lx,’wheied satisfies the given iden¬ 
tity, then <p is a stabilizing isomorphism, is a routine argument that is left to the reader. • 

We give a name to functions like d. 

Definition. Let 0 be a group and let K be a Q-module. A derivation 8 (or crossed 
homomorphism) is a function d. Q —> K such that 

d{xy) = xd{y) + d(x). 

The set of all derivations, Der(<2, K), is an abelian group under pointwise addition [if K 
is a trivial g-module, then Der(g, K) = Horn((9. K )]. 

If d is a derivation, then <f(ll) = 1J(1) + d(Y) e K, and so c/(l) = 0. 

Example 10.25. 

(i) If Q is a group and K is a (0-module, then a function u : Q -* K of the form u(x) = 
xa o — ao, where ao e K, is a derivation: 

u(x) + xu(y) — xa() — oq + x(yao — oq) 

— xao — ao + xyao — xao 

— xyao ~ ao 
= u(xy). 

^Earlier, we defined a derivation of a (not necessarily associative) ring R as a function cl : R —> R with 
d(xy ) = d{x)y + xd{y). Derivations here are defined on modules, not on rings. 
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A derivation u of the form u (x) = xciq — ciq is called a principal derivation. 
If the action of Q on K is conjugation, xa = x + a — x, then 


xao — ao = x + ao — x — ao', 
that is, xao — ao is the commutator of x and oq. 

(ii) It is easy to check that the set PDer( Q, K ) of all the principal derivations is a subgroup 
ofDer(g,/f). ◄ 

Recall that Stahl Q, K) denotes the group of all the stabilizing automorphisms of an 
extension of K by Q. 

Proposition 10.26. If Q is a group, K is a Q-module, and 0 —> K —> G —> Q —> 1 is a 
split extension, then there is an isomorphism Stab(Q, K ) -* Der (Q, K ). 

Proof. Let ip be a stabilizing automorphism. If l: Q —> G is a lifting, then Proposi¬ 
tion 10.24 says that ip(a + lx) = a + d(x) + lx, where d is a derivation. Since this 
proposition further states that d is independent of the choice of lifting, tp (->• d is a well- 
defined function Stab( Q. K) -> Der (Q, K ), which is easily seen to be a homomorphism. 

To see that this map is an isomorphism, we construct its inverse. If d e Der (Q, K), de¬ 
fine ip\ G -> G by ip(a +lx) = a+d(x) + lx. Now ip is stabilizing, by Proposition 10.24, 
and d i->- q> is the desired inverse function. • 

It is not obvious from its definition that Stabf Q. K) is abelian, for its binary operation 
is composition. However, Stab((9, K) is abelian, for Deri Q, K) is. 

Recall that an automorphism q> of a group G is called an inner automorphism if it is a 
conjugation; that is, there is c e G with <p(g) = c + g — c for all g e G (if G is written 
additively). 

Lemma 10.27. Let 0->K-±G-*Q-+\bea split extension, and let l: Q —> G 
be a lifting. Then a function ip\ G —>■ G is an inner stabilizing automorphism by some 
ao € K if and only if 

q>{a + lx) — a + xao — ao + lx. 

Proof. If we write d (x) — xao—ao , then <p(a+lx) — a+d(x)+lx. Bute/ is a (principal) 
derivation, and so ip is a stabilizing automorphism, by Proposition 10.24. Finally, q> is 
conjugation by —ao, for 

—ao + (a + lx) + ao = —ao + a + xao + lx — ip(a + lx). 

Conversely, assume that ip is a stabilizing conjugation. That <p is stabilizing says that 
ip(a + lx) = a +d(x) + lx~, that cp is conjugation says that there is b e K with tp(a +lx) = 
b + a + lx — b. But b + a + lx — b — b + a — xb + lx, so that d(x) = b — xb, as 
desired. • 
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Definition. If Q is a group and K is a 0-module, define 

H\Q, K ) = Der(0, tf)/PDer(Q, K), 

where PDcr( Q, K) is the subgroup of Deri Q, K ) consisting of all the principal derivations. 

Proposition 10.28. Let 0 —> K —> G —> Q —> 1 be a split extension, and let C and C' 
be complements of K in G. If H l (Q, K) — {0}, then C and C' are conjugate. 

Proof Since G is a semidirect product, there are liftings l: Q —»• G, with image C, 
and l ': Q -> G, with image C', which are homomorphisms. Thus, the factor sets / 
and f' determined by each of these liftings is identically zero, and so /' — / = 0. But 
Lemma 10.15 says that there exists h: Q -> K, namely, h(x ) = i'x — lx, with 

0 = fix, y ) - f(x, y ) = xh{y) - h(xy) + h(x); 

thus, h is a derivation. Since H l (Q, K) — {0}, h is a principal derivation: there is ao e K 
with 

i'x — lx = h (x) = xao — «o 

for all x e Q. Since addition in G satisfies I'x — c/q = —xao + f 7 x, we have 
lx — ao — xao + I'x = ao + l' x — ao- 
But imf = C and im l' = C', and so C and C' are conjugate via ao- • 

We can now supplement the Schur-Zassenhaus theorem. 

Theorem 10.29. Let G be a finite group of order mn, where (m. n ) = 1. If K is an 
abelian normal subgroup of order m, then G is a semidirect product of K by G/K, and 
any two complements of K are conjugate. 

Proof. By Proposition 10.28, it suffices to prove that H l (Q, K) — {0}, where Q = G/K. 
Note, first, that \Q\ — \G\/\K\ — mn/m — n. 

Let d : Q -» K be a derivation: for all x, y e Q, we have 

d{xy) — xdiy) + d(x). 

Sum this equation over all y e Q to obtain 

A = xA + nd{x), 

where A = diy) (as y varies over Q, so does xy). Since ( m,n ) = 1, there are 

integers 5 and t with sn + tm — 1. Hence, 

d(x ) = snd(x) + tmd(x) — snd(x), 

because d(x) e K and so md(x) — 0. Therefore, 

d(x) = s A — xsA. 

Setting ao = —s A, we see that d is a principal derivation. • 
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Removing the assumption in Theorem 10.29 that K is abelian is much more difficult 
than removing this assumption in Theorem 10.21. We first prove that complements are 
conjugate if either K or / is a solvable group. Since |/| and K | are relatively prime, at 
least one of K or / has odd order. The Feit-Thompson theorem, which says that every 
group of odd order is solvable, now completes the proof. 

There are other applications of homology in group theory besides the Schur-Zassenhaus 
lemma. For example, if G is a group, a e G. and y a : g aga~ l is conjugation by a, 
then Ya '■ 8 ^ a n ga~ n for all n. Hence, if a has prime order p and a ^ Z(G), then y a 
is an automorphism of order p. A theorem of W. Gaschiitz uses cohomology to prove that 
every finite nonabelian p-group has an automorphism of order p that is not conjugation by 
an element of G. 

Let us contemplate the formulas that have arisen. 


factor set: 
coboundary : 
derivation : 
principal derivation : 


0 = xfiy, z) - /(.ry, z) + fix, yz) - fix, y) 
fix, y) — xhiy) - h(xy) + h(x) 

0 — xdiy) — d (xy) + d (x) 
dix) = xciq — ao 


All these formulas involve alternating sums; factor sets and derivations seem to be in ker¬ 
nels, and coboundaries and principal derivations seem to be in images. Let us make this 
more precise. 

Denote the cartesian product of n copies of / by /"; for clarity, we denote an element 
of /" by [x'i,..., x n ] instead of by (xi,..., x„ ). Factor sets and coboundaries are certain 
functions Q 2 —> K , and derivations are certain functions /* -* K. Let F n be the free 
left Z(/module with basis Q". By the definition of basis, every function /: Q" —> K 
gives a unique Q -homomorphism /: -> K extending /, for K is a /-module; that is, 

if Set(Z" , K) denotes the family of all functions Q" —>• K in the category of sets, then 
/ i-> / gives a bijection 

Set(<2", K) -> HomzeCFn, K). 

The inverse of this function is restriction 


res: Horn ZQiF„, K ) Set(g", K), 

defined by res: g g\Q n ■ 

We now define maps that are suggested by the various formulas: 

</: F 3 ->■ F 2 : d?,[x, y, z] — x[y, z] - [xy, z] + [x, yz] - [x, y]; 

d 2 '. F 2 F\ : d 2 [x, y] = x[y] - [xy] + [x]. 

In fact, we define one more map: let Q° = { 1 } be a 1 -point set, so that Fq — ZQ is the 
free /-module on the single generator, 1 . Now define 

d\ : F\ —*■ Fq : d\[x] = x — 1 . 

We have defined each of d 2 , d 2 , and d\ on bases of free modules, and so each extends to a 
/-map. 
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Proposition 10.30. The sequence 

„ d\ „ do „ d\ „ 

7*3 ”*■ F 2 7*0 

is a?; exact sequence of Q-modules. 

Sketch of Proof. We will only check that d\d 2 — 0 and did}, — 0; that is, 'm\d 2 C kert/i 
and imd} C kcrcL. The (trickier) reverse inclusions will be proved in Theorem 10.117 
after we introduce some homological algebra. 

d\d 2 [x, >•] = d\{x[y] - [xy] + [x]) 

= xd\ [>•] - d] [xy] + ddx]) 

— x(y - 1) - (xy - 1) + (x - 1) 

= 0 

(the equation t/|x[y] = xd\ [y] holds because d\ is a 0-map). The reader should note that 
this is the same calculation as in Proposition 10.16. 

d 2 d}[x, y, z ] = d 2 (x[y, z] - [xy, z] + [x, yz] - [x, y]) 

= xd 2 [y, z] - d 2 [xy, z] + d 2 [x, yz] - d 2 [x, y] 

= x{y[z] - [yz] + [y]) - (xy[z] - [xyz] + [xy]) 

+ (x[yz\ - [xyz] + [x]) - (x[y] - [xy] + [x]) 

= 0 • 

Let us recall that if X is a set and K is a module, then functions X —> K are the same 
as homomorphisms F K, where F is the free module having basis X: Formally, the 
functors Set(A, ) and Hom( F, ), which map zgMod —> Set, are naturally equivalent. 
Applying the contravariant functor Hom^gl , K) to the sequence in Proposition 10.30, we 
obtain a (not necessarily exact) sequence 

d* d* d* 

Hom(F 3 , K) Horn( F 2 , K) Hom(/q, K) Hom(F 0 , K); 

inserting the bijections res: g i->- g| 0" gives a commutative diagram of sets: 

Set(0 3 , K ) ■<-Set(0 2 , K) ^ -Set(0, K ) ^-Set([l], K) 


res 

res 

res 

dt 

d* 

d* 


Hom(F 3 , K ) -= Hom(F 2 , K) Hom(7q, K) -< Hom(F 0 , K). 

We regard a function /: Q n K as the restriction of the 0-map /: F n —»■ K which 
extends it. Suppose that /: 0 2 —> K lies in kerb*. Then 0 = d|(/) = fd}. Hence, for 
all x,y,z e 0, we have 

0= fd}[x,y,z] 

= f (x[y, z] - [xy, z] + [x, yz] - [x, y]) 

= xf[y, z] - /[xy, z] + /[x, yz] - /[x, y]; 
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the equation f(x[y, z]) = xf[y, z] holds because / is the restriction of a (1-map. Thus, 
/ is a factor set. If / lies in im d*. then there is some h: Q -> K with / = d*(h) = hd 2 . 
Thus, 

fix, y] = hd 2 [x, y] 

= h(x[y] - [xy] + M) 

= xh[y] — h[xy] + h[x]; 

the equation h{x[yYl — xh[y ] holds because h is the restriction of a (1-map. Thus, / is a 
coboundary. 

A similar analysis shows that if g: Q —> K lies in kerJ|, then g is a derivation. Let us 
now compute im d*. If k: {1} —> K , then 

d\{k) = kd\ (x) = k((x - 1)1) = (x - l)A(l), 

because k is the restriction of a Q- map. Now ATI) is merely an element of K ; indeed, 
if we identify k with its (1-point) image k( 1) = a o, then we see that d*(k ) is a principal 
derivation. 

Observe that d 2 d 2 = 0 implies d* 2 — 0, which is equivalent to im d* C kert/|; 
that is, every coboundary is a factor set, which is Proposition 10.16. Similarly, d\d 2 = 0 
implies im d* C ker d %; that is, every principal derivation is a derivation, which is Exam¬ 
ple 10.25(i). 

As long as we are computing kernels and images, what is kert/*? If A: {1} —»■ K and 
A(l) = a o, then k e kert/* says 

0 = J*(A) = kdi(x) = (x - l)A(l) = (x - 1 )a 0 , 

so that xii{) — ao for all x e Q. We have been led to the following definition. 

Definition. If Q is a group and K is a (9-module, then the submodule of fixed points is 
defined by 

H°(Q , K) = [a € K : xa = a for all x e Q}. 

The groups H 2 (Q. K ), II 1 (Q. K), and H°(Q , K) were obtained by applying the func¬ 
tor Hom( , K) to the exact sequence F 2 —> F 2 —> F\ —> Fq. In algebraic topology, we 
would also apply the functor ®jjq K. obtaining homology groups [the tensor product is 
defined because we may view the free (1-modules F„ as right (1-modules, as in Exam¬ 
ple 8.79(v)]: 

Hq(Q , K) — ker (J 0 <S> l)/im(Ji ® 1); 

H\(Q , K) — ker(Ji ® l)/im(J 2 <8> 1); 

H 2 (Q , K) = ker (d 2 <8> l)/im(J 3 ® 1). 

We can show that Hq(Q, K) is the maximal g-trivial quotient of K. In the special case 
K — Z viewed as a trivial (1-module, we see that ll\(Q. Z) = Q/Q ', where Q' is the 
commutator subgroup of Q. 

We discuss homological algebra in the next section, for it is the proper context in which 
to understand these constructions. 
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Exercises 

10.13 Let Q be a group and let K be a <2-module. Prove that any two split extensions of K by Q 
realizing the operators are equivalent. 

10.14 Let Q be a group and let A" be a g-module. 

(i) If K and Q are finite groups, prove that H 2 {Q, K ) is also finite. 

(ii) Let t(K, Q) denote the number of nonisomorphic middle groups G that occur in exten¬ 
sions of K by Q realizing the operators. Prove that 

t(K, Q) < \H 2 (Q,K)\. 

(iii) Give an example showing that the inequality in part (ii) can be strict. 

Hint. Observe that r(lp, Ip) = 2 (note that the kernel is the trivial module because 
every group of order p 2 is abelian). 

10.15 Recall Example 5.79 on page 307: & generalized quaternion group Q„ is a group of order 2", 
where n > 3, which is generated by two elements a and b such that 

on —1 _I _i o 2 n-2 

a~ — I. bab = a , and b~ = a 

(i) Prove that Q„ has a unique element z of order 2 and that Z(Q„) = (z). Conclude that 
Q„ is not a semidirect product. 

(ii) Prove that Q„ is a central extension (i.e., 9 is trivial) of I 2 by 

(iii) Using factor sets, give another proof of the existence of Q ; ,. 

10.16 If p is an odd prime, prove that every group G of order 2p is a semidirect product of I p by I 2 , 
and conclude that either G is cyclic or G = Dip- 

10.17 Show that every group G of order 12 is isomorphic to one of the following five groups: 

Il 2 , V x I 3 , A 4 , £3 x I 2 , T , 

where T is the group in Example 10.9. 

10.18 If Q is a group and A' is a <2- m °dule, let £ be a semidirect product of K by Q and let l : G —> 
E be a lifting. Prove that l(x) = (d(x), .v), where d :(?—»■ K, and £ is a homomorphism if 
and only if d is a derivation. 

10.19 If U : zqMoA -*■ Sets is the forgetful functor (which assigns to each module its set of ele¬ 
ments), prove that the ordered pair (<P, U) is an adjoint pair of functors. [By Exercise 7.39(ii) 
on page 471, there exists a free functor <I>: Set -*■ zgMod that assigns to each set X the free 
(?-module 4>(A - ) with basis X.] 

10.20 Prove that the functors Set(7C ) and Hom(<P, ), which map ggMod —»• Set, are naturally 
equivalent, where $ is the free functor defined in Exercise 10.19. 
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Let R be a ring. In this section, the word module will always mean “left /?-module.” Given 
a module M, there is a free module Fq and a suijection e: Fq —>• M ; thus, there is an exact 
sequence 

0 -» Oi -A Fo -U M -» 0, 

where Q\ — kere and i: Qi —> Fo is the inclusion. This is just another way of describing 
a presentation of M ; that is, a description of M by generators and relations. Thus, if X is 
a basis of Fo, then we say that X [or s(X )] are generators of M and that Q| are relations. 
The idea now is to take generators and relations of fi i, getting “second-order” relations 
fL, and to iterate this construction giving a free resolution of M, which should be regarded 
as a more detailed presentation of M by generators and relations. In algebraic topology, 
a topological space X is replaced by a sequence of chain groups, and this sequence yields 
the homology groups H n (X). We are now going to replace an R -module M by a resolution 
of it. 


Definition. A projective resolution of a module M is an exact sequence, 

• • • —>■ P n -> P n - 1 —»■ • • • —> P\ -> Po — * M -* 0, 

in which each module P n is projective. A free resolution is a projective resolution in 
which each module P n is free. 

Proposition 10.30 displayed an exact sequence of free left Z (1-modules 

_ d-x do „ d\ „ 

F3 —-> F 2 — F\ —-> F 0 , 

where F n is the free (1-module with basis Q". The module Fo — Z(1 is free on the 
generator 1, and the map d\ : F\ —> XQ is given by 

d\ '■ [x] i— y x — 1 . 


Proposition 10.31. For any group Q, there is an isomorphism ZQ/imdi = Z, where Z 
is regarded as a trivial Q-module. 

Proof. Define £: 7LQ -> Z by 

£: m x x m x . 

x&Q xeQ 


Now £ is a Q-ma p, for if x e Q, then s(x) — 1; on the other hand, e(x) — six ■ 1) = 
x£(l) = 1, because Z is a trivial Q-module. It is clear that £ is a surjection and that 
imc/i < ker£ [because e(x — 1) = 0]. For the reverse inclusion, if m * x e ker£, 

then m x = 0. Hence, 


m x x — m x x 
xeQ xeQ 


(Z m^j 1 = ^ m x (x - 1) e imc/i. 

xeQ xeQ 


Therefore, coker d\ — ZQ/imt/i = Z. • 
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Thus, the exact sequence in Proposition 10.30 can be lengthened so that it ends with 
coker d\ = 1QJ \ m d \, and so it looks like the beginning of a free resolution of the trivial 
Q -module Z. 

Proposition 10.32. Every module M has a free resolution (and hence it has a projective 
resolution). 

Proof. As in Section 10.1, there is a free module If and an exact sequence 

0 -* £2i -A- F 0 -A M 0. 

Similarly, there is a free module F\, a surjection e \: F\ —» Q j, and an exact sequence 

0 -» Q.2 -A- Fi -% -> 0. 

Define d\: F\ -> Fq to be the composite i\e\. It is plain that im d\ = = kere and 

ker d\ = Q 2 , so there is an exact sequence 

F\ - - ->- F 0 —M ->■ 0 



0 ->■ 

Plainly, this construction can be iterated for all n > 0 (so that the ultimate exact sequence 
is infinitely long). • 

There is a dual construction. 

Definition. An injective resolution of a module M is an exact sequence, 

0 -* M -* E° E l ->-^ E" -> E n+{ ->•••, 

in which each module E" is injective. 

Proposition 10.33. Every module M has an injective resolution. 

Proof. We use Theorem 8.104, which states that every module can be imbedded as a 
submodule of an injective module. Thus, there is an injective module E°, an injection 
rj: M —>• E°, and an exact sequence 

0 M -A E° -A E 1 -> 0, 

where Y. 1 = coker i] and p is the natural map. Now repeat: there is an injective module 
E l , an imbedding )] 1 : E 1 —>■ E 1 , yielding an exact sequence 

r) d° 

0->■ M ->- E° -^ E 1 



E 1 E 2 -0 

where d° is the composite d° — r; 1 p. This construction can be iterated. • 



Sec. 10.4 Homology Functors 


815 


We are now going to generalize both of these definitions. 


Definition. A complex 9 (C., d .) is a sequence of modules and maps, for every n e Z, 


C. 


„ d„ + 1 (in ' 

n+1 ^ ^ 


c n ~ i 


in which d„d n +\ = 0 for all n. The maps d„ are called differentiations. 

Usually, we will shorten the notation (C., d.) to C.. 

Note that the equation d n d n+ \ = 0 is equivalent to 


imd n +\ c ker d n 


Example 10.34. 

(i) Every exact sequence is a complex, for the required inclusions, im d n+ \ C ker d n , are 
now equalities, im d n+ \ — ker d„. 

(ii) The sequence of chain groups of a triangulated space X, 

• • • -> C 3 (X) X C 2 (X) X CfX) X Co(X), 

is a complex. However, a complex is supposed to have a module for every n e Z. We 
force this to be a complex by defining C n (X) = {0} for all negative n\ there is no problem 
defining differentiations d„ : C n (X) -> C n -\ [X ) for n < 0, for there is only the zero map 
from any module into {0}. 

(iii) In Chapter 9, we considered the de Rham complex of a connected open subset X of M": 

0 S2°(X) £l l (X) 4 f2 2 (X) ->-» S2" _I (X) X Q n (X) -> 0, 

where the maps are the exterior derivatives. 

(iv) The zero complex 0. is the complex (C., d.) each of whose terms C n — {0} and, 
necessarily, each of whose differentiations d n = 0. 

(v) If { M„ : n e Z} is any sequence of modules, then (M., d,) is a complex with nth term 
M„ if we define d„ — 0 for all n. 

(vi) Every homomorphism is a differentiation. If /: A -> B is a homomorphism, define a 
complex (C., d.) with C\ = A, Co = B. d\ — /, and all other terms and differentiations 
zero. 

(vii) Every projective resolution of a module M, 

-► Pi -> P 0 M 0, 

is a complex if we add {0}’s to the right. 


®These are also called chain complexes in the literature. 
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(viii) Every injective resolution of a module M, 

0 -* M -» E° E l -» • • • , 
is a complex if we add {0}’s to the left. 

We have used a convenient notation. According to the definition of complex, differen¬ 
tiations lower the index: d n : C„ —> C n ~\. The simplest way to satisfy the definition is to 
use negative indices: define C_„ = E", and 

0 -> M -> C 0 -> C-i C-2 -> • • • 


is a complex. 

(ix) If C. is a complex, 

C. 

and if F is an additive (covariant) functor, say, F : ft Mod Ab, then F( C.), defined by 

Fd 

F{ C.) = •••-»- F(C„) -4 F(C n - 1) 


C n 


C n — i 


is also a complex: 

0 = F (0) = F(d n d n + 1) = F(d n )F(d n+1 y, 

the equation 0 = /-TO) holds because F is an additive functor. Note that even if the original 
complex is exact, the functored complex F( C. ) may not be exact. 

(x) If F is a contravariant additive functor, it is also true that F( C.) is a complex, but we 
have to arrange notation so that differentiations lower indices by 1. In more detail, after 
applying F, we have 

Fd 

F( C.) = ■••<- F{C n ) ^ F{C n - 1) 

the differentiations Fd n increase indices by 1. Introducing negative indices almost solves 
the problem. If we define X- n = F(C„), then the sequence is rewritten as 

Fd 

F{ C.) = ■■■-> A_„ +1 -4- A_„ 

However, the index on the map should be — n + 1, and not n. Define 

$—n+l = Fd n . 

The relabeled sequence now reads properly: 

F( C.) = ■■•-»- X_„ +1 ^ 

Negative indices are awkward, however, and the following notation is customary: 
Change the sign of the index by raising it to a superscript: Write 

5 " = 8 - n . 
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The final version of the functored sequence now looks like this: 


F( C.) 




72 — 1 


X n 


It is convenient to consider the category of all complexes, and so we introduce its mor- 
phisms. 


Definition. If (C., cl.) and ( C' m , d'.) are complexes, then a chain map 

/ = /.: (C„ </.)-► (€'„<) 

is a sequence of maps /„ : C„ —> C' n for all n e Z making the following diagram commute: 

^"+1 ^ dn n 

^ '-' 22+1 ^22 ^ 22—1 > ' 


fn +1 

fn 




d'„ 


r' 

^ 22+1 


C' 


\ f, -i 

■cu¬ 


lt is easy to check that the composite g/ of two chain maps 

/.: (C„ J.) (Cl, d'j and g.: (C'., d'j (Cl', <') 

is itself a chain map, where (g/)„ — g„f n - The identity chain map lc. on (C., d. ) is the 
sequence of identity maps 1 c n ■ C n —> C„. 


Definition. If R is a ring, then the category of all complexes of left /^-modules is denoted 
by a' Comp; if the ring R is understood, then we will omit the prescript R. 

The category Comp is a preadditive category (that is, the Horn’s are abelian groups and 
the distributive laws hold whenever possible) if we define 

(/ + g)n = fn + gn for all n e Z. 

The following definitions imitate the construction of homology groups of triangulated 
spaces that we described in Section 10.1. 


Definition. If (C., d.) is a complex, define 

n-cycles = Z„(C.) = ker d n \ 
n-boundaries — B„( C.) = im d n+ \. 

Since the equation d n d n + \ — 0 in a complex is equivalent to the condition 

\md n+ \ c ker d n , 

we have B n ( C.) C Z„(C.) for every complex C.. 
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Definition. If C. is a complex and n e Z, its nth homology is 

H n (C.) = Z„(C.)/B„(€.). 


Example 10.35. 

A complex is an exact sequence if and only if all its homology groups are {0}: that is, 
H n ( C.) = {0} for all n. Thus, the homology groups measure the deviation of a complex 
from being an exact sequence. An exact sequence is often called an acyclic complex ; 
acyclic means “no cycles”; that is, no cycles that are not boundaries. ■* 


Example 10.36. 

In Example 10.34(vi), we saw that every homomorphism f: A —* B can be viewed as 
part of a complex C. with C\ = ,4, C'o = 11, d\ — /, and having {0}’s to the left and to 
the right. Now do — 0 implies im cl 2 = 0, and do = 0 implies kcr do = B; it follows that 


Hn( c.) 


ker / if n — 1; 

coker / if n — 0; 

0 otherwise. 4 


Proposition 10.37. For each n e Z, homology H n : ^ Comp -> ^Mod is an additive 
functor. 

Proof. We have just defined H„ on objects; it remains to define H n on morphisms. If 
/: (C., d,) —> (C., d ' % ) is a chain map, define H n (/): H n (C.) —>■ H n (C.) by 

H n (f) ■ Zn + B n ( C.) I— > fnZn + B n (C'f). 


We must show that f n z n is a cycle and that H n (f) is independent of the choice of cycle 
z„ ; both of these follow from / being a chain map; that is, from commutativity of the 
following diagram: 


Cn +1 


Cn 


C „-1 


fn +1 


c 

H + l 


C' 


■CL 


n— 1 


First, let z be an ;;-cycle in Z„(C.), so that d n z — 0. Then commutativity of the diagram 
gives 

d'nfnZ — fn-ldnZ = 0. 

Therefore, f n z is an n -cycle. 

Next, assume that z + B„( C.) = y + B n (C.); hence, z — y e B n { C.); that is, 


z-y = d n+ \c 
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for some c G C n+ \ . Applying f n gives 

fnZ - f n y = fndn+ic = d' n + x f n + \C G B n (C'j. 

Thus, f nZ + BJCJ = f n y + B n (C'.). 

Let us see that H n is a functor. It is obvious that H n ( lc.) is the identity. If / and g are 
chain maps whose composite gf is defined, then for every n -cycle z., we have 

H n (gf) '■ Z + B i-> (gf) n (z + B) 

= gnfn(z + B ) 

= H n (g)(f n z.+ B) 

= H n (g)H n (f)(z + B). 

Finally, H n is additive: if g: (C., cl. ) —> (C'., d'.) is another chain map, then 

H n (f + g)- z + B n ( C.) i-^- (/„ + g n )z + B„(C'.) 

— fnZ T g„Z T B„(C'J 
= (H n (f) + H n (g))(z + B n ( CD). . 

Definition. We call H n (f) the induced map , and we usually denote it by /„*, or even 

by /*• 


Proposition 10.38. Let R and A be rings, and let T : Mod -> \ Motl be an exact 
additive functor. Then T commutes with homology, that is, for every complex (C., d.) G 
^Comp and for every n G Z, there is an isomorphism 

H n (TC., Td.) = TH n (C.,d.), 

Proof. Consider the commutative diagram with exact bottom row. 


> H n (C.) ->- 0 , 


C 


n +1 


■imd n+ i 


C„ 


I k 

ker d n 


where j, and k are inclusions and d' n+[ is just d n + 1 with its target changed from C n to 
m\d n +\. Applying the exact functor T gives the commutative diagram with exact bottom 
row 


rj, r i Td n .)-i Td n 


Td' 


n +1 


Tj 


Tk 


T (imt/„ + i) ——>■ T (ker d„) 


TH„( C.)-^0 
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On the other hand, because T is exact, we have T (im d n+ \) — im T (d n+ \) and T (ker d„) = 
ker(r d„), so that the bottom row is 

0 -> im(Td n+ i) -* ker (Td n ) TH„( C.) 0. 

By definition, ker(7’c/„)/im(Ta f „ + i) = H„(TC.), and so H,,(TC.) = TH n ( C.), by 
Proposition 8.93. • 

We now introduce a notion that arises in topology. 


Definition. A chain map /: (C., cl. ) — >• (C^, d ') is nullhomotopic if, for all n, there are 
maps s n : A„ —> A' n+1 with 

fn = d n + 1 s n -p S n —\d n . 



If /, g: (C., d .) —> (Cl. d’.) are chain maps, then / is homotopic 10 to g, denoted by 
/ ±2 g, if / — g is nullhomotopic. 


Proposition 10.39. Homotopic chain maps induce the same homomorphism between 
homology groups: if f, g : (C., d.) —> (C., d' m ) are chain maps and f ^2 g, then 

f*n = g*n ■ H„( C.) -> H„ (£.). 

Proof. If z is an /;-cycle, then d„z = 0 and 

fnZ 8nZ = d n+1 S n Z “h S n — |cl n Z — d n _^_^S n Z. 

Therefore, f,z - g„z e «„(€.), and so /*„ = g, tn . • 


Definition. A complex (C.,d.) has a contracting honwtopy 11 if its identity lc. is 
nullhomotopic. 


10 Two continuous functions f,g:X^Y are called homotopic if / can be "deformed" into g\ that is, there 
exists a continuous F : X x I —>- Y, where I = [0, 1] is the closed unit interval, with F(x, 0) = f{x ) and 
F{x, 1) = g(;r) for all x e X. Now every continuous / : X — >• Y induces homomorphisms /* : H n (X ) —>- 
H n (Y), and one proves that if / and g are homotopic, then /* = g*. The algebraic definition of homotopy given 
here has been distilled from the proof of this topological theorem. 

11 A topological space is called contractible if its identity map is homotopic to a constant map. 
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Proposition 10.40. A complex (C., d,) having a contracting homotopy is acyclic, that 
is, it is an exact sequence. 

Proof. We use Example 10.35. Now lc.: H n ( C.) -> //„ (C.) is the identity map, while 
0*: H n ( C.) -> H n ( C.) is the zero map. Since lc. — 0, however, these maps are the 
same. It follows that H n ( C.) = {0} for all n; that is, ker<r/ /( = im d n+ \ for all n, and this is 
the definition of exactness. • 

Once we complete the free resolution of the trivial Z(1-module Z whose first few terms 
were given in Proposition 10.30 (see also Proposition 10.31), we will prove that it is an 
exact sequence by showing that it has a contracting homotopy as a complex of abelian 
groups. 

In order to study the homology functors, it is necessary to understand their domain 
Comp. Many of the constructions in ^Mod can also be done in the category Comp. We 
merely list the definitions and state certain properties, whose verifications are straightfor¬ 
ward exercises for the reader. 


(i) An isomorphism in Comp is an equivalence in this category. The reader should 
check that a chain map /: C. -> C, is an isomorphism if and only if f n : C„ -> C' n 
is an isomorphism in /.-Mod for all n e Z. (We must check that the sequence of 
inverses f~ l is a chain map; that is, that the appropriate diagram commutes.) 

(ii) A complex (A., <$.) is a subcomplex of a complex (C., d.) if, for every n e Z, we 
have A n a submodule of C n and S n = d n \A n . 

If i„ : A n -> C„ is the inclusion, then it is easy to see that A. is a subcomplex of C. 
if and only if i: A. -> C. is a chain map. 

(iii) If A. is a subcomplex of C., then the quotient complex is 

C./A. = • • • -»• C n /A n -4+ C n -i/A n -\ ->•••, 

where d \[: c n + A n i-> d„c„ + A„_i (it must be shown that d[[ is well-defined: if 
c n -|- A fl — b a A a , then d n c n -f- A n — j — d n b n -h A^_[). If 7 T fl . C f j ^ A n is the 
natural map, then it: C. —> C./A. is a chain map. 

(iv) If /. : (C., d.) -» (C^, df) is a chain map, define 

ker/ =-* ker/„ + i ker f n ker /„_i , 


where S„ — d n \ ker /„, and define 

im/ =-> im/„ + i im/„ im/„_i ->•••, 

where A„ = d' n \ im /„. It is easy to see that ker/ is a subcomplex of C., that im/ 
is a subcomplex of C'., and that the first isomorphism theorem holds: 

C./ker / = im/ 
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(v) A sequence of complexes and chain maps 

fn+l rn 

_^ £ n+1 ^ ^ n J ^ ^ n— 1 _^ 

is an exact sequence if, for all n e Z, 

im f n+l = ker/". 

We may check that if A. is a subcomplex of C., then there is an exact sequence of 
complexes 

0. -»■ A. —+ C„ 

where 0. is the zero complex and i is the chain map of inclusions. More generally, if 
i : C. —> C'. is a chain map, then each i n is injective if and only if there is an exact 

sequence 0. —» C. — '-*■ C'.. Similarly, if p : C. -> C" is a chain map, then each p n 
is surjective if and only if there is an exact sequence 

c. -A c: o„ 

The reader should realize that this notation is very compact. For example, if we 
write a complex as a column, then a short exact sequence of complexes is really the 
infinite commutative diagram with exact rows: 


■ C 


l n +1 ’ Pn +1 ^ 


n +1 


d 1 


«+1 

>c: 

d[ 


r 

'-'77 


Pn 


n+l 




l/i-l ^ 

-„_1 ► t-n-l 


>. r" - 
d'l 

C" 


n—1 


->■ o 


A sequence of complexes 
only if 


€. 


rn 

n -\-1 


C m n A-> C. 


n— 1 


c 


n+l 


C" -* C" 1 -> ••• 


• • • is exact if and 


is an exact sequence of modules for every hi e Z. 
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(vi) If {(C“, d")} is a family of complexes, then their direct sum is the complex 

E c ? = “ E c “ “ _1 E c “-i 

a a a a 

where acts coordinatewise; that is, '■ ( c “) l-> c“). 

To summarize, we can view Comp as a category having virtually the same properties 
as the category of modules; indeed, we should view a complex as a generalized module. 
(Categories such as r Mod and Comp are called abelian categories .) 

The following elementary construction is fundamental; it gives a relation between dif¬ 
ferent homology modules. The proof is a series of diagram chases. Ordinarily, we would 
just say that the proof is routine, but, because of the importance of the result, we present 
(perhaps too many) details; as a sign that the proof is routine, we drop subscripts. 


Proposition 10.41 (Connecting Homomorphism). If 


0. 


c: 


i 



c. 


o. 


is an exact sequence of complexes, then, for each n e Z, there is a homomorphism 


3„: H n (C:) - ff„_i(Cl) 


defined by 

9„: z" + B n (Cl') H* i;\d n pfh'; + B n _ 1 (Cl). 

Proof. We will make many notational abbreviations in this proof. Consider the commu¬ 
tative diagram having exact rows: 


0 


0 


t . | V 

'"+ 1 r, P"+ l /-'ll 

Si+1 ^ ( -«+1 ^ L 'n + 1 


0 


d n +1 


I 

C’„ 


&n +1 


> C„ 



^0 


0 


C’ 

1 


Cn-1 


Pn~ 1 


c 


n— 1 


o 


Suppose that z" e C" and d"z," = 0. Since p n is surjective, there is c e C n with pc = z" ■ 
Now push c down to dc e C n -\. By commutativity, p n -\dc — d"p n c = d"z" — 0, so 
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that dc e kcri = imi„_i. Therefore, there is a unique c e C' x with i n -\c' = dc, 
for i is an injection. Thus, i~\dp~ l z" makes sense; that is, the claim is that 

3 n(z" + fi") = c' + B' n _, 


is a well-defined homomorphism. 

First, let us show independence of the choice of lifting. Suppose that p n c = z,", where 
c e C„. Then c — c e ker p„ = im i n , so that there is u' e C' n with i n u' = c — c. By 
commutativity of the first square, we have 

i n -\d'u — di n u' — dc — dc. 

Hence, i~ l dc — i~ [ dc = d'u' e B r that is, i~ l dc + B r . — i~ l dc + B' l . Thus, the 
formula gives a well-defined function 


C' n -JB' n _ v 


Second, the function Z" -> C'_ 1 /B'_ 1 is a homomorphism. If z", z![ e Z", let pc = 
z" and pc\ = z![. Since the definition of 3 is independent of the choice of lifting, choose 
c + ci as a lifting of z" + z![. This step may now be completed in a routine way. 

Third, we show that if i n -\c' = dc, then c' is a cycle: 0 = ddc — die' — idc' , and so 
d'e' — 0 because i is an injection. Hence, the formula gives a homomorphism 


Z" Z'/B' = H n —i . 


Finally, the subgroup B\[ goes into B' n _ { . Suppose that z" = d”c”, where c" e C" +1 , 
and let pu = c", where u e C„+ 1 . Commutativity gives pdu — d"pu = d"c" = z"■ 
Since d(z") is independent of the choice of lifting, we choose du with pdu — z", and so 
d(z" + B") = i~ l d(du) + B' = B'. Therefore, the formula does give a homomorphism 
3„: fln-tCCl). • 

The first question we ask is what homology functors do to a short exact sequence of 
complexes. The next theorem is also proved by diagram chasing and, again, we give too 
many details because of the importance of the result. The reader should try to prove the 
theorem before looking at the proof we give. 


Theorem 10.42 (Long Exact Sequence). If 

o. -> c; 4c.^> c: -> o. 

is an exact sequence of complexes, then there is an exact sequence of modules 
-> /WO ^4 w„(0 H n { C.) -4 H n ( C") -4 H n -i(C.) 
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Proof. This proof is also routine. Our notation is abbreviated, and there are six inclusions 
to verify. 

(i) im /* C ker p* 

p*i* — (pi)* =0* — 0 


(ii) kerp* c im/* 

If p*(z + B) — pz + B" = B". then pz — d"c" for some c" e C" +1 . But p surjective 
gives c" — pc for some c e C„+i, so that pz — d"pc — pdc, because p is a chain map, 
and so p(z — dc ) = 0. By exactness, there is c' e C' n with ic' = z — dc. Now c' is a 
cycle, for id'c' = did — dz — ddc = 0, because z is a cycle; since / is injective, d'c' = 0. 
Therefore, 

i*(c' + B') — ic' + B — z — dc+B —z + B. 


(iii) im p* C ker 3 

If p*(c + B) — pc + B" e imp*, then 3 {pz + B") = z! + B ', where iz! — dp~ l pz. 
Since this formula is independent of the choice of lifing of pz, let us choose p _1 pz = Z. 
Now dp~ l pz = dz = 0, because z is a cycle. Thus, iz! = 0, and hence z! — 0, because / 
is injective. 

(iv) ker 3 C im p* 

If 3 (z" + B") = B then z! — i~ l dp~ l z" e B'-, that is, z! — d'c' for some d e C'. But 
iz! — id'c' — die' = dp~ l z", so that d(p~ l z" — ic') = 0; that is, p~' L z!' — ic' is a cycle. 
Moreover, since pi — 0 because of exactness of the original sequence, 

p*(p~ l z" - id + B) — pp~ l z" - pic' + B" = z" + B". 


(v) im 3 C ker /* 

We have i*d(z." + B") — iz' + B', where iz! — dp~ l z" e B\ that is, /*3 = 0. 

(vi) ker /* C im 3 

If i*(z' + B') — iz' + B = B, then iz! — dc for some c e C. Since p is a chain map, 
d"pc = pdc = piz' — 0, by exactness of the original sequence, and so pc is a cycle. But 

3 (pc + B") — i~ l dp~ l pc + B' = i~ l dc + B' — i~ l iz' + B' = z! + B'. • 

Theorem 10.42 is often called the exact triangle because of the diagram 




*H.( C.) 
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Corollary 10.43 (Snake Lemma). Given a commutative diagram of modules with exact 
rows, 

0-*• A' 

f 

0-> B' 

there is an exact sequence 

0 —> ker / —*■ kerg —> ker h —> coker / —> coker g —> coker h —*■ 0. 

Proof. If we view each of the vertical maps /, g, and h as a complex [as in Exam¬ 
ple 10.34(vi)], then the given commutative diagram can be viewed as a short exact se¬ 
quence of complexes. The homology groups of each of these complexes has only two 
nonzero terms: for example. Example 10.36 shows that the homology groups of the first 
column are H\ = ker/, Hq — coker/, and all other H n = {0}. The snake lemma now 
follows at once from the long exact sequence. • 


l-*- A"->■ 0 

8 h 

!->■ B" -^ 0 


Theorem 10.44 (Naturality of 3). Given a commutative diagram of complexes with 
exact rows. 


0. 

0. 


-c: 

/ 

-a: 


c. 


g 

V 

f \ 

/ J L 

4 A 


c: 

I h 


o. 


^ 0. 


there is a commutative diagram of modules with exact rows , 


-*- ///€.) H n ( C.) H n (C>:) —^ //„-i(C.)->■ • • • 

/* 8* h * /* 

y . y Y 

-*- H n (AD H n ( A.) H n (K) —^ H n - t(Al)-• • • 

Proof Exactness of the rows is Theorem 10.42, while commutativity of the first two 
squares follows from H n being a functor. To prove commutativity of the square involving 
the connecting homomorphism, let us first display the chain maps and differentiations in 
one (three-dimensional!) diagram: 


—.> c' —' > c n —„> c;; 
r.\u /„\s* r \, 

(-'! i I f-' " I (~\n i 


^0 


f* 


A' _ 

A n • 

I Jtg 
A ;? _i . >- A„_i 


g* i 

A n - 

/ h t /, 

A" 


A" — 


n—1 


0 


0 
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If z" + B(C") e H n ( C"), we must show that 

/*9(z" + 6(0) = 9'Mz" + S(C1')). 

Let c e C n be a lifting of z"\ that is, pc = z" ■ Now d(z" + B( C")) = z! + B(C' a ), where 

iz' = dc. Hence, /*9(z" + B(A")) = fz' + B(A' t ). On the other hand, since h is a chain 

map, we have qgc = hpc = hz" ■ In computing d'(hz" + B( A")), we choose gc as the 

lifting of hz". Hence, d'(hz" + B( A") = u' + B(A' t ), where ju' — Sgc. But 

jfz' = giz' = gdc = Sgc = ju', 

and so fz' = u', because j is injective. • 

We shall apply these general results in the next section. 

Exercises 

10.21 If C. is a complex with C n = {0} for some n, prove that H n (C.) = {0}. 

10.22 Prove that isomorphic complexes have the same homology: If C. and D. are isomorphic, then 
H n ( C.) = H n ( D.) for all n. 

10.23 Regard the map d : Z -*■ Z, defined by d: m 2/m, as a complex, as in Example 10.34(vi). 
Prove that it is not a projective object in the category ;gComp even though each of its terms is 
a projective Z-module. 

10.24 View Z as the category PO(Z) whose objects are the integers, and with exactly one morphism 
m -*■ in whenever m < n, with no morphisms otherwise. [If we view Z as a partially ordered 
set, then this is the associated category defined in Example 7.25(v).] Prove that a complex 
(C«, 9.) is a contravariant functor PO(Z) r Mod, and that a chain map is a natural trans¬ 
formation. 

10.25 In this exercise, we prove that the snake lemma implies the long exact sequence (the converse 
is Corollary 10.43). Consider a commutative diagram with exact rows (note that two zeros are 
“missing” from this diagram): 


A - 

I a 


p 

B -^ C -^0 


P 


y 


Y , V V 

0- A' - B' -> c' 

(i) Prove that A: ker y —> coker c/, defined by 

A: z + ima, 

is a well-defined homomorphism. 

(ii) Prove that there is an exact sequence 


ker a —>■ ker f) —>■ kery 


A 


coker a —>■ coker fl -*■ coker y. 
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(iii) Given a commutative diagram with exact rows, 

0->- A' ->■ A n ->■ A" 


d'„ 


0 


■ A' 


n —1 


I" 

A n _ i 


d" 


A" 

' A /i-l 


>0 


0 


prove that the following diagram is commutative and has exact rows: 

A'J im d ' n+ j - A n / im d n +\ - * A"/ im d ” + , 


- ker d f , 
n —1 


r 

• kerd„_i 


• kerd" , 
n— 1 


(iv) Use part (ii) and this last diagram to give another proof of the long exact sequence. 

10.26 Let /, g: C. — > be chain maps, and let F : C. —>■ be an additive functor. If f g, 

prove that Ff — Fg\ that is, if / and g are homotopic, then Ff and Fg are homotopic. 

10.27 Let 0. -*■ — U- C. —C” -*■ 0. be an exact sequence of complexes in which and 

C" are acyclic; prove that C. is also acyclic. 

10.28 Let (C., d,) be a complex each of whose differentiations d n is the zero map. Prove that 
H n ( C.) = C n for all n. 

10.29 (3x3 Lemma) Given a commutative diagram in which the columns and the bottom two rows 
are exact sequences, 


0 0 0 


0 


0 


0 


K' - 


> P' 


> A' 


> K" -=? 0 


> p" 0 

> A" -* 0 


V V Y 

0 0 0 


prove that the top row is an exact sequence. 

10.30 Prove that homology commutes with direct sums: For all n, there are natural isomorphisms 

a a 
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10.31 (i) Define a direct system of complexes {C^, i p',}, and prove that lim exists. 

(ii) If (C^, (ft 1 -} is a direct system of complexes over a directed index set, prove, for all n > 0. 
that 

H n (ljmCi) = limH n (Ci). 

10.32 Suppose that a complex (C,, cl, ) of R -modules has a contracting homotopy in which the maps 
s n : C n -*■ C n+ \ satisfying 

1 C„ — d n +l s n + S n —\dn 
are only Z-maps. Prove that (C., d,) is an exact sequence. 

10.33 (i) Let 0 —> F n —> F„_\ —>■■■—>■ Fq —>■ 0 be an exact sequence of finitely generated 

free ^-modules, where k is a commutative ring. Prove that 

n 

X)(-l)' rank(F;) = 0. 

i=0 


(ii) Let 


F„ 


F, 


ii—l 


F 0 ^ 0 


and 

be free resolutions of a fc-module M in which all F, and F’. are finitely generated free 
k -modules. Prove that 


n m 

^(-l)'rank(F,-) = ^(-l)- 7 ' rank(Fj). 

i =0 7=0 

The common value is denoted by / (M), and it is called the Euler-Poincare character¬ 
istic of M. 

Hint. Use Schanuel’s lemma. 

10.34 (i) Let C. : 0 —> C n — >■ C n —\ — >■ ■■■—»■ Co —>■ 0 be a complex of finitely generated free 

fc-modules over a commutative ring k. Prove that 


ii ii 

^TC-I)' rank(Q) = ^Tl-I)' rank(H ; -(C.)). 


i=0 


1=0 


(ii) Let 0 —>■ M 1 —> M —> M" —> 0 be an exact sequence of fc-modules. If two of the 
modules have an Euler-Poincare characteristic, prove that the third module does, too, 
and that 

X (M) = X (M') + x(M"). 

10.35 (i) (Barratt-Whitehead). Consider the commutative diagram with exact rows: 


A n 


Pn 


C n 


3 n 


An¬ 


ti—l 


Bn —l 


C„- 


ii—l 


fn 

Sn 

hn 

fn— 1 

Sn— 1 

hn— 1 

\ 



f \ 


! V 


A' 


> B'„ 


Jn 


In 


=5- C 


A’ 


n— 1 


K-i 


C' 

1 
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If each li n is an isomorphism, prove that there is an exact sequence 


0 H !) A' n © Bn j "-4" B' n d, K qn A n _i - © B n _, - B’ 


n— 1 ’ 


where (/„,/„): a„ h* (/„<*„, i'„a„) and - g n : {a' n ,b n ) i-> j n a' n - g„b n . 

(ii) {Mayer-Vietoris). Assume, in the diagram of Theorem 10.44, that every third vertical 
map h* is an isomorphism. Prove that there is an exact sequence 

-* H n { Cl) -► H n ( Ai) © H n ( C.) H n ( A.) -* //„_i(Ci) 


Remark. The Eilenberg-Steenrod axioms characterize homology functors on the category Top 
of all topological spaces and continuous maps. If /?„ : Top -*■ Ab is a sequence of functors, for all 
n > 0, satisfying the long exact sequence, naturality of connecting homomorphisms, h n (/) = h n ( g ) 
whenever / and g are homotopic, Iiq(X) — Z and h n (X) — (0) for all n > 0 when A is a 1-point 
space, and excision , then there are natural isomorphisms h n —> H n for all n. Now excision involves 
an added construction, called relative homology, but in the presence of the other axioms, excision 
can be replaced by exactness of the Mayer-Vietoris sequence. ◄ 


10.5 Derived Functors 

In order to apply the general results about homology, we need a source of short exact 
sequences of complexes, as well as commutative diagrams in which they sit. The idea is 
to replace a module by a (deleted) resolution of it. We then apply either Horn or ®, and 
the resulting homology modules are called Ext or Tor. Given a short exact sequence of 
modules, we shall see that we may replace each of its modules by a resolution and obtain 
a short exact sequence of complexes. 

This section is fairly dry, but it is necessary to establish the existence of homology 
functors. The most useful theorems in this section are Theorem 10.46 (the comparison the¬ 
orem), Proposition 10.50 (which shows that the basic construction is well-defined). Corol¬ 
lary 10.57 (the long exact sequence), and Proposition 10.58 (naturality of the connecting 
homomorphism). 

For those readers who are interested in using Tor (the left derived functors of tensor) and 
Ext (the right derived functors of Horn) immediately, and who are willing to defer looking 
at mazes of arrows, the next theorem gives a set of axioms characterizing the functors Ext". 

Theorem 10.45. Let EXT" : a Mod —> Ab be a sequence of contravariant functors, for 
n > 0, such that 

(i) for every short exact sequence 0— > A —> B —>■ C —> 0, there is a long exact 
sequence and natural connecting homomorphisms 

■ ■ ■ -> ext"(C) -> ext"(B) ext" (A) ext" +1 (C) -*•••; 

(ii) there is a left R-module M with EXT 0 and Homs( , M) naturally equivalent'. 
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(iii) EXT"(P) = {0 } for all projective modules P and all n > 1. 

If Ext"( , M) is another sequence of contravariant functors satisfying these same ax¬ 
ioms, then EXT" is naturally equivalent to Ext"( , M) for all n > 0. 


Remark. There are axiomatic descriptions of the covariant Ext functors and of the Tor 
functors in Exercises 10.44 and 10.45 on page 869. < 


Proof. We proceed by induction on n > 0. The base step is axiom (ii). 
For the inductive step, given a module A, choose a short exact sequence 

0->L—>P->A—>0, 

where P is projective. By axiom (i), there is a diagram with exact rows: 


EXT°(P) 


ext°(L) 


Ao 


EXT 1 (A) 


rp 


r l 


EXT 1 !/ 5 ) 


V 

Hom(P, M) 


t 

> Hom(L, M) 


3o 


y 

Ext 1 (A, M) 


Ext’(P, M), 


where the maps r p and T[ are the isomorphisms given by axiom (ii). This diagram com¬ 
mutes because of the naturality of the equivalence EXT 0 -> Hom( , M ). By axiom (iii), 
Ext^P, M) = {0} and EXT 1 )/*) = {0}. It follows that the maps Ao and do are surjective. 
This is precisely the sort of diagram in Proposition 8.93, and so there exists an isomorphism 
EXT 1 (A) —»■ Ext 1 (A, M) making the augmented diagram commute. 

We may now assume that n > 1, and we look further out in the long exact sequence. 
By axiom (i), there is a diagram with exact rows 


ext" (P)- 5 - ext"(L)-^ ext" +1 (A)- 5 - ext" +1 (P) 

O 

Y V 

Ext"(P, M) -Ext"(L, M) Ext" +1 (A, M) -^ Ext" +1 (P, M), 


where a: EXT "(L) -> Ext" (L,M) is an isomorphism given by the inductive hypoth¬ 
esis. Since n > 1, all four terms involving the projective P are {0}; it follows from 
exactness of the rows that both A„ and 3„ are isomorphisms. Finally, the composite 
: EXT" +1 (A) —> Ext" +1 (A, M ) is an isomorphism. 

It remains to prove that the isomorphisms EXT" (A) —> Ext" (A, M ) constitute a natural 
transformation. It is here the assumed naturality in axiom (i) of the connecting homomor¬ 
phism is used, and this is left for the reader to do. • 

Such slow starting induction proofs, proving results for n = 0 and n — 1 before proving 
the inductive step, arise frequently, and they are called dimension shifting. 
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The rest of this section consists of constructing functors that satisfy axioms (i), (ii), and 
(iii). We prove existence of Ext and Tor using derived functors (there are other proofs as 
well). As these functors are characterized by a short list of properties, we can usually work 
with Ext and Tor without being constantly aware of the details of their construction. 

We begin with a technical definition. 

Definition. If • • • —»■ Pn —> P\ Pq —► A —»■ 0 is a projective resolution of a module 
A, then its deleted projective resolution is the complex 

Pa = • • • -» P 2 -* Pi -* Po -* 0. 

d° 

Similarly, if 0 —> A — > E° —> E 1 —> E 2 —> ■ ■ ■ is an injective resolution of a module 
A , then a deleted injective resolution is the complex 

E a = 0 -> E° -» E l E 2 -> • • • . 

In either case, deleting A loses no information: A = coker d\ in the first case, and 
A = ker d° in the second case. Of course, a deleted resolution is no longer exact: 

Hq(Pa) = ker(P 0 {0})/imz/i = P 0 /imdi = A. 

We know that a module has many presentations, and so the next result is fundamental. 

Theorem 10.46 (Comparison Theorem). Given a map f: A —> A ', consider the dia¬ 
gram 



where the rows are complexes. If each P n in the top row is projective, and if the bottom 
row is exact, then there exists a chain map f: Pa —> P ' A , making the completed diagram 
commute. Moreover, any two such chain maps are homotopic. 

Remark. The dual of the comparison theorem is also true. Now the complexes go off to 
the right, the top row is assumed exact, and every term in the bottom row other than A' is 
injective. < 

Proof, (i) We prove the existence of /„ by induction on n > 0. For the base step n = 0, 
consider the diagram 


Po 
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Since s' is surjective and Pq is projective, there exists a map fo : Pq —> Pq with s' fo = fs. 
For the inductive step, consider the diagram 


Pn +1 Pn — 


fn 


Pn -1 

fn -1 


P' 

n+1 


d n+\ 


PL 


d’ 


PL. 


n — 1 


If we can show that im f n d n+ \ C im d' n+1 , then we will have the diagram 

Pn +1 

fndn +1 

P „+1 —^ imfl, „+t —^0 


and projectivity of P„+i will provide a map /„+j: P n+ \ -> P,' +1 with d' n+l f„+ \ = 
f n d n +\. To check that the inclusion does hold, note that exactness at P’ n of the bot¬ 
tom row of the original diagram gives im c/ ( j +! = l<ert/ ( j, and so it suffices to prove that 

dfi.indn I I = 0- Fut d n f u d n j | fn—idudn+i = 0. 

(ii) We now prove uniqueness of / to homotopy. If h: P. -> P'. is a chain map also 
satisfying s'Iiq — fs, then we construct the terms s„ : P n -> P’ n+ \ of a homotopy s by 
induction on n > 0; that is, we want 


fn h-n — d >t j T S f! — | d n . 


Let us now begin the induction. If we define sq = 0 = ,v_ i, then d[s o + s-ido = 0. On the 
other hand, 

(fo - hoW = foe' - h 0 e' = sf - sf = 0. 

Since s' is a surjection, we have fo — ho = 0, as desired. 

For the inductive step, it suffices to prove that 


for we have a diagram with exact row 


Pn+\ 


p' 
n+2 


u n+2 


i m d. 


hn +1 fn +1 s nd n +1 
/ 

«+2 ^ 


o 
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and projectivity of P n +\ will give a map s„+i: P n+ \ —>■ P' l+2 satisfying the desired equa¬ 
tion. As in the proof of part (i), exactness of the bottom row of the original diagram gives 
im d' n+ r, — ker d' n+v and so it suffices to prove 

^n+\(fn+t fn- 1-1 s ndn- t-l) — 0- 


But 


d n j r \i}l n +1 fn- t-l + = ^n+ lfn+l fn- t-l) ^n-\-\^ndn+\ 

= d n +i(hn+l fn+ 1) (fn fn ^n—\^n)^n+\ 

= ^ n +\ fn+1 fn+ 1) fn fn)dn+ 1> 


and the last term is 0 because h and / are chain maps. • 

We introduce a term to describe the chain map / just constructed. 


Definition. If /: A -> A' is a map of modules, and if Pa and P' v , are deleted projective 
resolutions of A and A', respectively, then a chain map /: Pa V A , 


- Pi 




e 


A 



0 

0 


is said to be over f if 

fe = e'f 0 . 

Thus, the comparison theorem implies, given a homomorphism /: A -» A', that a 
chain map over / always exists between deleted projective resolutions of A and A'; more¬ 
over, such a chain map is unique to homotopy. 

Given a pair of rings R and S and an additive covariant functor T: r Mod —> <,Mod, we 
are now going to construct, for all n e Z, its left derived functors L n T : ^Mod —> ^Mod. 

The definition will be in two parts: first on objects; then on morphisms. 

Choose, once for all, a deleted projective resolution Pa of every module A. As in 
Example 10.34(ix), form the complex 7’P\, and take homology: 

L„T(A) = H n (TP A ). 

This definition is suggested by two examples. First, in algebraic topology, we tensor 
the complex of a triangulated space X to get homology groups H n {X\ G) of X with coef¬ 
ficients in an abelian group G; or, we apply Hom( , G) to get a complex whose homology 
groups are called cohomology groups of X with coefficients in G (of course, this last func¬ 
tor is contravariant). Second, when we considered group extensions, the formulas that 
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arose suggested constructing a free resolution of the trivial module Z, and then applying 
Hom( , K) or ®K to this resolution. 

We now define L n T(f), where /: A -> A' is a homomorphism. By the comparison 
theorem, there is a chain map /: Pa -> P ' A , over /. It follows that T f: TP a -> T P' A , is 
also a chain map, and we define L„ T (/): L n T (A) -> L n T ( A ') by 

L„T(f) = ff B (P/) = (P/)*. 

In more detail, if z € ker Td n , then 

(L n T)f : z + imTd n+ i (-* (Tf n )z + im Td' n+v 

In pictures, look at the chosen projective resolutions: 

-Pi - Pq -> A-0 

/ 

-- p[ —- n; —- a' —-o 

Fill in the a chain map / over /, then apply T to this diagram, and then take the map 
induced by T f in homology. 

Example 10.47. 

If r e Z(R) is a central element in a ring K, and if A is a left P-module, then fi r : A -» A, 
defined by jjL r \ A i-> rA, is an P-map. We call //,. multiplication by r. 


Definition. A functor T : /,’Mod ^Mod, of cither variance, preserves multiplications 
if T (/x r ): TA -* PA is multiplication by r for all r e Z(R). 

Tensor product and Horn preserve multiplications. We claim that if T preserves multi¬ 
plications, then L n T also preserves multiplications; that is, 

L n T(fi r ) = multiplication by r. 

d\ s v . 

Given a projective resolution •••—»■ Pi —> Po —> A —> 0, it is easy to see that // is a 
chain map over \i r . 




0 

0 , 


where fi n : P„ —> P„ is multiplication by r for every n > 0.. Since T preserves multipli¬ 
cations, the terms of the chain map T jl are multiplication by r, and so the induced maps in 
homology are also multiplication by r: 

( T ji)* : z n + im Td „+1 ^ (TFn)z„ + im P d n+ i — rz n + im P d n+ 1, 


where z n e ker Td„. < 
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Proposition 10.48. Given a pair of rings R and S and an additive covariant functor 
T : a* Mod —» sMod, then 

L n T : tfMod -> 5 Mod 
is an additive covariant functor for every n. 

Proof We will prove that L„ T is well-defined on morphisms; it is then routine to check 
that it is a covariant additive functor (remember that H n is a covariant additive functor from 
complexes to modules). 

If h: Pa —*■ P ' A , is another chain map over /, then the comparison theorem says that 
h /; therefore, Th ^2 T f, by Exercise 10.26 on page 828, and so H n (Th ) = //„ (T /), 
by Proposition 10.39. • 

Proposition 10.49. IfT : a Mod —> y Mod is a covariant additive functor, then L n T A = 
{0 } for all negative n and for all A. 

Proof By Exercise 10.21 on page 827, we have L n TA — {0} because, when n is nega¬ 
tive, the nth term of P A is {0}. • 

Definition. If B is a left R -module and T — 0a IP define 

Tor *{,B) = L n T. 


Thus, if 

Pa = • • • -> Pi A Pi Pq -> 0 

is the chosen deleted projective resolution of a module A, then 


Tor*(A, B) = H„(Pa ®r B) = 


ker (d„ 0 Is) 
im(c/„ + i 0 Is)’ 


The domain of Tor* ( , B) is Mods, the category of right /?-modules; its target is Ab, 
the category of abelian groups. For example, if R is commutative, then A 0s B is an 
/^-module, and so the values of Tors ( , B) lie in a Mod. 


Definition. If A is aright /^-module and T = A 0 a , define tor* (A, ) = L„ T. Thus, if 


Qb — ■■■ 


Qi — Q\ —A- Qo 


is the chosen deleted projective resolution of a module B, then 


tor*(A, B) = H n (A ® R Qb) = 


ker(l A ®d n ) 
im(l A 0 c/,1+1)’ 


The domain of tor* (A, ) is a Mod, the category of left A’-modules; its target is Ab, the 
category of abelian groups but, as before, its target may be smaller (if R = Q, for example) 
or larger [if R — ZG, for every Z-module can be viewed as a (trivial) /^-module]. 
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One of the nice theorems of homological algebra is, for all A and B (and for all R and 
ri), that 

Tor,f (A, B) = tor,f (A, B). 

There is a proof using spectral sequences, but there is also an elementary proof due to A. 
Zaks (see Rotman, An Introduction to Homological Algebra, p. 197). 

There are now several points to discuss. The definition of L„T assumes that a choice 
of deleted projection resolution of each module has been made. Does L n T depend on this 
choice? And, once we dispose of this question (the answer is that L n T does not depend on 
the choice), how can we use these functors? 

Assume that new choices Pa of deleted projective resolutions have been made, and let 
us denote the left derived functors arising from these new choices by L„T. 

Proposition 10.50. Given a pair of rings R and S, and an additive covariant functor 
T : Mod —> ^Mod, then, for each n, the functors L n T and L n T are naturally equivalent. 
In particular, for all A, 

( L n T)A={L n T)A , 

and so these modules are independent of the choice of ( deleted ) projective resolution of A. 
Proof. Consider the diagram 



where the top row is the chosen projective resolution of A used to define L„T and the 
bottom is that used to define L n T. By the comparison theorem, there is a chain map 
i: Pa —> Pa over 1 a- Applying T gives a chain map T i: TP a —> TP a over TIa — 1 ta- 
This last chain map induces homomorphisms, one for each n, 

t a = (Ti)*: (L n T)A -* ( L n T)A. 

We now prove that each ta is an isomorphism (thereby proving the last statement in 
the theorem) by constructing its inverse. Turn the preceding diagram upside down, so 
that the chosen projective resolution Pa -* A —*■ 0 is now the bottom row. Again, the 
comparison theorem gives a chain map, say, k : Pa —»■ Pa- Now the composite ki is a 
chain map from Pa to itself over lp A . By the uniqueness statement in the comparison 
theorem, ki ±2 lp A ; similarly, lk ±2 lp . It follows that T(ik) ±2 l 7 -p i and T(ki) ±2 l r p ( . 
Hence, 1 = (T lk)* = (T 0*(T k)* and 1 = (: Tkl )* = (Tk)*(T ()*. Therefore, ta = (TO* 
is an isomorphism. 
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We now prove that the isomorphisms ta constitute a natural equivalence: that is, if 
/: A -> B is a homomorphism, then the following diagram commutes. 


(L„T)A 

L„T(f) 

Y 

(L n T)B 


*(L n T)A 

L„T(f) 

Y 

*{L n T)B 


To evaluate in the clockwise direction, consider 


• • •-S" P% ->• Pq ->- A -s- 0 

1A 

y 

• • •-^ Pi->■ P 0 - A -a- o 

/ 

Y 

-- Qi -- Qo -- B -- 0, 

where the bottom row is some projective resolution of B. The comparison theorem gives 
a chain map Pa —> Pa over /1 a = /. Going counterclockwise, the picture will now 
have the chosen projective resolution of B as its middle row, and we get a chain map 
Pa —*■ Pa over 1 #/ = /. The uniqueness statement in the comparison theorem tells us 
that these two chain maps are homotopic, so that they give the same homomorphism in 
homology. Thus, the appropriate diagram commutes, showing that r: L n T —»• L n T is a 
natural equivalence. • 


Corollary 10.51. The module Tor,f(A, B ) is independent of the choices of projective 
resolutions of A and of B. 

Proof The proposition applies at once to the left derived functors of <E) r H , namely, 
Tor 7 f ( , B ), and to the left derived functors of A(S)r , namely tor,f (A, ). But we have 
already cited the fact that Tor,f (A, B) = tor^(A, B). • 


Corollary 10.52. Let T : ^Mod —» 5 Mod be an additive covariant functor. If P is a 
projective module, then L n T (P) = {0} for all n > 1. 

In particular, if A and P are right R-modules, with P projective, and if B and Q are 
left R-modules, with Q projective, then 

Tor,?(P, B) = {0} and Tor,f (A, Q) = {0} 


for all n > 1 . 
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Proof. Since P is projective, a projective resolution of it is 

€. =-t0-^0->F-^P->0, 

and so the corresponding deleted projective resolution C p has only one nonzero term, 
namely, Co = P. It follows that TCp is a complex having nth term {0} for all n > 1, and 
so L n TP — H n (TC P ) — {0} for all n > 1, by Exercise 10.21 on page 827. • 

We are now going to show that there is a long exact sequence of left derived functors. 
We begin with a useful lemma; it says that if we are given an exact sequence of modules as 
well as a projective resolution of its first and third terms, then we can “fill in the horseshoe”; 
that is, there is a projective resolution of the middle term that fits in the middle. 

Lemma 10.53 (Horseshoe Lemma). Given a diagram 



where the columns are projective resolutions and the row is exact, then there exists a pro¬ 
jective resolution of A and chain maps so that the three columns form an exact sequence 
of complexes. 

Remark. The dual theorem, in which projective resolutions are replaced by injective 
resolutions, is also true. < 

Proof. We show first that there is a projective If and a commutative 3x3 diagram with 
exact columns and rows: 

0 0 0 


1 I I 



0 0 0 
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Define Pq = If © P ( " ; it is projective because both / J ( j and P ( " are projective. Define 
/o : Pq -> Pq © Pq by x' (x', 0), and define po: Pq © Pq ->• Pq by (x, x") i->- x" . It 
is clear that 

0 -* Pq P 0 -A Pq -> 0 

is exact. Since P ( " is projective, there exists a map a : Pq -> A with per = e". Now define 
e : Po —> Aby s\ ( xx") i-> ie'x' + crx" . It is left as a routine exercise that if Kq = kere, 
then there are maps K' {) -> Kq and Kq —>• Kf (where /Tq = ker s' and k' ( j' = ker s"), so 
that the resulting 3x3 diagram commutes. Exactness of the top row is Exercise 10.29 on 
page 828. 

We now prove, by induction on n > 0, that the bottom n rows of the desired diagram 
can be constructed. For the inductive step, assume that the first n steps have been filled in, 
and let K n — kcr(P„ -> P„_i), etc. Now construct the 3x3 diagram whose bottom row 
is 0 —> K' n —> K n -* K” —>• 0, and splice it to the nth diagram, as illustrated next (note 
that the map P„+i -» P„ is defined as the composite P n +\ —»■ K„ -> P„). 


0 K' n+ 1 >■ K n+ 1 >- K'n+t ^ 0 

X \ X 



The columns of the new diagram are exact because, for example, im(P, I+ i —» P„) = K n — 
ker(P„ -* P„_i). • 


Theorem 10.54. If 0 —> A' —4 A —4 A" —>■ 0 is an exact sequence of modules and if 
T : ftMod —> ^Mod is a covariant additive functor, then there is a long exact sequence: 


L n TA' ^ L n TA L -^l L n TA" ^4 


L n _\TA! Ln -^' L n _\TA L "-±l P L n _\TA" ^4 


that ends with 

-* LqTA! -> LqTA -» LqTA" 0 . 

Proof Let P4' and P''„ be the chosen deleted projective resolutions of A' and of A", 
respectively. By Lemma 10.53, there is a deleted projective resolution Px of A with 
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(in the notation of the comparison theorem, j — i is a chain map over /.and q — p is a 
chain map over p). Applying T gives the sequence of complexes 

o. -> tpV —^ tp A tp" a „ o.. 

To see that this sequence is exact , 12 note that each row 0 —»■ P' n -X P n -X- P” —»• 0 is 
a split exact sequence (because P” is projective), and additive functors preserve split short 
exact seqeunces. There is thus a long exact sequence 

-► H n (TP' A ,) H„(TF a ) ( Xi* H n (TP" A „) -X H n -i(TP' A ,) ->•••; 

that is, there is an exact sequence 

-* L„TA’ ( —t* L n T A ( Xi* L n TA" -X L n -\TA! ->•••. 


We do not know that the projective resolution of A given by the horseshoe lemma is the 
resolution originally chosen, and this is why we have L n TA instead of L n TA. But there 
is a natural equivalence r: L„T -* L„T, and so there is an exact sequence 


L„TA’ 


L„TA TA XX* l„TA" 


Ln-lTA' 


The sequence does terminate with {0}, for L-\T — {0} for all negative n. by Proposi¬ 
tion 10.49. 

It remains to show that r A l (Ti)* — L n T(i) and r A l (Tq)* = L n T(p). Now r A l = 
(T /r)*, where k : P 4 —> P i is a chain map over 1 , and so 

tA^Ti)* = (TK)*(.Ti)* = (T(ici))'. 

Both tci and / are chain maps P i —> P i over 1 a, so they are homotopic, by the comparison 
theorem. Therefore, T (,tci ) and Ti are homotopic, and hence they induce the same map 
in homology: (T(/c/))* = (7T)* = L n T(i), and so r A l (Ti)* = L n T(i). We prove 
r^ 1 (Tq)t, = L n T(p) in the same way. • 


Corollary 10.55. IfT : a-M od -> s Mod is a covariant additive functor, then the functor 
LqT is right exact. 

Proof. This follows at once from the theorem. • 

'“The exact sequence of complexes is not split because the sequence of splitting maps need not constitute a 
chain map V" A „ —> Pa ■ 
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Theorem 10.56. 

(i) If an additive covariant functor T : ^Mod —>■ ,$Mod is right exact, then T is natu¬ 
rally equivalent to LqT. 

(ii) The functor ®rB is naturally equivalent to Torg ( , B ). Hence, for all right R- 
modules A, there is an isomorphism 

A® r B = Torg (A, B). 


Proof (i) Let P\ be the chosen deleted projective resolution of A, and let 

0 

be the chosen projective resolution. By definition. 




ker(e ® 1 r ) 

LqT A = ■ A ^ ' = c °ker(c?! ® 1 B ). 

im(di ® \ b ) 

But right exactness of T gives an exact sequence 

TPi TPq TA -* 0. 


Now s <g> 1 induces an isomorphism <x,\ : col<er(c/| <g> 1) -> TA, by the first isomorphism 
theorem; that is, a a : LqT A — > LA. It is left as a routine exercise that a : LqT -» T is a 
natural equivalence. 

(ii) Immediate from part (i), for <E)r B is a covariant right exact additive functor. • 


We have shown that Tor repairs the loss of exactness that may occur after tensoring a 
short exact sequence. 


Corollary 10.57. If 0 -> A' -> A 

then there is a long exact sequence 


Tor 


-* Torf (A, B) 
Torf (A', B) - 


■* A" —> 0 is a short exact sequence of modules, 

» Torf (A", B) 

Torf (A, B) -> Torf (A", B) 

-» A' ® r B A® r B -+ A" ® r B -» 0. 


The next proposition shows that the functors Tor„( , B) satisfy the covariant version of 
Theorem 10.45. 


Proposition 10.58. Given a commutative diagram of modules having exact rows, 
0-A' —A —^ A"->■ 0 

/ 8 h 

. W 

0 — c' —c C" —>■ 0 
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there is, for all n, a commutative diagram with exact rows 


Tor,? (A', B) — U-Tor? (A , B) ^Tor? (A", B) -Tor?^(A', B) 


1.1 1 Qf 1 

Tor?(C', B) J * >Tor?(C, B) %ATor?(C", B) —' Tor'^, (C'. B) 


There is a similar diagram if the first variable is fixed. 

Proof. Given the diagram in the statement, erect the chosen deleted projective resolutions 
on the corners P' A ,, P"„, Q' c ,, and Q" „. We claim that there are deleted projective resolu¬ 
tions P i and Qc, together with chain maps, giving a commutative diagram of complexes 
having exact rows: 



Once this is done, the result will follow from the naturality of the connecting homomor¬ 
phism. As in the inductive proof of Theorem 10.44, it suffices to prove a three-dimensional 
version of the horseshoe lemma. We complete the following commutative diagram, whose 
columns are short exact sequences, and in which P', P", Q 1 , and Q" are projectives and 
N', N ", K\ and K" are kernels. 



to the following commutative diagram, whose rows and columns are short exact sequences. 
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and in which P and Q are projective: 

0 

0 

0 

0 

0 — 

o-^ c' ——c 


> K 

".x K . 

■■■■> K 

n . 

A 

J 

.> N . : .> A 

y 

JZ 

// .... 

J 

O 

A 

„>■ / 

>' • l> 

F : 

> 

F" , 

n . 


£ Y / 

F 

s" 


»0 


>0 


Q! > Q :> Q" > 0 


x- A' — —A - 

j / f V Z' 8 


-x- A' 1 


C" 


Step 1. By the comparison theorem, there are chain maps /: P' A , —x Q' c , over / and 
h : P' A „ -x Q'c„ over h. To simplify notation, we will write F' — fo and F" = Hq. 

Step 2. Define P — P'@P", and insert the usual injection and projection maps P' -x P and 
P -x P", namely, x' i-> (x', 0) and (x' , x") hx x". Similarly, define Q = Q' ®Q", 
and insert the injection and projection maps Q' -> Q and Q -x Q". Of course, the 
sequences 0 —> P' —> P -> P" -» 0 and 0 —> Q' —* Q -» Q" -» 0 are exact. 

Step 3. As in the proof of the horseshoe lemma, define s: P -x A by e: (x', x") i-x is'x' + 
ox", where a : P" -x A satisfies po — e" (such a map a was shown to exist in 
the proof of the horseshoe lemma); indeed, the horseshoe lemma shows that the rear 
face of the diagram commutes. Similarly, define Q —x B by ry. (y',y") i-x 
jr/y' + xy", where r: Q" -x B satisfies qr — r]"\ the front face commutes as well. 

Step 4. Define F: P —> Q by 

F : (x', x") hx (F'x' + yx", F"x"), 

where y: P" —x Q' is to be constructed. It is easy to see that the plane containing 
the P’s and Q’s commutes, no matter how y is defined. 

Step 5. It remains to choose y so that the square with vertices P, Q, C, and A commutes; 
that is, we want fe — r/F. Evaluating each side leads to the equation 

Jisx + fox = jr\ t x + jq yx +xP x . 

Now fie' = jf'e' — jrjF' (because F' is the 0th term in the chain map / over /), 
and so it suffices to find y so that 


je'y = fo - xF". 
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Consider the diagram with exact row: 


P" 


Q' 


jp 


fa-xF" 

q 

c -^ c" 


Now im {fa — r F") C im js' = kerg, for 

qfo - qxF" = f" pa - q” F" = f"e" - q"F" = 0. 

Since P" is projective, there exists a map y: P" Q' making the diagram com¬ 
mute. 

Step 6 . By the 3 x 3 Lemma (Exercise 10.29 on page 828), the rows 0 -> K' -> K —> 
K" -> 0 and 0 —> N' —»■ N —> N" -> 0 are exact, and we let the reader show that 
there are maps on the top face making every square commute. • 

In the next section, we will show how Tor can be computed and used. But, before 
leaving this section, let us give the same treatment to Horn that we have just given to tensor 
product. 

Left derived functors of a functor T are defined so that TV a is a complex with all its 
nonzero terms on the left side; that is, all terms of negative degree are {0}. One consequence 
of this is Corollary 10.55: If T is right exact, then LqT is naturally equivalent to T. As 
the Horn functors are left exact, we are now going to define right derived functors R n T , in 
terms of deleted resolutions C. for which T C. is on the right. We shall see that R°T is 
naturally equivalent to T when T is left exact. 

Given an additive covariant functor T: r Mod —»• ^Mod, where R and S are rings, 
we are now going to construct, for all n e Z, its right derived functors R"T: r Mod —» 
sMod. 

Choose, once for all, a deleted injective resolution E ' of every module A, form the 
complex T E A , and take homology: 

„ „ 4 ker T d n 

R T(A) = H"(TE a ) = -r. 

i mTd n ~ l 

The reader should reread Example 10.34(x) to recall the index raising convention; if the 
indices are lowered, then the definition would be 


„ a kci l U— n 

R T(A) = H- n (TE A ) = ——— . 

im T d- n+ \ 

Notice that we have raised the index on homology modules as well; we write H n instead 
of H- n . 

The definition of R n T(f), where / : A —> A' is a homomorphism, is similar to that 
for left derived functors. By the dual of the comparison theorem, there is a chain map 
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f: E a —»■ E ,A over /, unique to homotopy, and so a unique map R"T(f ): //"f/'E 1 ) -> 
//"(7’E 1 ), namely, (T /„)*, is induced in homology. 

In pictures, look at the chosen injective resolutions: 

0- A' - E'° -^ E’ { -*- 

A 

/ 

O g# A ->• E° 

Fill in the a chain map / over /, then apply T to this diagram, and then take the map 
induced by T f in homology. 

Proposition 10.59. Given a pair of rings R and S and an additive covariant functor 
T : ){Mod —» y Mod, then 

R n T : ftMod -> sMod 
is an additive covariant functor for every n. 

The proof of this proposition, as well as the proofs of other propositions about right 
derived functors soon to be stated, are essentially duals of the proofs we have already 
given, and so they will be omitted. 

Example 10.60. 

If T is a covariant additive functor that preserves multiplications, and if /i r : A —» A 
is multiplication by r, where r e Z(R) is a central element, then R"T also preserves 
multiplications (see Example 10.47). ■* 

Proposition 10.61. IfT : /{Mod — > y Mod is a covariant additive functor, then R n T A = 
{0 } for all negative n and for all A. 

Definition. IfT — Hom A T«, ), define Ext \{B, ) = R"T. Thus, if 



is the chosen deleted injective resolution of a module A , then 

„ „ 4 ker(c/")* 

Ext'' (B, A) = H (Hom A (B, E A )) = . \ * , 

tm (c/' ,_1 )* 

where (c/")*: Hom A (B, E n ) —> Hom A (B, E ,n ) is defined, as usual, by 

(d n )*: / !->■ d n f. 

The domain of R n T , in particular, the domain of Ext'^fB, ), is A \Iod, the category of 
all left R -modules, and its target is Ab, the category of abelian groups. The target may be 
larger; for example, it is A Mod if R is commutative. 

Assume that new choices E 1 of deleted injective resolutions have been made, and let 
us denote the right derived functors arising from these new choices by R n T. 
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Proposition 10.62. Given a pair of rings R and S, and an additive covariant functor 
T : /?Mod —> sMod, then, for each n, the functors R n T and R n T are naturally equivalent. 
In particular, for all A, 

(R n T)A = (R n T)A, 

and so these modules are independent of the choice of ( deleted ) injective resolution of A. 

Corollary 10.63. The module Ext ’^(B, A) is independent of the choice of injective reso¬ 
lution of A. 

Corollary 10.64. Let T : ft Mod — » «, Mod be an additive covariant functor. If E is an 
injective module, then R n T(E ) = {0} for all n > 1. 

In particular, if E is an injective R-module, then Ext'^(B, E) — {0} for all n > 1 and 
all modules B. 


Theorem 10.65. If 0 —»■ A' A A" —» 0 is an exact sequence of modules and if 
T : ftMod —> sMod is a covariant additive functor, then there is a long exact sequence: 


R n T A' ^ R n T A R n TA" 


n+i T s' R + T ' p n + lr pA R R n+i TA" 


R + T A 


that begins with 


R°T A' 


R°TA 


R°TA h 


Corollary 10.66. IfT : ft Mod -> 5 Mod is a covariant additive functor, then the functor 
R°T is left exact. 


Theorem 10.67. 

(i) If an additive covariant functor T : /{Mod —> sMod is left exact, then T is naturally 
equivalent to R°T. 

(ii) If B is a left R-module, the functor HomR(B , ) is naturally equivalent toExt° R (B, ). 
Hence, for all left R-modules A, there is an isomorphism 

Homft(B, A) = Ext ° R {B, A). 

We have shown that Ext repairs the loss of exactness that may occur after applying Horn 
to a short exact sequence. 
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Corollary 10.68. //() —*■ A' -> A —>• A" —> 0 is a short exact sequence of modules, then 
there is a long exact sequence 

0 -» Horn*(B, A') -+ Hom^(B, A) Hom^(B, A") 

-> Extjj(B, A') -* Extjj(B, A) ExtjjfB, A") 

-* Ext|(B, A') -> Ext|(B, A) -> Ext|(B, A") -» •. 

Proposition 10.69. Given a commutative diagram of modules having exact rows, 

0-> A' A A" ->■ 0 

/ 8 h 

o —^ c c q > c" —> 0 

there is, for all n, a commutative diagram with exact rows 


Ext" R (B, A') '' >-Ext"(B, A) 

/» 


-^ExtJfB, A") ?J ">-Ext" +I ( 6 , A') 


Ext" (B, C') 


J* 


■Ext" (B, C ) 


< 7 * 


Ext" (B, C") 


■Ext" +1 (B,C0 


Finally, we discuss derived functors of contravariant functors T. If we define right 
derived functors R n T , in terms of deleted resolutions C. for which T C. is on the right, 
then we start with a deleted projective resolution P A , for then the contravariance of T puts 
TV A on the right . 13 

Given an additive contravariant functor T : r Mod —> ^Mod, where R and S are rings, 
we are now going to construct, for all n e Z, its right derived functors R n T : r Mod —» 

sMod. 

Choose, once for all, a deleted projective resolution Pa of every module A, form the 
complex TP A , and take homology: 

„ „ kerTd„+i 

R T(A) = H (TP A ) = . " +1 . 

tm T d n 

If /: A —> A', define R"T(f): R n T(A') —>■ R n T(A) as we did for left derived func¬ 
tors. By the comparison theorem, there is a chain map /: P A —» P' A , over /, unique to 
homotopy, which induces a map R n T(f) \ H n (TP' A ,) —> II"(T P A ), namely, (T /„)*, in 
homology. 

I3 [f we were interested in left derived functors of a contravariant T, hut we are not, then we would use injective 
resolutions. 
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Example 10.70. 

If T is an additive contravariant functor that preserves multiplications, and if fi r : A A 
is multiplication by r, where r e Z(R) is a central element, then R"T also preserves 
multiplications (see Example 10.47). ■* 


Proposition 10.71. Given a pair of rings R and S and an additive contravariant functor 
T : /{Mod —»■ yMod, then 

R n T : ^Mod -> sMod 

is an additive contravariant functor for every n. 


Proposition 10.72. If T : a-M od -» 5 Mod is a contravariant additive functor, then 
R"T A — {0} for all negative n and for all A. 


Definition. If T — Horn a ( , C), define ext^( , C) = R n T. Thus, if 

-» P 2 A P\ Pq -> 0 

is the chosen deleted projective resolution of a module A , then 

ext''(A, C) = ff"(Hom*(P A , C) = , 

imW„) 

where (d n )* : Horn r(P,[, C ) —> HomA(P„, C) is defined, as usual, by 

(d n )*: f i > f d n . 

The same phenomenon that holds for Tor holds for Ext: for all A and C (and for all R 
and «), 

Ext^(A,C) = ext^(A,C). 

The same proof that shows that Tor is independent of the variable resolved also works 
for Ext (see Rotman, An Introduction to Homological Algebra, p. 197). In light of this 
theorem, we will dispense with the two notations for Ext. 

Assume that new choices P A of deleted projective resolutions have been made, and let 
us denote the right derived functors arising from these new choices by R"T. 

Proposition 10.73. Given a pair of rings R and S, and an additive contravariant functor 
T : ^Mod -* ^Mod, then, for each n, the functors R"T and R n T are naturally equivalent. 
In particular, for all A, 

(R"T)A = (R"T)A, 


and so these modules are independent of the choice of ( deleted ) projective resolution of A. 
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Corollary 10.74. The module Ext'^(A, C) is independent of the choice of projective res¬ 
olution of A. 

Corollary 10.75. Let T : a Mod —> s 5 lot I be an additive contravariant functor. If P is a 
projective module, then R n T(P ) = {0} for all n > 1. 

In particular, if P is a projective R-module, then Ext'^(P, B) — {0 } for all n > 1 and 
all modules B. 


Theorem 10.76. If 0 —» A' A — A" —■> 0 is an exact sequence of modules 
and ifT : a Mod —> a Mod is a contravariant additive functor, then there is a long exact 
sequence 


R n T A 


R n TA R n TA 


n+ WA" R "Z1 P R n+1 TA P»+l 


R n+ T A 


R n+l T A' 


an+l 


that begins with 


0 R^TA" -> R°TA -> R°TA' -+■■■. 


Corollary 10.77. If T : a- Mod —> 5 Mod is a contravariant additive functor, then the 
functor R°T is left exact. 


Theorem 10.78. 

(i) If an additive contravariant functor T : a Mod -> ,$Mod is left exact, then T is 
naturally equivalent to R () T. 

(ii) IfC is a left R-module, t he functor Horn , C ) is naturally equivalent foExt^( , C). 

Hence, for all left R-modules A, there is an isomorphism 

Hom*(A,C) =Ext^(A,C). 

We have shown that Ext repairs the loss of exactness that may occur after applying Horn 
to a short exact sequence. 

Corollary 10.79. IfO —> A 1 —> A —> A" -> 0 is a short exact sequence of modules, then 
there is a long exact sequence 

0 -» Horn*(A", C) -* Hom s (A, C) Horn*(A', C) 

Ext];(A", C) -* Ext l R (A, C ) -* Ext];(A', C) 

-> Ext|(A", C) -> Ext|(A, C) -* Ext|(A', C) -> • • • . 
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Proposition 10.80. Given a commutative diagram of modules having exact rows. 


0 ->■ A*’-- - A ->■ A' 1 


C 


C 


C" 


there is, for all n, a commutative diagram with exact rows 


Ext" R (A", B) -^-Ext" (A, B) 
h* 


■ Ext n R (A', B ) —^ Ext'; +1 (A", B ) 


Ext" (C", B) 


■ Ext''(C, B) 1L Ext" (C', B) Ext" +l (C", B) 


Remark. When T is a covariant functor, then we call the ingredients of L n T chains, 
cycles, boundaries, and homology. When T is contravariant, we often add the prefix “co,” 
and the ingredients of R n T are usually called cochains, cocycles, coboundaries, and co¬ 
homology. Unfortunately, this clear distinction is blurred because the Horn functor is con¬ 
travariant in one variable but covariant in the other. In spite of this, we usually use the “co” 
prefix for the derived functors Ext" of Horn. ◄ 

Derived functors are one way to construct functors like Ext and Tor. In the next section, 
along with more properties of Ext and Tor, we shall describe another construction of Ext, 
due to N. Yoneda, and another construction of Tor, due to S. Mac Lane. Indeed, derived 
functors will rarely be mentioned in the sequel. 


Exercises 

10.36 If r: F -*■ G is a natural transformation between additive functors, prove that r gives chain 
maps tq : F C —> GC for every complex C. If r is a natural equivalence, prove that FC = 
GC. 

10.37 (i) Let T : ^Mod —*■ ^Mod be an exact additive functor, where R and S are rings, and 

suppose that P projective implies T P projective. If B is a left R -module and Pg is a 
deleted projective resolution of B, prove that TPjg is a deleted projective resolution of 
TB. 

(ii) Let A be an /?-algebra, where R is a commutative ring, which is flat as an /{-module. 
Prove that if B is an /I-module (and hence an I?-module), then 

A Tor* (B, C) = Tor^(6, A ® R C) 

for all fLmodules C and all n > 0. 



852 


Homology Ch. 10 


10.38 Let R be a semisimple ring. 

(i) Prove, for all n > 1, that Tor *(A, B) = {0) for all right ^-modules A and all left 

-modules B. 

Hint. If R is semisimple, then every (left or right) B-module is projective. 

(ii) Prove, for all n > 1, that Ext'jj(A, B ) = (0) for all left ^-modules A and B. 

10.39 If R is a PID, prove, for all n > 2, that Tor *(A, B) = {0) = Ext'^(A, B) for all ^-modules A 
and B. 

Hint. Use Theorem 9.8. 

10.40 Let R be a domain and let A be an B-module. 

(i) Prove that if the multiplication g r : A —> A is an injection for all r ^ 0, then A is 
torsion-free. 

(ii) Prove that if the multiplication \i r : A —*■ A is a surjection for all r ^ 0, then A is 
divisible. 

(iii) Prove that if the multiplication g r : A —>■ A is an isomorphism for all r ^ 0, then A is 
a vector space over Q, where Q = Frac( R). 

Hint. A module A is a vector space over Q if and only if it is torsion-free and divisible. 

(iv) If either C or A is a vector space over Q, prove that Tor*(C, A) and Ext'^(C, A) are 
also vector spaces over Q. 

10.41 Let R be a domain and let Q — Frac(B). 

(i) If r € R is nonzero and A is an B-module for which rA = {0}; that is, ra = 0 for all 
a € A, prove that Ext n R (Q, A) = {0} = Tor^(<2, A) for all n > 0. 

Hint. If V is a vector space over Q for which r V = {0), then V — (0). 

(ii) Prove that Ext^(V, A) = {0} = Tor* (V, A) for all n > 0 whenever V is a vector space 
over Q and A is an /^-module for which r A — {0} for some nonzero r e R. 

10.42 Let A and B be l?-modules. For /: A' —> B, where A' is a submodule of A, define its 
obstruction to be 3(/), where 3 : Hom^(A / , B) —»■ Ext^(A/A', B) is the connecting homo¬ 
morphism. Prove that / can be extended to a homomorphism /: A —>■ B if and only if its 
obstruction is 0. 

10.43 If T : Ab -► Ab is a left exact functor, prove that LqT is an exact functor. Conclude, for any 
abelian group B, that Lq Hom(B, ) is not naturally equivalent to Hom(B, ). 


10.6 Ext and Tor 


We now examine Ext and Tor more closely. As we said in the last section, all properties of 
these functors should follow from versions of Theorem 10.45, the axioms characterizing 
them (see Exercises 10.44 and 10.45 on page 869); in particular, their construction as 
derived functors need not be used. 

We begin by showing that Ext behaves like Horn with respect to sums and products. 



Sec. 10.6 Ext and Tor 


853 


Proposition 10.81. If {A k : k e K} is a family of modules, then there are natural 
isomorphisms, for all n, 

Ex^ ( A k, «)=n Ext '^’ «»• 

keK ksK 

Proof. The proof is by dimension shifting; that is, by induction on n > 0. The base 
step is Theorem 7.33, for Ext°( , B) is naturally equivalent to the contravariant functor 
Hom( , B). 

For the inductive step, choose, for each keK, a short exact sequence 

0 -* L k -* P k -> A k -+ 0, 
where P k is projective. There is an exact sequence 

0 -» J 2 L k "► J 2 p k -* J 2 Ak -+ 0 , 

k k k 

and P k is projective, for every sum of projectives is projective. There is a commutative 
diagram with exact rows; 

Hornet P k , B) ^Hom(^L A -,B) t Ext^ A k , B) -^Ext 1 ^ P k , B) 

r a 

, i 

Fl Hom(7\, B) [I HomlL^, B) > f] Ext 1 (A*, B) -*■ [1 Ext'(£ p k, B), 

where the maps in the bottom row are just the usual induced maps in each coordinate, and 
the maps r and er are the isomorphisms given by Theorem 7.33. Now Ext 1 P k , B) = 
{0} = P| Ext 1 (P k . B), because P k and each P k are projective, so that the maps 3 and 
d are surjective. This is precisely the sort of diagram in Proposition 8.93, and so there 
exists an isomorphism Ext 1 (ff A k , B) —> P[ Ext 1 (A k . B) making the augmented diagram 
commute. 

We may now assume that n > 1, and we look further out in the long exact sequence. 
There is a commutative diagram 

Ext"(£P*, B) -»Ex1»(£L*, B) -l Ext" +1 (£ A k , B) Ext" +1 (£ P k , B) 

O 

" , V 

nExt"(P,, B) P[Ext" (L k , B) -1 p[Ext" +1 (A£, B) ->■ p[Ext” +1 (P*, B), 

where o : Ext"(^ L k , /i) —> |~[ Ext "(Lk, B) is an isomorphism that exists by the induc¬ 
tive hypothesis. Since n > 1, all four Ext’s whose first variable is projective are {0}; it 
follows from exactness of the rows that both 3 and d are isomorphisms. Finally, the com¬ 
posite dod~ l : Ext' ,+1 A k , B) -> p[Ext" +1 (A i t, B) is an isomorphism, as desired. • 

There is a dual result in the second variable. 
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Proposition 10.82. If { B k : k e A'} is a family of modules, then there are natural 
isomorphisms, for all n, 

Ext" (4, Y\ «,) = Y\ Ext''(A, If). 
keK keK 

Proof. The proof is by dimension shifting. The base step is Theorem 7.32, for Ext 0 (A, ) 
is naturally equivalent to the covariant functor Hom(A, ). 

For the inductive step, choose, for each k e K, a short exact sequence 

0 -> B k -* E k N k -* 0, 

where E k is injective. There is an exact sequence 

o-> -► ri £ * n** °> 

k k k 

and E k is injective, for every product of injectives is injective, by Proposition 7.66. 
There is a commutative diagram with exact rows: 

Hom(A, n E k ) ->■ Hom(A, f] Nk) 4- Ext 1 (A, B k ) +>- Ext 1 (A, [~[ E k ) 

x a 

w Y V 

II Horn (A, E k ) n Hom(A, N k ) -i- f] Ext 1 (A, B k ) > f] Ext 1 (A, E k ), 

where the maps in the bottom row are just the usual induced maps in each coordinate, and 
the maps r and a are the isomorphisms given by Theorem 7.32. The proof now finishes as 
that of Proposition 10.81. • 

It follows that Ext" commutes with finite direct sums in either variable. 

Remark. These last two propositions cannot be generalized by replacing sums by direct 
limits or products by inverse limits; the reason is that direct limits of projectives need not 
be projective and inverse limits of injectives need not be injective. < 

When the ring R is noncommutative, Horn/A A. B) is an abelian group, but it need not 
be an A-module. 

Proposition 10.83. 

(i) Let r e Z(R) be a central element, and let A and B be left R-modules. If p r : B —> 
B is multiplication by r, then the induced map 

p* : Ext'^(A, B) -+ Ext'^fA, B) 

is also multiplication by r. A similar statement is true in the other variable. 

(ii) If R is a commutative ring, then Ext^(A, B) is an R-module. 

Proof, (i) This follows at once from Example 10.47. 

(ii) This follows from part (i) if we define scalar multiplication by r to be pf. • 
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Example 10.84. 

(i) We show, for every abelian group B, that 

Extz(I„, B) = B/nB. 

There is an exact sequence 

where ji n is multiplication by n. Applying Hom( , B) gives exactness of 

Hom(Z, B) -%■ Hom(Z, B) -> Ext'(I„, B) -> Ext'(Z, B). 

Now Ext 1 (Z, B) — { 0} because Z is projective. Moreover, n* is also multiplication by n, 
while Hom(Z, B) = B. More precisely, Hom(Z, ) is naturally equivalent to the identity 
functor on Ab, and so there is a commutative diagram with exact rows 


B 


Hom(Z, B) 




B 


■ Hom(Z, B) 


B/nB 


■ Ext 1 (I„, B) 


By Proposition 8.93, there is an isomorphism B/nB = Ext 1 (I„, B). 

(ii) We can now compute Ext^(A, B) whenever A and B are finitely generated abelian 
groups. By the fundamental theorem, both A and B are direct sums of cyclic groups. Since 
Ext commutes with finite direct sums, Ext^(A, B) is the direct sum of groups Ext^(C, D ), 
where C and D are cyclic. We may assume that C is finite, otherwise it is projective, and 
Ext 1 (C. D) = {0}. This calculation can be completed using part (i) and Exercise 5.5 on 
page 267, which says that if D is a cyclic group of finite order m, then D/nD is a cyclic 
group of order d, where d = (m, n) is their gcd. 4 


We now give the obvious definition analogous to extensions of groups. 


Definition. Given /^-modules C and A, an extension of A by C is a short exact sequence 

o -> a - 4 s 4 c -> o . 

An extension is split if there exists an A 1 -map s : C —> B with ps = 1 c- 

Of course, if0->A-*fi-*C-*0isa split extension, then B = A © C. 

Whenever meeting a homology group, we must ask what it means for it to be zero, for its 
elements can then be construed as being obstructions. For example, factor sets explain why 
a group extension may not be split. In this section, we will show that ExtJjCC, A) = {0} if 
and only if every extension of A by C splits. Thus, nonzero elements of any Extjj(C', A') 
describe nonsplit extensions (indeed, this result is why Ext is so called). 

We begin with a definition motivated by Proposition 10.17. 
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Definition. Given modules C and A, two extensions £ : 0 —>■ A —>■ B —> C —»■ 0 and 
f' : 0 -* A -* B' —> C —> 0 of A by C are equivalent if there exists a map tp: B B' 
making the following diagram commute: 



We denote the equivalence class of an extension f by [£], and we define 

e{C, A) = {[£] : f is an extension of A by C}. 

If two extensions are equivalent, then the five lemma (Exercise 8.52 on page 604) shows 
that the map <p must be an isomorphism; it follows that equivalence is, indeed, an equiva¬ 
lence relation (for we can now prove symmetry). However, the converse is false: There can 
be inequivalent extensions having isomorphic middle terms, as we saw in Example 10.18 
(all groups in this example are abelian, and so we may view it as an example of Z-modules). 

Proposition 10.85. If ExtJj(C, A) = {0}, then every extension 

0-+A-^B-Uc-*0 

is split. 

Proof. Apply the functor Hom(C, ) to the extension to obtain an exact sequence 

Hom(C, B) -A- Hom(C, C) Ext x (C, A). 

By hypothesis, Ext x (C, A) — {0}, so that p. t is surjective. Hence, there exists s e 
Hom(C, B) with 1 q = p js)\ that is, lp = ps, and this says that the extension splits. • 

Corollary 10.86. An R-module P is projective if and only if Ext^(P, B ) = {0 }for every 
R-module B. 

Proof. If P is projective, then Ext[, ( l\ B ) = {0} for all B, by Corollary 10.52. Con¬ 
versely, if ExtJj(P, B) — {0} for all B. then every exact sequence 0 —> B ^ X P -x 0 
splits, by Proposition 10.85, and so P is projective, by Proposition 7.54. • 

We are going to prove the converse of Proposition 10.85 by showing that there is a 
bijection i jr : e(C, A) —> Ext 1 (C, A). Let us construct the function if. 

Given an extension £ : 0 -> ,4 /i —> C —> 0 and a projective resolution of C, form 
the diagram 
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By the comparison theorem (Theorem 10.46), we may fill in dotted arrows to obtain a 
commutative diagram. In particular, there is a map a: P[ -» A with ach = 0; that is, 
(a) = 0, so that a e ker di, is a cocycle. The comparison theorem also says that any two 
fillings in of the diagram are homotopic; thus, if a': P\ —»■ A is part of a second filling in, 
there are maps so and si with a' — a — 0 • sj + sq^i = so^E 



Thus, a 1 — a e im d*. and so the homology class a + imd* e Ext l (C, A) is well-defined. 
We leave as an exercise for the reader that equivalent extensions § and f' determine the 
same element of Ext. Thus, 


f: e(C,A) Ext 1 (C, A), 


given by 


^(K]) = a+ imrff, 


is a well-defined function. In order to prove that xfr is a bijection, we first analyze the 
diagram containing the map a. 


I 8 

Lemma 10.87. Let S : 0 —> X\ —> Xq —-> C —> 0 be an extension of a module X \ 

by a module C. Given a module A, consider the diagram 


E : 0 


X! *0 c 


a 

V 


A 


r 

c 


0 


(i) There exists a commutative diagram with exact rows completing the given diagram: 


0 

0 


Xi — U- X Q —C->■ 0. 




0 


(ii) Any two bottom rows of completed diagrams are equivalent extensions. 


Proof, (i) We define B as the pushout of j and a. Thus, if 

S = { (ax \, —jx i) e A © Xq : x\ e X \}, 
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define B = (A © Xq)/S, 

i: a i-> a + S, ft: xo i-> xo + S, and rj: (a, xo ) + S i->- sx o- 

That i] is well-defined, that the diagram commutes, and that the bottom row is exact are 
left for the reader to check. 

(ii) Let 

0-»• Xi — U- X 0 —?-*• C -*- 0. 

a p' lc 

i> rf v 

0->■ A ->■ B' ->■ C-^ 0 

be a second completion of the diagram. Define f: A® X o -* B' by 

/: (a, xo) i—> i'a + //xo. 

We claim that / is surjective. If b' e B'. then rj'b' e C, and so there is xo e Xo with 
exo = rj'b'. Commutativity gives rj'fi'x o = exo = rj'b'. Hence, b' — fi'xo € ker rj' — im i\ 
and so there is a e A with i'a — b' — P'xq- Therefore, b’ = i'a + P'xq e im /, as desired. 

We now show that ker / = S. If (axi, — 7 x 1 ) e S, then f(ax\, —jx 1 ) = i'ax 1 — 
fi'jx 1 = 0, by commutativity of the first square of the diagram, and so S c ker /. For the 
reverse inclusion, let (a, xo) e ker /, so that i'a + P'x 0 = 0. Commutativity of the second 
square gives exo = rj'fi'x 0 = —rj'ia = 0. Hence, xo e kere — imj, so there is xi e Xj 
with 7 x 1 = xo- Thus, i'a — —fi'xo = —fi'jx 1 = — i'axi. Since i' is injective, we have 
a — — axi; replacing x\ by y\ = —xi. We have (a, xo) = (ctyi, — jyi) € S, as desired. 
Finally, define <p : B —>■ B' by 

<p : (a, xo) + S i-> f(a, xo) = i'a + fi'xo 

[(p is well-defined because B = (A © Xq)/S and S = ker/]. To show commutativity of 
the diagram 


0-^ A —B —^ C -^0 



we use the definitions of the maps i and rj in part (i). For the first square, if a e A, then 
(pia — < 7 >((a, 0) + S) = i'a. For the second square, 

rj'cp: (a, xo) + S i-> ri'(i'a + fi'xo) 

= n'P'x 0 

= exo 

= r)((a, xq) + S ). 


Therefore, the two bottom rows are equivalent extensions. 
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Notation. Denote the extension of A by C just constructed by 

»3. 

The dual result is true; it is related to the construction of Ext using injective resolutions 
of the second variable A. 

Lemma 10.88. Let A and To be modules, and let E r : 0 —> A — > To —> Ti — > 0 be an 
extension of A by Y\. Given a module C, consider the diagram 


A C 



(i) There exists a commutative diagram with exact rows completing the given diagram: 

E'y 

E' 

(ii) Any two top rows of completed diagrams are equivalent extensions. 

Proof. Dual to that of Lemma 10.87; in particular, construct the top row using the pull¬ 
back of y and p. • 

Notation. Denote the extension of A by C just constructed by 

BV 

Theorem 10.89. The function f : e(C. A) -> Ext’(C, A) is a bijection. 

Proof. We construct an inverse 0 : Ext 1 (C. A) -» e(C, A) for f. Choose a projective 
resolution of C, so there is an exact sequence 

-> Pi El A Pq C -»■ 0, 

and choose a 1-cocycle a: P\ A. Since a is a cocycle, we have 0 = d^ia) — ad2, so 
that a induces a homomorphism a' \ Pj/imab -»■ A [if X] e P\, then a 1 : x\ + imd 2 
a(xi)]. Let E denote the extension 

S : 0 Pi / im d 2 Pq -> C -* 0. 
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As in the lemma, there is a commutative diagram with exact rows: 


0-> Pi/imrfe->- Pq -> C-> 0 


0 


A 


B 


T 

>■ C -^0 


Define 6 : Ext 1 (C, A) —> e(C, A) using the construction in the lemma: 

6(a + imt/*) = [a E], 

We begin by showing that 0 is independent of the choice of cocycle a. Suppose that 
£ : P\ -> ,4 is another cocycle. Now a and f are parts of a chain map that the comparison 
theorem says are homotopic. Hence, there is a map s : Pq —»• A with £ = a + sd\. But it 
is easy to see that the following diagram commutes: 


Pi 


dl 


>Pl 


di 


Po 


>c 


^0 




P+is 


lc 


B 


c 


As the bottom row has not changed, we have [a'E] = [£'E], 

It remains to show that the composites i frO and Oijj are identities. If a + im d* e 
Ext 1 (C, A), then 9(a + im d*) is the bottom row of the diagram 


0-> Pi/imd 2 ->- Pq -> C-> 0 


0 


a' 

V 


^ A 


I 

B 


l le 

>■ C 


^0 


and \[r0(a + im d*) is the homology class of a cocycle fitting this diagram. Clearly, a is 
such a cocycle; and so i jr6 is the identity. For the other composite, start with an extension 
§, and then imbed it as the bottom row of a diagram 


do d\ 

Pi —Pi-Po-> C-*- 0 


V V , V 

0- A ->- B 


r 

c —^0 


Both £ and «' E are bottom rows of such a diagram, and so Lemma 10.87(ii) shows that 

m = [a'3], . 


We can now prove the converse of Proposition 10.85. 
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Corollary 10.90. For any modules C and A, every extension of A by C is split if and only 
ifExt\(C,A) = {0}. 

Proof. Since Ext 1 (C. A) — {0}, we have | e(C, A)| = 1, so there is only one equivalence 
class of extensions of A by C. But the split extension always exists, and so every extension 
is equivalent to the split extension; that is, every extension of A by C splits. • 

Example 10.91. 

If p is a prime, then Ext^(I p , I p ) = 1^, as we saw in Example 10.84(i). On the other hand, 
it follows from Theorem 10.89 that there are p equivalence classes of extensions 0 —» 
I p —»• B —»■ I p —> 0. But |.B | = p 2 , so there are only two choices for B to isomorphism: 
B = I p 2 or B = Ip © Ip. Of course, this is consistent with Example 10.18. ◄ 

Here is a minor application of Ext. 

Proposition 10.92. 

(i) If F is a torsion-free abelian group and T is an abelian group of bounded order 
(that is, nT — {0} for some positive integer n), then Ext 1 (F , T ) = {0}. 

(ii) Let G be an abelian group. If the torsion subgroup tG ofG is of bounded order, then 
tG is a direct summand ofG. 


Proof, (i) Since F is torsion-free, it is a flat Z-module, by Corollary 9.6, so that exactness 
of 0 —> Z -> Q gives exactness of 0 —> Z ® F —*■ Q <g> F. Thus, F = Z <g> F can be 
imbedded in a vector space V over Q, namely, V = Q 0 F. Applying the contravariant 
functor Hom( , T ) to 0 —» F -> V —> V/F 0 gives an exact sequence 

Ext^V, T) -» Ext 1 (F, T) -> Ext 2 (y/E, T). 

Now the last term is {0}, by Exercise 10.39 on page 852, and Ext 1 (V, T) is (torsion-free) 
divisible, by Example 10.70, so that Ext 1 ( F. T) is divisible. Since T has bounded order. 
Exercise 10.41 on page 852 gives Ext^E, T) = {0}. 

(ii) To prove that the extension 0 t G G G / 1 G 0 splits, it suffices to prove 
that Ext 1 (G / tG. tG) — {0}. Since G/tG is torsion-free, this follows from part (i) and 
Corollary 10.90. • 

The torsion subgroup of a group may not be a direct summand; the following proof by 
homology is quite different from that of Exercise 9. l(iii) on page 663. 

Proposition 10.93. There exists an abelian group G whose torsion subgroup is not a 
direct summand of G\ in fact, we may choose tG — p , where the sum is over all 
primes p. 
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Proof. It suffices to prove that Ext 1 (Q, ff p I p ) ^ 0, for this will give a nonsplit extension 
0 -> ff p Ip —> G -> Q —> 0; moreover, since Q is torsion-free, it follows that ff p l p = 
tG. 

Consider the exact sequence 0 -» J2 P l P Up h D B y Exercise 9.6 

on page 663, we know that D is divisible (in truth, D A R: it is a torsion-free divisible 
group, hence it is a vector space over Q, by Proposition 9.23, and we check that dim(Z>) = 
continuum, which is the dimension of R as a vector space over Q). There is an exact 
sequence 

Hom(Q, ]~[ I p ) -> Hom(Q, D) -U Ext 1 (Q, ^ I p ) -* Ext'(Q, f~[lp). 
p p 

But 3 is an isomorphism: Ext 1 (Q, ]~[ I p ) = IlExt^Q, I p ) = {0}, by Proposition 10.81 
and Proposition 10.92, and Hom(Q, \\ p I ; ,) = ]~[Hom(Q, l p ) = {0}, by Theorem 7.33. 
Since Hom(Q, D) f {0}, we have Ext 1 (Q, ff p I p ) {0}. • 


Remark. We can prove that a torsion abelian group T has the property that it is a direct 
summand of any group containing it as its torsion subgroup if and only if T = B © l). 
where B has bounded order and D is divisible. ■* 

If £ is a set and i//: £ -> G is a bijection to a group G, then there is a unique group 
structure on £ that makes it a group and f an isomorphism [if e, e' e £. then e — f _1 (g) 
and c' = define ee' = ir~ 1 (gg')]. In particular. Theorem 10.89 implies that there 

is a group structure on e(C, A); here are the necessary definitions. 

Define the diagonal map A c : C -> C © C by A c : c (c, c), and define the codiag¬ 
onal map Va ■ A © A -> A by Va : («i, 02 ) «i + « 2 - Note that if f, f : C —> A is a 
homomorphism, then the composite Va(/ © / ')A maps C->C©C—>A©A^-A. 
It is easy to check that Va(/ © f')A = / + /\ so that this formula describes addition 
in Hom(C, A). Now Ext is a generalized Horn, and so we mimic this definition to define 
addition in e(C. A). 

Iff:0->A->B->C—>-0 and £ ’ : 0 -» A' —»■ B' —»■ C’ -> 0 are extensions, then 
their direct sum is the extension 

£ © £' : 0 -» A © A' -*£©£' C © C' —>0. 

The Baer sum [f] + [§'] is defined to the equivalence class [Va(| © ( we have 

already defined a a and S 'y). To show that Baer sum is well-defined, we first show that 
a(E'y) is equivalent to (a S')y. We then show that e(C , A) is a group under this operation 
by showing that f ([§] + [§']) = V / ([Va(? © $') Ac])- The identity element is the class of 
the split extension, and the inverse of [f ] is [(— 1a)§]- 

This description of Ext 1 has been generalized by N. Yoneda to a description of Ext" for 
all n. Elements of Yoneda’s Ext"(C, A) are certain equivalence classes of exact sequences 

0 -» A B { -*- > B n -+C 0, 
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and we add them by a generalized Baer sum (see Mac Lane, Homology, pages 82-87). 
Thus, there is a construction of Ext that does not use derived functors. Indeed, we can 
construct Ext" without using projectives or injectives. 

In their investigation of finite-dimensional algebras, M. Auslander and I. Reiten intro¬ 
duced the following notion. 

Definition. An exact sequence of left R-modules, over any ring R, 

is almost split if it is not split, if both N and M are indecomposable modules, and for 
for all R-modules C and every /Amap <p : C —¥ M that is not an isomorphism, the exact 
sequence aq> is split. 

Another way to say this is that [3] is a nonzero element of Ext l R (N, M), where N and 
M are indecomposable, and [ 3] e ker <p* for every <p: C -> M that is not an isomorphism. 
Auslander and Reiten proved that for every indecomposable module M that is not projec¬ 
tive, there exists an almost split exact sequence ending with M. Dually, they proved that 
for every indecomposable module N that is not injective, there exists an almost split exact 
sequence beginning with N. 

It is now Tor’s turn. We begin with a result that has no analog for Ext. 

Theorem 10.94. If R is a ring, A is a right R-module, and B is a left R-module, then 

Tor*(A, B) = Tor*° P (B, A) 
for all n > 0, where R op is the opposite ring of R. 

Proof. Recall Proposition 8.11: Every left R-module is a right R op -module, and every 
right R-module is a left R op -module. Choose a deleted projective resolution Pa of A. It is 
easy to see that t : Pa <S >r B — > B op Pa is a chain map of Z-complexes, where 

t n '■ Pn ®R B —»■ B op P n 


is given by 


t n : x n <g> b i-> b <g> x n 


Since each t„ is an isomorphism of abelian groups (its inverse is b ® x„ i->- x n ® b ), the 
chain map t is an isomorphism of complexes. By Exercise 10.22 on page 827, 


Tor*(A, B) = H n { P A ®r B) = H n (B P a ) 


for all n. But Pa, viewed as a complex of left R op -modules, is a deleted projective resolu¬ 
tion of A qua left R op -module, and so H n (B 0 r«p Pa) = Tor*° P (R, A). • 

In light of this result, theorems about Tor(A, ) will yield results about Tor( , R); we 
will not have to say “similarly in the other variable.” 
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Corollary 10.95. If R is a commutative ring and A and B are R-modules, then for all 
n > 0, 

Tor*(4, B) = Tor* (B, A). 

We know that Tor,, vanishes on projectives; we now show that it vanishes on flat mod¬ 
ules. 

Proposition 10.96. A right R-module F is flat if and only if Tor* ( A. M) — {0 } for all 
n > 1 and every left R-module M. 

Proof. Let 0 —> N P -* M -» 0 be exact, where P is projective. There is an exact 
sequence 

Tori (F, P) -> Tori (F, M)-±F®N^%F®P. 

Now Ton(F, P) — {0}, because P is projective, so that Tori (F, M) = ker(l ® ;). Since 
F is flat, however, ker(l <g> i) = {0}, and so Tori ( F. M) = {0}. The result for all n > 1 
follows by dimension shifting. 

For the converse, 0 -> A —E B exact implies exactness of 

0 = Tori(F, B/A) —> F 0 A F <g> B. 

Hence, 1 <g) i is an injection, and so F is flat. (Notice that we have only assumed the 
vanishing of Tori in proving the converse.) • 

Proposition 10.97. If {If : k E K} is a family of left R-modules, then there are natural 
isomorphisms, for all n, 

Tor«{A,J2 B k) = Y,Tor«(A,B k ). 
ksK ksK 

There is also an isomorphism if the sum is in the first variable. 

Proof. The proof is by dimension shifting. The base step is Theorem 8.87, for Toro(A, ) 
is naturally equivalent to 4® . 

For the inductive step, choose, for each k e K, a short exact sequence 

0 -> AT -* P k If: -* 0, 
where If is projective. There is an exact sequence 

Bk °- 

k k k 

and k P k is projective, for every sum of projectives is projective. There is a commutative 
diagram with exact rows: 

Ton (A, E Pk) -- Ton (A, E B k ) 4^ A ® E N% -- A 0 E Pk 

x a 

v Y 

E Tori (A, P k ) -- E Tori (A, B k ) ' - E A ® N k -*- E A ® P k . 
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where the maps in the bottom row are just the usual induced maps in each coordinate, and 
the maps r and o are the isomorphisms given by Theorem 8.87. The proof is completed 
by dimension shifting. • 

Example 10.98. 

(i) We show, for every abelian group B , that 

Torf (I„, B) = B[n] = {b € B : nb = 0}. 

There is an exact sequence 


0 -> Z Z -> I„ 0, 

where pt n is multiplication by n. Applying ®B gives exactness of 

Tori (Z, B ) —> Tori (In , B) —»■ Z ® B Z ® B. 

Now Tori (Z, B) — {0}, because Z is projective. Moreover, 1 ® p, n is also multiplication 
by n, while Z ® B — B. More precisely, Z® is naturally equivalent to the identity functor 
on Ab, and so there is a commutative diagram with exact rows 


0 

0 


B[n] 


B 


M n 


B 






Tori(I w , B) 


t® fi„ 

->■ Z ® B -»■ Z ® B 


By Proposition 8.94, there is an isomorphism B[n] = Tori(I„, B). 

(ii) We can now compute Tor*( A. B) whenever A and B are finitely generated abelian 
groups. By the fundamental theorem, both A and B are direct sums of cyclic groups. Since 
Tor commutes with direct sums, Tor*(A, B) is the direct sum of groups Tor*(C. D ), where 
C and D are cyclic. We may assume that C and D are finite, otherwise they are projective 
and Tori = {0}. This calculation can be completed using part (i) and Exercise 5.6 on 
page 267, which says that if D is a cyclic group of finite order m, then D[n] is a cyclic 
group of order d, where d = (in, n) is their gcd. ■* 


In contrast to Ext, Proposition 10.97 can be generalized by replacing sums by direct 
limits. 


Proposition 10.99. //{B, , q)'j} is a direct system of left R-modules over a directed index 

set /, then there is an isomorphism, for all right R-modules A and for all n > 0, 

Tor ; f (A, limB,) = limTorf (A, Bf). 

Proof. The proofis by dimension shifting. The base step is Theorem 8.101, for Toro(A, ) 
is naturally equivalent to A® . 
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For the inductive step, choose, for each i e /, a short exact sequence 

0 Pi -* P, -> 0, 

where P, is projective. Since the index set is directed. Proposition 7.100 says that there is 
an exact sequence 

0 -> lim Nj -> lim P, -> lim Bj -> 0. 

Now lim Pi is flat, for every projective module is flat, and a direct limit of flat modules is 
flat, by Corollary 8.102. There is a commutative diagram with exact rows: 


Tori (A, lim Pi) ->■ Tori (A, lim Bj) ——^ A <g> lim N, -=► A <g> lim P, 

r o 

Y -» Y Y 

lim Tori (A, Pi) -»• lim Tori (A, Bj) ——^ lim A <g> Nj -^ lim A <g> P,, 

where the maps in the bottom row are just the usual induced maps between direct limits, 
and the maps r and a are the isomorphisms given by Theorem 8.101. The step n > 2 is 
routine. • 

This last proposition generalizes Lemma 8.97, which says that if every finitely generated 
submodule of a module M is flat, then M itself is flat. After all, by Example 7.97(iv), M 
is a direct limit, over a directed index set, of its finitely generated submodules. 

When the ring R is noncommutative, A <%)/,■ B is an abelian group, but it need not be an 
R -module. 


Proposition 10.100. 

(i) Let r e Z(R) be a central element, let A be a right R-module, and let B be a left 
R-modules. If /x,-: B —> B is multiplication by r, then the induced map 

p r *: Tor,f (A, B) -> Tor,f(A, B ) 

is also multiplication by r. 

(ii) If R is a commutative ring, then Tor ; ^(A, B) is an R-module. 


Proof (i) This follows at once from Example 10.47. 

(ii) This follows from part (i) if we define scalar multiplication by r to be /x, „. • 

We are now going to assume that R is a domain, so that the notion of torsion submodule 
is defined, and we shall see why Tor is so called. 
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Lemma 10.101. Let R be a domain, let Q = Frac(7?), and let K — Q/R. 

(i) If A is a torsion R-module, then Tor^(/f, A) = A. 

(ii) For every R-module A, we have Tor,, (/f, A) = {0 } for all n > 2. 

(iii) If A is a torsion-free R-module, then Tori(W, A) = {0}. 

Proof, (i) Exactness of 0 —> R Q —*■ K —> 0 gives exactness of 

Torj(Q, A) -> Tori(X, A) -> /? <g> A —> 0 ® A. 

Now Q is flat, by Corollary 8.103, and so Tori (6, A) = {0}, by Proposition 10.96. The last 
term Q <g> A = {0} because Q is divisible and A is torsion, by Exercise 9.15 on page 665, 
and so the middle map Tori (IT, A) -> R <g> A is an isomorphism. 

(ii) There is an exact sequence 

Tor,,(Q, A) -> Tor,,(IT, A) -> Tor„_i(/?, A). 

Since n > 2, we have n — 1 > 1, and so both the first and third Tor’s are {0}, because Q 
and R are flat. Therefore, exactness gives Tor,, (K, A) — {0}. 

(iii) By Theorem 8.104, there is an injective .R-module E containing A as a submodule. 
Since A is torsion-free, however, A (T tE = {0}, and so A is imbedded in E/tE. By 
Lemma 7.72, injective modules are divisible, and so E is divisible, as is its quotient E/tE. 
Now E/tE is a vector space over Q, for it is a torsion-free divisible R-module (Exer¬ 
cise 9.7 on page 664). Let us denote E/tE by V. Since every vector space has a basis, V 
is a direct sum of copies of Q. Corollary 8.103 says that Q is flat, and Lemma 8.98 says 
that a direct sum of flat modules is flat. We conclude that V is flat. 14 

Exactness of0-*A—>V-*V/A->0 gives exactness of 

Tor 2 (fC V/ A) —» Tori(fC A) —> Tori(W, V). 

Now Tor 2 (/C V/A) — {0}, by part (ii), and Tori(fC V) — {0}, because V is flat. We 
conclude from exactness that Tori (K, A) = {0}. • 

The next result shows why Tor is so-called. 

Theorem 10.102. 

(i) If R is a domain, Q = Fracf R), and K — Q/R, then the functor T'orj 4 f /C, ) is 
naturally equivalent to the torsion functor. 

(ii) Tor^(fif, A) = t A for all R-modules A. 


^Torsion-IYee Z-modules are flat, but there exist domains R having torsion-free modules that are not flat. In 
fact, domains for which every torsion-free module is flat, called Priifer rings, are characterized as those domains 
in which every finitely generated ideal is a projective module. 
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Proof. Exactness of 

Tor 2 (X, A/tA) -> Tor, (K, tA) Tor, (A", A) -> Tori (K, A/tA). 

The first term is {0}, by Lemma 10.101(ii), and the last term is {0}, by Lemma 10.101(iii). 
Therefore, the map : T’or | ( K, tA) -> Tor \(K. A) is an isomorphism. 

Let /: A —> B and let /': tA —»■ tB be its restriction. The following diagram com¬ 
mutes, because Tori (K, ) is a functor, and this says that the isomorphisms i,\ constitute a 
natural transformation. 


Torj (K, tA) --—a- Tori (K, A) 


Tori (K, tB) 


r 

—->■ Tori (/if, B) 




There is a construction of TorjTA, B) by generators and relations. Consider all triples 
(a, n, b), where a e A, b e. B , na = 0, and nb — 0. Then Tor f(A, B) is generated by all 
such triples subject to the relations (whenever both sides are defined): 

(a + a',n,b) — (a,n, b) + (a 1 , n, b) 

(a, n, b + b ) — (a, n, b) + (a,n,b ) 

( ma, n,b) — ( a, mn , b) = ( a , m. nb). 

For a proof of this result, and its generalization to Tor,f (A, B) for arbitrary rings R , see 
Mac Lane, Homology , pp. 150-159. 

The Tor functors are very useful in algebraic topology. The Universal Coefficients The¬ 
orem gives a formula for the homology groups //„ ( X: G) with coefficients in an abelian 
group G. 

Theorem (Universal Coefficients). For every topological space X and every abelian 
group G, there are isomorphisms for all n > 0, 

H n (X- G) = H n (X) feGffi Torf(tf„_i(X), G). 

Proof. See Rotman, An Introduction to Algebraic Topology, page 261. • 

If we know the homology groups of spaces X and Y, then the Kunneth formula gives a 
formula for the homology groups of X x Y, and this, too, involves Tor in an essential way. 

Theorem (Kunneth Formula). For every pair of topological spaces X and Y, there are 
isomorphisms for every n > 0, 

H n (X xY) = J2 HfiX) H n -i(X) © J]Torf (H p (X), H n - { - p (Y)). 

i P 

Proof. See Rotman, An Introduction to Algebraic Topology , page 269 • 
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Exercises 

10.44 Prove the following analog of Theorem 10.45. Let £ n : jjMod -*■ Ab be a sequence of 
covariant functors, for n > 0, such that 

(i) for every short exact sequence 0 —>■ A —> B —► C —»■ 0, there is a long exact sequence 
and natural connecting homomorphisms 

-» £ n (A) £ n (B) -» £ n (C) F" +1 (A) -►■■■; 

(ii) there is a left F-module M such that £ 0 and Hom^(M, ) are naturally equivalent; 

(iii) £ n (E) = {0) for all injective modules E and all n > 1. 

Prove that £ n is naturally equivalent to Ext "(M, ) for all n > 0. 

10.45 Let TOR" : ^Mod -* Ab be a sequence of covariant functors, for n > 0, such that 

(i) for every short exact sequence 0 —> A —> B —> C ^ 0, there is a long exact sequence 
and natural connecting homomorphisms 

■ ■ ■ -> TOR„(A) -»• TOR„(fi) ->• TOR„(C) TOR„_!(A) -> ■ ■ ■ ; 

(ii) there is a left F-module M such that TORg and are naturally equivalent; 

(iii) TOR,, ( P) = {0} for all projective modules P and all n > 1. 

Prove that TOR,, is naturally equivalent to Tor,, ( , M) for all n > 0. (There is a similar 
result if the first variable is fixed.) 

10.46 Prove that any two split extensions of modules A by C are equivalent. 

10.47 Prove that if A is an abelian group with nA = A for some positive integer n, then every 

extension 0 —> A —>■£—>■ —>■ 0 splits. 

10.48 If A is a torsion abelian group, prove that Ext 1 (A, Z) = Hom(A, 5 1 ), where S 1 is the circle 
group. 

10.49 Prove that a left R -module E is injective if and only if Extjj(A, E) = {0} for every left 
f?-module A. 

10.50 For any ring R. prove that a left /^-module B is injective if and only if Ext 1 (R/I, B) = {0} 
for every left ideal I. 

Hint. Use the Baer criterion. 

10.51 Prove that an abelian group G is injective if and only if Ext 1 (Q/Z, G) = {0}. 

10.52 Prove that an abelian group G is free abelian if and only if Ext 1 (G, F) = {0) for every free 
abelian group F. 15 

10.53 If 0 —>■ A —»■ B —»• C —»■ 0 is an exact sequence of right F-modules with both A and C flat, 
prove that B is flat. 

10.54 If A and B are finite abelian groups, prove that Tor^(A, B) = A B. 

15 The question whether Hxt 1 ( G. Z) = {0} implies G is free abelian is known as Whitehead’s problem. It 

turns out that if G countable, then it must be free abelian, but S. Shelah proved that it is undecideable whether 

uncountable such G must be free abelian. 
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10.55 Let R be a domain, Q — Frae( R ). and K = Q/R. 

(i) Prove, for every /^-module A, that there is an exact sequence 

0 —>■ rA A —>■ 

(ii) Prove that a module A is torsion if and only if Q ® A = {0}. 

10.56 Let R be a domain. 

(i) If B is a torsion ^-module, prove that Tor„(A, B) is a torsion i?-module for all R- 
modules A and for all n > 0. 

(ii) For all R-modules A and B, prove that Tor,, (A, B) is a torsion R-module for all n > 1. 

10.57 Let k be a field, let R = k[x , y], and let / be the ideal (x, y). 

(i) Prove that x ® y — y ® x £ I I is nonzero. 

Hint. Consider (I/I 2 ) <g> (I/I 2 ). 

(ii) Prove that x(x ® y — y <8> x) — 0, and conclude that 1 I is not torsion-free. 


10.7 Cohomology of Groups 

Recall that Proposition 10.30 and Proposition 10.31 say that there is an exact sequence 
U 3 F 2 Fi F 0 -> Z -> 0, 

where Fq, F \, Fj, and F 3 are free (7-modules and Z is viewed as a trivial g-module. In 
light of the calculations in Section 10.3, the following definition should now seem reason¬ 
able. 

Definition. Let G be a group, let A be a G-module (i.e, a left ZG-module), and let Z be 
the integers viewed as a trivial G-module (i.e, gm = m for all g e G and in e Z). The 
cohomology groups of G are 


H n (G, A) =Ex4 3 (Z, A); 


the homology groups of G are 

H n (G, A) = Torf^Z, A). 


The history of cohomology of groups is quite interesting. The subject began with the 
discovery, by the topologist W. Hurewicz in the 1930’s, that if X is a connected aspherical 
space (in modern language, if the higher homotopy groups of X are all trivial), then all 
the homology and cohomology groups of X are determined by the fundamental group tc = 
7Z\(X). This led to the question of whether H n ( X) could be described algebraically in terms 
of tc . For example, Hurewicz proved that ll\(X) = jc/tt', where tc' is the commutator 
subgroup. In 1942, H. Hopf proved that if tc has a presentation F/R, where F is free, then 
H^iX) = (R fl F')/[F, R], where [/-\ R] is the subgroup generated by all commutators of 



Sec. 10.7 Cohomology of Groups 


871 


the form frf~ l r~ l for / e F and r e R. These results led S. Eilenberg, S. Mac Lane, 
Hopf, H. Freudenthal, and B. Eckmann to create cohomology of groups. 

In what follows, we will write Home instead of Hom^G and <8>g instead of Czg- Be¬ 
cause of the special role of the trivial G -module Z, the augmentation 

s: ZG -> Z, 


defined by 


y^ m x x i 


xeG 


xeG 


is important. Recall that we have seen, in Exercise 8.37 on page 573, that e is a surjective 
ring homomorphism, and so its kernel, Q , is a two-sided ideal in ZG, called the augmen¬ 
tation ideal. Thus, there is an exact sequence 


ZG 


Z-* 0. 


Proposition 10.103. Let G be a group with augmentation ideal Q. A? an abelian group, 
Q is free abelian with basis G — 1 = {x — 1 : x e G, x ^ 1}. 

Proof. An element u = f /, m x x e ZG lies in kere = Q if and only if m x — 0. 
Therefore, if u e Q, then 

u — u — (y~^ m x ) 1 = y - ' m x (x — 1). 

X X 

Thus, Q is generated by the nonzero x — 1 for i e G. 

Suppose that n x (x — 1) = 0. Then n x x ~ (X^l n x) 1 = 0 in ZG, which, 

as an abelian group, is free abelian with basis the elements of G. Hence, n x — 0 for all 
t ^ 1. Therefore, the nonzero x — 1 comprise a basis of Q. • 

We begin by examining homology groups. 

Proposition 10.104. If A is a G-module, then 

Hq(G, A) = Z <g>c A = A/QA. 


Proof. By definition, Hq(G, A) = TorQ G (Z, A) — Z Cg A. Applying the right exact 
functor Cia A to the exact sequence 

0-*£-*ZG-»Z-*0 

gives exactness of the first row of the following commutative diagram: 

Q < 8 >g A -ZG ®g A -> Z ®g A -> 0 

Y V V 

QA -A->■ A/QA -^ 0 

The two solid vertical arrows are given by u <g> a i-> ua. By Proposition 8.93, there is an 
isomorphism Z ®g A = A/QA. • 
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It is easy to see that A/QA is G-trivial; indeed, it is the largest G-trivial quotient of A. 

Example 10.105. 

Suppose that E is a semidirect product of an abelian group A by a group G. Recall that 
[G, A] is the subgroup generated by all commutators of the form [ x.a] = xax~ l a~ l , 
where x e G and a <= A. If we write commutators additively, as we did at the beginning 
of this chapter, then 


[x, a] = x + a — x — a = xa — a = (x — 1 )a 


(recall that G acts on A by conjugation). Therefore, A/Q A = A/[G, A] here. ◄ 

We are now going to use the independence of the choice of projective resolution to 
compute the homology groups of a finite cyclic group G. 

Lemma 10.106. Let G = (x) be a cyclic group of finite order k. Define elements D and 
N in Z G by 

D — x — 1 and N — 1 + x + x 2 + ■ ■ ■ + x k ~ 1 . 

Then the following sequence is a G-free resolution of Z: 

N D N D e 

-» ZG ZG -A ZG —-» ZG -U ZG -A Z -» 0, 

where e is the augmentation and the other maps are multiplication by N and D, respec¬ 
tively. 

Proof Obviously, every term ZG is free; moreover, since ZG is commutative, the maps 
are G-maps. Now DN — ND = x k — 1 = 0, while if it e ZG. then 

eD(u ) = s((x — 1 )u) — six — l)e(n) = 0, 

because s is a ring map. Thus, we have a complex, and it only remains to prove exactness. 

We have already noted that e is surjective. Now kere = Q = imZ), by Proposi¬ 
tion 10.103, and so we have exactness at the zeroth step. 

Suppose u — ff!/ (I m i x ' e kerZ); that is, (x — l)u — 0. Expanding, and using the fact 
that ZG has basis {l,x,x 2 ,, x k ~ { ). we have 


m o = m\ = ■ ■ • = mk—i. 


so that u = ihqN e im N, as desired. 

Finally, if u — nijx' e kerN, then 0 = s(Nu ) = e(N)e(u) — ks(u), so that 

s(u) — JZ/Zq nij — 0. Therefore, 

u — —Dfnol + (mo + m\)x + • • • + (mo + • • • + nit-iji 1-1 ) e im D. • 
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Definition. If A is a G-module, define submodules 


and 


A[iV] = {fl e A : Na — 0} 


A G = {a e A : ga = a for all g e G }. 


Theorem 10.107. If G is a cyclic group of finite order k and A is a G-module, then 

Hq(G, A) = A/GA- 

H 2 n-i(G, A) = A g /N A for all n > 1; 

HiniG, A) — A[N]/QA for all n > 1. 

Proof. Apply (do A to the resolution of Z in Lemma 10.106, noting that ZG (da A = 
A. The calculation of ker/im is now simple, using im D = QA, which follows from 
Proposition 10.103 and the fact that (jc — 1) | (jc* — 1). • 


Corollary 10.108. If G is a finite cyclic group of order k and A is a trivial G-module, 
then 

Hq(G, A) = A; 

H 2 n-i{G, A) = A/kA for all n > 1; 

H 2 „{G, A) — A[k] for all n > 1. 

In particular, 

Hq(G,Z) = Z; 

H 2 „-i(G,Z) = Z/kZ for all n > 1; 

H 2 n(G, Z) = {0} for all n > 1. 

Proof. Since A is G-trivial, we have A G — A and QA — {0} (for Da — (x — l)a = 0 
because xa — a). • 

We now compute low-dimensional homology groups of not necessarily cyclic groups. 
Lemma 10.109. For any group G, we have 

H l (G,z) = g/g 2 . 

Proof. The long exact sequence arising from 

0^g^ZG-^Z->0 

ends with 

Hi(G, ZG) H\ (G, Z) H q (G , G) H 0 (G , ZG) H 0 (G, Z) 0. 
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Now H i (G, 1G) = {0}, because ZG is projective, so that 3 is an injection. Also, 


H 0 (G,ZG) = Z, 

by Proposition 10.104. Since e* is surjective, it must be injective as well (if kere* ^ {0}, 
then Z/kere* is finite; on the other hand, Z/kere* = ime* = Z, which is torsion-free). 
Exactness of the sequence of homology groups (3 is surjective if and only if e* is injective) 
now gives 3 a surjection. We conclude that 


3: Hi(G,Z) = H 0 (G,G) = G/G 2 , 


by Proposition 10.104. • 


Proposition 10.110. For any group G, we have 

Hi (G, Z) = G/G', 

where G' is the commutator subgroup of G. 

Proof. It suffices to prove that G/G' = G/G 2 - Define 6 ; G -> G/G 2 by 

6: x i-> (x — 1) + G 2 ■ 

To see that 0 is a homomorphism, note that 

xy - 1 - (x - 1) - (y - 1) = (x - l)(y - 1) e G 2 , 


so that 


0(xy) — xy - 1 + G 2 

= (x -l) + (y-l) + G 2 
= x- l + G 2 + y- l + G 2 
= e(x) + e(y). 

Since G/G 2 is abelian, ker KG', and so 0 induces a homomorphism O' : G/G' —> G/G 2 , 
namely, xG' i-* x — 1 + G 2 - 

We now construct the inverse of O'. By Proposition 10.103, G is a free abelian group 
with basis all x — 1, where x e G and x f 1. It follows that there is a (well-defined) 
homomorphism cp: G —> G/G', given by 

(p: x - 1 h-> xG'. 

If G 2 C ker (p. then <p induces a homomorphism G/G 2 —> G/G' that, obviously, is the 
inverse of 6', and this will complete the proof. 
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If u e Q 2 , then 

u = (y^ m x (x - l))(y^« y (y - 1)) 
x^l y£l 

— 'Y^m x n y (x - l)(y - 1) 

x,y 

= X! m x n y(( x y - 1) - (x - 1) - (y - 1)). 

x,y 

Therefore, <p(u ) = ]~[ t v (xyx~ 1 y - 1 G' = G'. and so a e ker<p, as desired. • 

The group Ih{G. Z) is useful; it is called the Schur multiplier of G. For example, 
suppose that G = F/R , where F is a free group; that is, we have a presentation of a group 
G. Then Hopf’s formula is 

H 2 (G, Z) = (RnF)/[F,R] 

(see Rotman, An Introduction to Homological Algebra, page 274). It follows that the group 
(R fl F)/[F, R | depends only on G and not upon the choice of presentation of G. 

Definition. An exact sequence 0-*A—> lisa central extension of a group 
G if ,4 < Z(E). A universal central extension of G is a central extension 0 -> M —> 
U —y G -* 1 for which there always exists a commutative diagram 

0- A -^ E - G -^ 1 

1G 

y v I 

0-»■ M ->- U ->- G ->-1 

Theorem. If G is a finite group, then G has a universal central extension if and only 
if G — G', in which case M = H 2 (G, Z). In particular, every finite simple group has a 
universal central extension. 

Proof. See Milnor, Introduction to Algebraic K-Theory, pages 43^-6. • 

This theorem is used to construct “covers” of simple groups. 

We now consider cohomology groups. 

Proposition 10.111. Let G be a group, let A be a G-module, and let Z be viewed as a 
trivial G-module. Then 

H°(G, A) = Hom G (Z, A) = A G . 

Proof. By definition, 

H°(G, A) = Ext^ G (Z, A) = Hom G (Z, A). 

Define ta : Hom G (Z, A) -* A c by / m* /(1). Note that /(1) e A G : If g e G, then 
gf( 1) = f{g ■ 1) (because / is a G-map), and g ■ 1 = 1 (because Z is G-trivial); therefore, 
gfi 1) = /(1), and /(l) e A G . That r a is an isomorphism is a routine calculation. • 
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It follows that H q (G, A ) is the largest G-trivial submodule of A. 

Theorem 10.112. Let G — (a) be a cyclic group of finite order k, and let A be a G- 
module. If N — yfZn er ! and D = a — 1, then 

H°(G, A) = A g ; 

H 2n ~\G, A) = ker A/(cr - 1 )A for all n > 1; 

H 2 "(G,A) = A g /NA for all n > 1. 

Proof. Apply the contravariant Home( , A) to the resolution of Z in Lemma 10.106, 
noting that Home(ZG, A) = A. The calculation of ker / im is now as given in the state¬ 
ment. • 

Note that Proposition 10.103 gives im D — QA. 

Corollary 10.113. If G is a cyclic group of finite order k and A is a trivial G-module, 
then 

H°(G, A) = A; 

H 2n ~ l (G, A) = A[k] for all n > 1; 

H 2n (G,A) = A/kA for all n > 1. 

In particular, 

H°(G, Z) = Z; 

H 2n ~ l (G, Z) = {0} for all n > 1; 

H 2n (G, Z) = Z/fcZ for all n > 1. 

Remark. A finite group G for which there exists a nonzero integer d such that 

H n (G, A) = H" +d (G, A), 

for all n > 1 and all G-modules A, is said to have periodic cohomology. It can be proved 
that a group G has periodic cohomology if and only if its Sylow /^-subgroups are cyclic, for 
all odd primes p, while its Sylow 2-subgroups are either cyclic or generalized quaternion 
(see Adem-Milgram, Cohomology of Finite Groups, p. 148). For example, G = SL(2, 5 ) 
has periodic cohomology: it is a group of order 120 = 8 • 3 • 5, so its Sylow 3-subgroups 
and its Sylow 5-subgroups are cyclic, having prime order, while its Sylow 2-subgroups are 
isomorphic to the quaternions. -4 

We can interpret H 1 (G, A) and H 2 (G, A), where G is any not necessarily cyclic group, 
in terms of derivations and extensions if we can show that the formulas in Section 10.3 do, 
in fact, arise from a projective resolution of Z. Alas, we need a technical interlude. 
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Definition. If G is a group, define Bq(G) to be the free G-module on the single generator 
[ ] (hence, Bq(G) = ZG) and, for n > 1, define B n ( G ) to be the free G-module with basis 
all symbols [x\ \ x 2 \ ■ ■ ■ \ x n ], where x* e G. Define s: Bq(G) —> Z by e([ ]) = 1 and, 
for n > 1, define d n : B n (G ) -> B„_i(G) by 


d„ : [x\ | • • • | x n \ i-> xi[x 2 | • • • | x n ] 


n —1 

+ I • • • I XjX i+ 1 I ••• I x n ] 

i =1 

+ (-l)"[Xl | ••• \x n -ll 


The bar resolution is the sequence 

B.(G) :-> B 2 (G) B\(G) ^ B 0 (G ) -U Z -> 0. 

Let us look at the low-dimensional part of the bar resolution. 
d[: [x] x[ ]; 

d 2 - [x | y] i ^ x[y] - [xy] + [x]; 

^ 3 : [x I y I z] h* x[y | z] - [xy | z] + [x | yz] - [x | y] 

These are the formulas that arose in the earlier sections, but without the added conditions 
[x | 1] = 0 = [1 | y] and [1] = 0. In fact, there are two bar resolutions; the bar resolution 
just defined, and another we shall soon see, called the normalized bar resolution. 

The bar resolution is a free resolution of Z, although it is not a routine calculation to 
see this; we prove that it is a resolution by comparing B.(G) to a resolution familiar to 
algebraic topologists. 

Definition. If G is a group, let P„(G ) be the free abelian group with basis all (n + 1)- 
tuples of elements of G; make P n (G ) into a G-module by defining 

x(xo, XI, . . . , X n ) = (xxo, XXI, . . . , XX„). 

Define 3„ : P„(G) -> P n -\ (G), whenever n > 1, by 

n 

3„ ; (x 0 , xi,..., x„) (-^ ^(-1)' (xo-- %,..., x„), 

i =o 

where x,- means that x, has been deleted. P.(G) is called the homogenous resolution of 
Z. 


Note that Po(G) is the free abelian group with basis all (y), for y e G, made into a 
G-module by x(y) = (xy). In other words, Pq(G) = ZG. 

The proof that P.(G) is a projective resolution of Z will be broken into two parts. 
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Lemma 10.114. The sequence 

P.(G) :-> P 2 (G) Pi(G) Pq(G) -U Z -> 0, 

where e is the augmentation, is a complex. 

Proof. It suffices to prove that 3„_i3„(xo, xi,..., x n ) — 0. Now 

n 

dn— 1 dn X\ > • • • 5 Xn) = ^ 1) $n— 1 C^0> ... j Xi, , X n ) 

i=0 
n 

- 1)' (^^(—!) J C^o, ...,Xj,...,Xi,... ,x n )j 

i=0 j<i 

n 

+ y^(-iy (Yi-d'-Hxq—... ,xj,... 

i= 0 j>i 

In the last equation, the first summation has inner sign (— 1 ) J , because j < i and so x,- 
is still in the /th position after the deletion of x, from the original n-tuple. In the second 
summation, however, the inner sign is (—l)-' -1 , because i < j and so Xj is in position 
j — 1 after deletion of the earlier x,\ Thus, 3„_i3„(xo, xi,..., x n ) is a sum of (n — 2)- 
tuples (x(),,~Xj,... ,x n ) with i < j, each of which occurs twice: once upon 
deleting x, by 3„ and then deleting xj by 3„_i; a second time upon deleting xj by 3„ and 
then deleting x, by 3„_i. In the first case, the sign of the (n — 2)-tuple is (— in 

the second case, its sign is (— \) l+J . Therefore, the in — 2)-tuples cancel in pairs, and 

3«-|3n = 0. • 

Proposition 10.115. The complex 

P. (G ) :-> P 2 (G) P ] (G) -X p 0 (G) Z -> 0, 

where 3q = e is the augmentation, is a G-free resolution ofL. 

Proof. We let the reader prove that P„ (G ) is a free G-module with basis all symbols of 
the form (1, xi, ..., x n ). 

To prove exactness of P.(G), it suffices, by Proposition 10.40, to construct a contracting 
homotopy; that is, maps 


P 2 (G) 


P\ (G) 


so 


Po(G) 


■S-l 


Z 


with es_i — lz and 


3//) ]Sn -T s n —[d n — 1 p n (G)i for all n > 0. 
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Define 5 _i: Z —»■ Pq(G) by m i-> m( 1), where the 1 in the parentheses is the identity 
element of the group G, and, for n > 0, define ,v„ : P„ (G) -* P„+i (G) by 

S n : (x'0, XI, . . . , X n ) I-* ( 1 , X0,X1,- Xyi ). 

These maps s„ are only Z-maps, but Exercise 10.32 on page 829 says that this suffices to 
prove exactness. Here are the computations. 

£5—1(1) = e((l)) = l 

If n > 0, then 

dn+lSn (xo, • • • ? %n) = 1 ( 15 Xo, • • • > 

n 

= (xo, ■ • • , x„) + y~'(-l) i + 1 (l,XQ, ...Si -- x„) 

i=0 

[the range of summation has been rewritten because x,- sits in the (i + l)st position in 
(1, xo,..., x„)]. On the other hand, 

n 

S n -1 3n(XQ,-X„) = 5„_l y^(~l ) J (XQ,-X), . . . ,X„) 

;=0 

n 

— y>iV(i,x 0 — Sj, —x„). 

7=0 

It follows that ^ 9 h _|_i 5 ;; T 5 /;— 1 3?^)) (xo, * • •, x^) — (xo,. • *, x^). • 


Proposition 10.116. The bar resolution B.(G) is a G-free resolution ofZ. 
Proof. For each n > 0, define r„ : P„ (G) —> B n (G) by 

r„ : (x 0 , ..., x„) x 0 [Xq J xi | xf l x 2 | • • • | x~\x„], 
and define a n : B n (G ) —»■ P n (G) by 

<T„ : [X'l | • • • | X„] ( 1 , Xl, X'lX'2, X1X2X3, . . . , X'lX'2 • • • x„). 


It is routine to check that r„ and <t„ are inverse, and so each r n is an isomorphism. 

The reader can also check that r: P. (G) -> B. (G) is a chain map; that is, the following 
diagram commutes: 

P n (G) -——>■ B n (G) 


V 

Pn-l(G) 


^n —1 


dn 

v 


Bn-l(G) 


Finally, Exercise 10.22 on page 827 shows that both complexes have the same homology 
groups. By Proposition 10.115, the complex P.(G) is an exact sequence, so that all its 
homology groups are {0}. It follows that all the homology groups of B.(G) are {0}, and so 
it, too, is an exact sequence. • 
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Definition. Define 


[X\ | • • • | X„]* 


\[x\\---\x n ] if all x ( - ^ 1; 

| 0 if some x; = 1. 


The normalized bar resolution , B*(G), is the sequence 


B:(G) :-» -A B*(G) -A B*(G) Az^O, 

where B*(G) is the free G-module with basis all nonzero [xi | • • • | x n ]*, and the maps 
d„ have the same formula as the maps d n in the bar resolution (except that the symbols 
[xi | • • • | x„] now occur as [xi | • • • | x„]*). 


Since we are making some of the basis elements 0, it is not obvious that the normalized 
bar resolution B* (G) is a complex, let alone a resolution of Z. 


Theorem 10.117. The normalized bar resolution B*(G) is a G-free resolution ofZ. 
Proof. We begin by constructing a contracting homotopy 

• • • 4 - BUG) A- BUG) A- BUG) A- Z, 

where each t n is a Z-map. Define f_i: Z -> Bq(G) by t -\: m h-4 m[ ]. Note that B*(G) 
is a free G-module with basis all nonzero [x\ \ ■ ■ ■ | x„]*; hence, it is a direct sum of 
copies of ZG. Since ZG is a free abelian group, B*(G) is also a free abelian group; the 
reader may check that a basis of B*{G ), as a free abelian group, consists of all nonzero 
x[xj | • • • | x,,]*. To define t n for n > 0 , we take advantage of the fact that t„ need only be 
a Z-map, by giving its values on these Z-basis elements (and freeness allows us to choose 
these values without restrictions). Thus, for n > 0 , define t„ : B* (G) —> B* + ] (G) by 

t n : x[xi | ••• | x„]* (->■ [x | xi | • • • | x„]*. 

That we have constructed a contracting homotopy is routine; the reader may check that 
st- 1 = lz and, for n > 0, that 

dn+\tn + tn—ldn = 1b*(G )• 

The proof will be complete once we show that B*(G) is a complex. Since B* +1 (G) is 
generated, as a G-module, by im t n , it suffices to show that d n d„+\ — 0 on this subgroup. 
We now prove, by induction on n > —1, that d n d n+ \ t„ — 0. The base step is true, for 
s — t- 1 and 0 — sd\ — t-idi. For the inductive step, we use the identities in the definition 
of contracting homotopy and the inductive hypothesis d n -\d n — 0: 

dudf !) \hi — d f! (1 t n -\d n ) 

— dn d n tn—ldn 

— d n (1 tn—ldn — \)dn 

— d n dn t n -2d n -.%d n 

= 0 . • 
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We can now interpret H l (G , A) and H 2 (G, A). 

Corollary 10.118. For every group G and every G-module A, the groups H [ (G, A) and 
H 2 (G, A) constructed in Section 10.3 coincide with the cohomology groups. 

Proof. We have proved that factor sets, coboundaries, derivations, and principal deriva¬ 
tions do, in fact, arise from a projective resolution of Z. • 

Proposition 10.119. If G is a finite group of order m, then mH n (G , A) — {0 } for all 
n > 1 and all G-modules A. 

Proof. Sum the cocycle formula, as in the proof of Theorem 10.21. • 

Corollary 10.120. If G is a finite group and A is a finitely generated G-module, then 
H n (G, A) are finite for all n > 0. 

Proof. H"(G , A) is a finitely generated abelian group (because A is finitely generated) 
of finite exponent. • 

Both Proposition 10.119 and its corollary are true for homology groups as well. 

There are several aspects of the cohomology of groups that we have not mentioned. 
Aside from being a useful tool within group theory itself, these groups also form a link with 
algebraic topology. For every group G, there exists a topological space K(G, 1), called an 
Eilenberg-Mac Lane space, whose fundamental group is G and whose cohomology groups 
coincide with the algebraically defined cohomology groups. 16 There is, in fact, a deep 
connection between group theory and algebraic topology, of which this is a first sign. 

An important property of the cohomology of groups is the relation between the co¬ 
homology of a group and the cohomology of its subgroups and its quotient groups. If 
cp: S —> G is a homomorphism, every G-module A becomes an 5-module if we define 
sa — (p(s)a for all s e S and a e A. What is the connection between H"(S, A) and 
H n (G , A)? What is the connection between H n (S, A) and H n (G, A)? [There is also a 
connection between homology groups and cohomology groups: H"(G, A)* A H n (G , A*), 
where A* = Horn z(A, Q/Z).] 

There are three standard maps, which we will define in terms of the bar resolution. The 
first is restriction. If S is a subgroup of a group G, then every function /: B n {G) —>■ A 
is defined on all [x\ | • • • | x n ] with x,- e G; of course, / is defined on all n-tuples of 
the form [si | • • • | s„] with s,- e S C G, so that its restriction, which we denote by /|5, 
maps B„(S ) -> A. If / is an n-cocycle, let us write els/ to denote its cohomology class: 
els/ = / + imt/,,+ 1 . Then 

Res: H n (G, A) -> H n (S,A) 

is defined by Res(cls/) = cls(/|5). One result is that if G is a finite group, S p is a 
Sylow p-subgroup, and n > 1, then Res: H n (G , A) -> H n (S p , A) is injective on the 

'^Because of this topological connection, many authors use the notation H" (n. Z) to denote cohomology 
groups, for tt\ is the standard notation for the fundamental group. 
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/^-primary component of H n (G, A); thus, the cohomology of G is strongly influenced by 
the cohomology of its Sylow subgroups. 

If S < G, there is a map 


Cor: H n (S, A) -* H n (G, A) 

in the reverse direction, called corestriction , which is defined when S has finite index in 
G. We first define Cor in dimension 0; that is. Cor 0 : A s A G , by a i-> J2 t eT ta ~ where 
T is a left transversal of S in G (of course, we must check that Cor 0 is a homomorphism 
that is independent of the choice of transversal). There is a standard way of extending a 
map in dimension 0 to maps in higher dimensions (essentially by dimension shifting), and 
if [G : 5] = m, then 


Cor" o Res" : H n (G, A) -* H n (G, A) = m; 
that is, the composite is multiplication by m. Similarly, in homology, the map 

Cor 0 : A/QA A/S A, 

defined by a + QA t~ l a + SA, extends to maps in higher dimensions. When 

n — 1 and A = Z, we have Cori: H\(G, Z) —»■ H\{S , Z); that is, Cori: G/ G' —> S/S'. 
There is such a homomorphism well known to group theorists, called the transfer Vc^s, 
and it turns out that Cori = Vq^s- 

The third standard map is called inflation. Suppose that N is a normal subgroup of a 
group G. If A is a G-module, then A A is a G/A-module if we define ( gN)a = ga for 
a e A n [if gN — hN , then h = gx for some x e N, and so ha = ( gx)a — g(xa ) = ga, 
because xa — a]. Define 

Inf: H n {G/N, A N ) -> H n (G, A) 

by els/ cls(/ # ), where 

/ #; L?1 I ••• \gn]^ f[glN | ••• | gnN], 

A useful result here is the five term exact sequence: If N <1 G and A is a G-module, 
there is an exact sequence of abelian groups: 

0 -> H\G/N, A n ) H l (G, A) H l (N, A) G/N 

—» H 2 (G/N, A n ) —> H 2 (G, A) 

(there is a version of this sequence in homology as well). For an excellent discussion of 
these ideas, we refer the reader to Serre, Corps Locaux, pp. 135-138. 

We can force cohomology groups to be a graded ring by defining cup product on 
H*(G, R) = E„>o H n (G, R), where R is any commutative ring (see Evens, The Co¬ 
homology of Groups), and this added structure has important applications. 

We now consider the cohomology of free groups. 
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Lemma 10.121. If G is a free group with basis X, then its augmentation ideal Q is a free 
G-module with basis 

X - 1 = [x - 1 : x e X}. 

Proof. We show first that Q is generated by all X — 1. The identities 

xy — 1 = (x — 1) + x (y — 1 ) 
and 

x~ l — 1 = —x~ l (x — 1) 

show that if w is any word in X , then w — 1 can be written as a G-linear combination of 

X - 1. 

To show that Q is a free G -module with basis X — 1, it now suffices to show that the 
following diagram can be completed: 


0 

... 

where A is any G-module and <p is any function (uniqueness of such a map O follows from 
X — 1 generating Q). Thus, we are seeking <J> e Homed?, A). By Exercise 10.59 on 
page 886 , we have Horn g(G, A) = Der(G, A) via /:xk f(x — 1), where / e Q —> A, 
and so we seek a derivation. 

Consider the (necessarily split) extension 0 ^•A-»£’->G—> 1, so that E consists 
of all ordered pairs (g, a) e G x A. The given function <y: X — \ ~> A defines a lifting i 
of the generating set X of G, namely, 

£(x) = (tp(x — 1), x). 

Since G is free with basis X, the function l : X —> E extends to a homomorphism L : G —» 
E. We claim, for every g e G, that L(g) = (dig), g), where d: G -> A. Each g e G has 
a unique expression as a reduced word g — x^ 1 • • ■ x e f, where xy e X and e, — ±1. We 
prove the claim by induction on n > 1. The base step is clear, while 

L(g) = L(x[C .. x e f) 

= Lixl 1 ) ■ ■ ■ L(x e n -) 

= (<p(x 1 - l),xi) ei • ■ ■ (<p(x n - 1 ),x n ) e " 

= (d(g),g), 

and so the first coordinate d(g) lies in A. 

Exercise 10.59 on page 886 now says that there is a homomorphism <J>: Q 
by <J>(,g — 1) = dig) for all g e G. In particular, <t>(x — 1) = d(x) — tp(x — 
does extend <p. • 


—»• A defined 
1), so that T> 
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Theorem 10.122. If G is a free group, then H"(G , A) = {0} for all n > 1 and all 
G-modules A. 

Proof The sequence 0 —> Q —* ZG -> Z -* 0 is a free resolution of Z because Q is now 
a free G-module. Thus, the only nonzero terms in the deleted resolution occur in positions 
0 and 1, and so all cohomology groups vanish for n > 1. • 

We are now going to state an interesting result (the Stallings-Swan theorem), which 
was discovered using homomological methods but which does not mention homology in 
its statement. 

If G is a group and S < G is a subgroup, then every G-module A can be viewed as an 
5-module, for Z S is a subring of ZG. 

Definition. A group G has cohomological dimension < n, in symbols, cd(G) < n, if 

H n+X {S,A) = {0} 

for all G-modules A and every subgroup S of G. We write cd(G) = oo if no such integer 
n exists. 

We say that cd(G) = n if cd(G) < n but it is not true that cd(G) < n — 1. 


Example 10.123. 

(i) If G = {1}, then cd(G) = 0; this follows from Theorem 10.112 because G is a cyclic 
group of order 1. 

(ii) If G is a finite cyclic group of order k > 1, then cd(G) = oo, as we see from Corol¬ 
lary 10.113 with A — Z. 

(iii) If G ^ {1} is a free group, then Theorem 10.122 shows that cd(G) = 1, for every 
subgroup of a free group is free. 

(iv) If cd(G) < oo, then G must be torsion-free; otherwise, G has a subgroup 5 that is 
cyclic of finite order k > 1, and H'fS, Z) 0 for all even n. 

(v) It is known that if G is a free abelian group of finite rank n, then cd(G) — n. < 

Proposition 10.124 (Shapiro’s Lemma). Let G be a group and let S < G be a sub¬ 
group. If A is a 7iS-module, then for all n > 0, 

H n (S, A) = H"{G , Hom zs (ZG, A)). 

Proof Let • • • —> Pi —» If —»• Z —> 0 be a ZG-free resolution. If we denote 
HomzsCZG, A)) by A*, then 


H n (G, A*) = H n ( Hom ZG (P„ A*)). 
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By the adjoint isomorphism, 

Hom ZG (P/, A*) = Hom ZG (P ; , Hom zs (ZG, A)) 

= Horn zs(Pi ZG, A) 

= Homzj(?/, A). 

But (the proof of) Lemma 8.141 (i) shows that ZG is a free Z.S'-module, and so the free 
ZG-modules P, are also free ZS-modules. It follows that we may regard P. as a Z.S’-frcc 
resolution of Z, and there is an isomorphism of complexes: 

Hom Z5 (P„ A) = Hom ZG (P„ A*). 

Hence, their homology groups are isomorphic; that is, H"(S, A) = H"(G, A*). • 

Corollary 10.125. If G is a group and S < G is a subgroup, then 

cd(S) < cd(G). 

Proof. We may assume that cd(G) — n < oo. If m > n and there is a Z.S’-module A with 
H m (S, A) ^ {0}, then Shapiro’s lemma gives H m (G , Hom Z s(ZG, A)) = H' n (S, A) 
{0}, and this contradicts cd(G) — n. • 

Corollary 10.126. A group G of finite cohomological dimension has no elements ( other 
than 1) of finite order. 

Proof. This follows at once from Example 10.123(h) and the preceding corollary. • 

Are there groups G with cd(G) = 1 that are not free? In 1970, J. Stallings proved the 
following nice theorem (F 2 G denotes the group algebra over F 2 ). 

Theorem. If G is a finitely presented group for which //*(G,F 2 G) has more than 2 
elements, then G is a free product, G = H * K, where H ^ {1} and K { 1} (free product 
is the coproduct in Groups). 

As a consequence, he proves the following results. 

Corollary. If G is a finitely generated group with cd(G) = 1, then G is free. 

Corollary. If G is a torsion-free finitely generated group having a free subgroup of finite 
index, then G is free. 

R. G. Swan showed that both corollaries remain true if we remove the hypothesis that 
G be finitely generated. 

Theorem (Stallings-Swan). A torsion-free group having a free subgroup of finite index 
must be free. 
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Corollary 10.127 (Nielsen-Schreier). Every subgroup S of a free group F is itself free. 

Proof By Corollary 10.125, we have cd(S) < 1, and so the result follows from the 
theorem of Stallings and Swan. • 

We refer the reader to D. E. Cohen, “Groups of Cohomological Dimension 1,” Lecture 
Notes in Mathematics, Vol. 245, Springer-Verlag, New York, 1972, for proofs of these 
theorems. 

Exercises 

10.58 (i) Prove that the isomorphisms in Proposition 10.104 constitute a natural equivalence 

to A A/QA. 

(ii) Prove that the isomorphisms in Proposition 10.111 constitute a natural equivalence 
Homg(Z, ) to A A G . 

10.59 For a fixed group G, prove that the functors Hom^fC/, ) and Der(G, ) are naturally equiva¬ 
lent. 

Hint. If /: Q —► A is a homomorphism, then df.x fix — 1) is a derivation. 

10.60 (i) If G is a finite cyclic group and 0— > A —> B —> C —>• 0 is an exact sequence of 

G-modules, prove that there is an exact hexagon ; that is, kernel = image at each vertex 
of the diagram 


H°(G, A) - H°(G, B) 



H l (G, B) -=- H l {G.A) 

We remark that this exercise is a key lemma in class field theory. 

(ii) If G is a finite cyclic group and A is a G-module, define the Herbrand quotient by 

h{A) = \H°(G,A)\/\H l (G,A)\ 

[h{A) is defined only when both H°(G, A) and H l (G , A) are finite]. 

Let 0 —^ A —^ B —^ C —^ 0 be an exact sequence of G-modules. Prove that if the 
Herbrand quotient is defined for two of the modules A. B.C, then it is defined for the 
third one, and 

h(B) = h(A)hiC). 


n +1 

P n (G)= (g)ZG, 
1 


10.61 If G is a group, prove that 
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where P n (G) is the /7th term in the homogeneous resolution P.(G) and 
n 

(^) ZG = ZG ®i ZG ®z ■ ■ ■ ®z ZG, 

1 

the tensor product over Z of ZG with itself n times. 

10.62 If G is a finite cyclic group, prove, for all G-modules A and for all n > 1 , that H"(G, A) = 
H n+ i(G.A). 

10.63 Let G be a group. 

(i) Show, for any abelian group A, that A* — Hom^CZG, A) is a left ZG-module. We call 
A* a coinduced module. 

Hint. If <p : ZG —/■ A and g e G, define g<p by jc i—> gip(g~ l x). 

(ii) For any left ZG-module B, prove that Horn zg(B. A*) = Horn z(B, A). 

Hint. Use the adjoint isomorphism, Theorem 8.99. 

(iii) If A* is a coinduced module, prove that H n (G, A*) = {0} for all n > 1. 

10.64 If G is a group and A is an abelian group, call the ZG-module A*ZG ®z A an induced 
module. Prove that H„(G, A*) = {0} for all n > 1. 


10.8 Crossed Products 


This section is essentially descriptive, showing how cohomology groups are used to study 
division rings. Let us begin with a return to Galois theory. 

Theorem 10.128. Let E/k be a Galois extension with Galois group G = Gal ( E/k). The 
multiplicative group E x is a kG-module, and 

H l (G,E x ) = {0}. 

Proof. If c: G —> E x is a 1-cocycle, denote c(o) by c n . In multiplicative notation, the 
cocycle condition is the identity o(c T )cf^c a — 1 for all <r. r e G; that is, 

<J(C T ) = CarCf 1 . ( 1 ) 


Fore e E x , consider 

b — ^ c r r(e). 
r eG 


By independence of characters. Proposition 4.30, there is some e e E x with b f 0. For 
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such an element <?, we have, using Eq. (1), 

o{b) = ^2 cr(c T )(jr(e) 
zeG 

= ^2,c az c~ x o r(e) 

zeG 

= C a l X! c °z<JT(e) 
zeG 

= C~ l c o^(e) 
toeG 


Hence, c a = bo(b) , and c is a coboundary. Therefore, // 1 (G. E x ) = {0}. • 

Theorem 10.128 implies Theorem 4.50, which describes the elements of norm 1 in a 
cyclic extension. 

Corollary 10.129 (Hilbert’s Theorem 90). Let E/k be a Galois extension whose 
Galois group G — Ga \(E/k) is cyclic, say, with generator a. If u e E x , then Nu — 1 if 
and only if there is v e E x with 

u = ct(v)i> -1 . 

Proof. By Theorem 10.112, we have H l (G, E x ) = ker N/imD, where N is the norm 
(remember that E x is a multiplicative group) and De — o(e)e~ 1 . Theorem 10.128 gives 
H l (G, E x ) = {0}, so that ker N = im D. Hence, if u e E x , then Nu — 1 if and only if 
there is v e E x with u — o(v)v~ l . • 

Theorem 10.128 is one of the first results in what is called Galois cohomology. Another 
early result is that H n (G, E) — {0} for all n > 1, where E (in contrast to E x ) is the 
additive group of the Galois extension (this result follows easily from the normal basis 
theorem). We are now going to see that H 2 (G, E x ) is useful in studying division rings. 

Only one example of a noncommutative division ring has been given in the text: the 
quaternions H (this is an R-algebra) and its ^-algebra analogs for every subfield k C R 
(actually, another example is given in Exercise 9.80 on page 740). W. R. Hamilton discov¬ 
ered the quaternions in 1843, and F. G. Frobenius, in 1880, proved that the only ]R-division 
algebras are R, C, and H (see Theorem 9.124). No other examples of noncommutative 
division rings were known until cyclic algebras were found in the early 1900s, by J. M. 
Wedderburn and by L. E. Dickson. In 1932, A. A. Albert found an example of a crossed 
product algebra that is not a cyclic algebra, and in 1972, S. A. Amitsur found an example 
of a noncommutative division ring that is not a crossed product algebra. 

Wedderburn proved that every finite division ring is a field (see Theorem 8.23). Are 
there any division rings of prime characteristic? 

We begin with an elementary calculation. Suppose that V is a vector space over a field 
E having basis \u„ : a e G} for some set G, so that each v e V has a unique expression 
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as an /-’-linear combination v = ac,u a for a n e E. For a function ji: V x V —> V. 
with [r(u a , u T ) denoted by u„ii T , define structure constants g%’ T e E by 


U cj I't x 


E cr,z 

8a “a- 

aeG 


To have the associative law, we must have u a (u T u M ) — (u a u T )u c0 \ expanding this equation 
gives, for all indices, 

Zc'i"e" = Y,4-"C r - 

a,/3 y,8 

Let us simplify these equations. Let G be a group and suppose that g°’ r =0 unless a — 
err; that is, m ct m t = /(er, z)u ar , where /(er, r) = g°'*. The function /: G x G —> E x , 
given by /(er, r) = g/ r r , satisfies the following equation for all er, r, a> e G: 

/(o', r)f(or, w) = f(r, co)f(a, zco), 

an equation reminiscent of the cocycle identity written in multiplicative notation. This is 
why factor sets enter into the next definition. 

Let E/k be a Galois extension with Gal (E/k) — G. and let /: G x G —>• E x be a 
factor set: In multiplicative notation 


/(er, 1) = 1 = /(l, T ) for all er, r e G 
and, if we denote the action of er e G on a e E x by a a , then 

/(a, r)/(crr, co) = /(r, co) a f (a. zco). 

Definition. Given a Galois extension E/k with Galois group G — Gal( E/k) and a factor 
set /: G x G —> E x . define the crossed product algebra (E. G. f) to be the vector space 
over E having basis all symbols {u„ : n <= G | and multiplication 

(au a )(bu T ) — ab a f(a, z)u ar 

for all a, b e E. If G is a cyclic group, then the crossed product algebra ( E , G , /) is called 
a cyclic algebra. 

Since every element in ( E , G, f) has a unique expression of the form the 

definition of multiplication extends by linearity to all of (E. G, /). We note two special 


cases: 


u a b = b a u a \ 

UcrU T — f (a, z)u ar . 
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Proposition 10.130. If E / k is a Galois extension with Galois group G — Gal ( E / k) and 
if f : G x G — > E x is a factor set, then ( E, G, f) is a central simple k-algebra that is 
split by E. 

Proof Denote (E, G, f) by A. First, we show that A is a ^-algebra. To prove that A is 
associative, it suffices to prove that 

au a (bu z cu m ) = ( au a bu z )cu M , 

where a, b, c e E. Using the definition of multiplication, 

auaiburCUo)) = au a {bc z /(r, co)u za> ) 

= a(bc T f(x, to)) a f(o , xco)u azo} 

= ab a c aT f(r, co) cr f(a, rco)u aT(0 . 


We also have 


( au a bu T )cUo> = ab a f(cr, x)u az cu C0 

= ab a f(<j, x)c ar /(err, (i>)Ucr ZU) 

= ab°c ax f(a, r)f (err, co)u aza) . 

The cocycle identity shows that multiplication in A is associative. 

That u i is the unit in A follows from our assuming that factor sets are normalized: 

u\u z — /(1, x)u\ T = u z and u a u\ = f(o , 1)m ct i = u a . 

We have shown that A is a ring. We claim that ku\ = {an i : a e k] is the center Z(A). 
If a e E, then u a au\ — a a u a . If a e k — E G , then a a — a for all o e G, and so 
k C Z(A). For the reverse inclusion, suppose that z = a o u a e Z(A). For any b e E, 
we have zbu i = bu\z. But 

zbu i = a a u a bu\ — q a b a u a . 


On the other hand, 

bu\z — ba a u a . 

For every cr e G, we have a a b a — ba a , so that if ci a f 0, then b n — b. If a ^ 1 and 
H — (o), then E H ^ l?* 1 * = E, by Theorem 4.33, and so there exists b e E with b a b. 
We conclude that z — a\u\. For every a e G, the equation ( a\u\)u a = u a (ci\u\) gives 
a° — ai, and so a\ e E G = k. Therefore, Z(A) = ku\. 

We now show that A is simple. Let I be a nonzero two-sided ideal in A, and choose 
a nonzero ye/ of shortest length; that is, y = c a u„ has the smallest number of 
nonzero coefficients c a . Suppose that y = c a u a + c T u z + • • • has at least two nonzero 
coefficients c a and c z . Since u z u z -i a — fix, x _1 ct)m (T , it follows that yu T -i a e /, and 
that y— c a cf l f(x, x~ l a)~ 1 yu z -i a is an element of / whose length is shorter than that of y 
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(this element is nonzero because the coefficient of u ax -\ a is nonzero). We conclude that y 
has length 1; that is, y — c a u a . Hence, I contains c~ l f(a, cr -1 ) -1 (c a u a )u a -i = u i, the 
identity of A. Therefore, I = A, and A is simple. 

Finally, Theorem 9.127 says that A is split by K , where K is any maximal subfield of 
A. The reader may show, using Lemma 9.117, that Eu\ = E is a maximal subfield. • 

In light of Proposition 10.130, it is natural to expect a connection between relative 
Brauer groups and cohomology. 

Theorem. Let E/k be a Galois extension with G = Gal (E/k). There is an isomorphism 
H 2 (G, E x ) -> Br (E/k) with els/ i-> [(G, E, /)]. 

Sketch of Proof. The usual proofs of this theorem are rather long. Each of the items: 
the isomorphism is a well-defined function; it is a homomorphism; it is injective; it is 
suijective, must be checked, and the proofs are computational. For example, the proof in 
Herstein, Noncommutative Rings covers pages 110 through 116. There is a less computa¬ 
tional proof in Serre, Corps Locaux , pages 164 - 167, using the method of descent. • 

What is the advantage of this isomorphism? In Corollary 9.132, we saw that 

Br (k) = U Br (E/k). 

E/k finite 


Corollary 10.131. Let k be afield. 

(i) The Brauer group Br (k) is a torsion group. 

(ii) If A is a central simple k-algebra, then there is an integer n so that the tensor product 
of A with itself r times (where r is the order of [A] in Br(k)) is a matrix algebra: 

A A A = Mat,, (k). 

Sketch of Proof, (i) Br (k) is the union of the relative Brauer groups Br (E / k), where E/k 
is finite. It can be shown that Br(k) is the union of those Br (E/k) for which E/k is a Galois 
extension. We may now invoke Proposition 10.119, which says that |G| H 2 (G, E x ) — {0}. 
(ii) Tensor product is the binary operation in the Brauer group. • 

Recall Proposition 9.129: there exists a noncommutative division k-algebra over a field 
k if and only if Br(k) {0}. 

Corollary 10.132. Let k be a field. If there is a cyclic Galois extension E/k such that 
the norm N: E x —> k x is not surjective, then there exists a noncommutative k-division 
algebra. 

Sketch of Proof. If G is a finite cyclic group, then Theorem 10.112 gives 
H 2 (G , E x ) = (E X ) G / im N = k x /imN. 

Therefore, Br (E/k) ^ {0} if N is not surjective, and this implies that Br(k) {0}. • 



892 


Homology Ch. 10 


If k is a finite field and E/k is a finite extension, then it follows from Wedderburn’s 
theorem on finite division rings (Theorem 8.23) that the norm N: E x —» k x is surjective. 

Corollary 10.133. If p is a prime, then there exists a noncommutative division algebra 
of characteristic p. 

Proof If A: is a field of characteristic p, it suffices to find a cyclic extension E/k for 
which the norm N: E x —> k x is not surjective; that is, we must find some z e k x which 
is not a norm. 

If p is an odd prime, let k = F p (x). Since p is odd, t 2 — x is a separable irreducible 
polynomial, and so E = k(^/x ) is a Galois extension of degree 2. If u e E, then there are 
polynomials a, b, c e Fp[x] with u — (a + b^/x)/c. Moreover, 

N(u) = (a 2 — b 2 x)/c 2 . 

We claim that x 2 + x is not a norm. Otherwise, 

a 2 — b 2 x — c 2 (x 2 + x). 

Since c ^ 0, the polynomial c 2 (x 2 + x ) f 0, and it has even degree. On the other hand, if 
b 0, then a 2 — b 2 x has odd degree, and this is a contradiction. If h — 0, then u = a/c; 
since a 2 = c 2 (x 2 + x), we have c 2 \ a 2 , hence c | a , and so u e Fp[x] is a polynomial. 
But it is easy to see that x 2 + x is not the square of a polynomial. We conclude that 
N: E x —>■ k x is not surjective. 

Here is an example in characteristic 2. Let k — IIGU'), and let E = k(a), where a is a 
root of fit) — t 2 +1 + x + 1 [fit) is irreducible and separable; its other root is a + 1]. As 
before, each u e E can be written in the form u — (a + ba)/c , where a,b,c e F 2 [x], Of 
course, we may assume that x is not a divisor of all three polynomials a, b and c. Moreover, 

N(u ) = ((a + ba)(a + ba + b))/c 2 — ( a 2 + ab + b 2 (x + 1 ))/c 2 . 

We claim that x is not a norm. Otherwise, 

a ab T b (x ~t~ 1) — c~x. (2) 

Now a( 0), the constant term of a, is either 0 or 1. Consider the four cases arising from the 
constant terms of a and Ir, that is, evaluate Eq. (2) at x = 0. We see that a(0) — 0 — b{ 0); 
that is x | a and x | b. Hence, x 2 | a 2 and x 2 | b 2 , so that Eq. (2) has the form x 2 d = c 2 x, 
where d e F 2 [x]. Dividing by x gives xd — c 2 , which forces c(0) = 0; that is, x | c, and 
this is a contradiction. • 

For further discussion of the Brauer group, see the article by Serre in Cassels-Frohlich, 
Algebraic Number Theory , Jacobson, Basic Algebra II, pages 471^4-81, Reiner, Maximal 
Orders , Chapters 5, 7, and 8, and the article by V. R Platonov and V. I. Yanchevskii, Finite- 
Dimensional Division Algebras, in Kostrikin-Shafarevich, Encyclopaedia of Mathematical 
Sciences, Algebra IX. In particular, a global field is a field which is either an arithmetic 
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number field [i.e., a finite extension of Q] or a function field [a finite extension of k(x), 
where k is a finite field]. To each global field, we assign a family of local fields. These 
fields are best defined in terms of discrete valuations. A discrete valuation on a field L is 
a function v: L x —> N such that, for all a. b e L. 

v(a) = 0 if and only if a = 0; 
v(ab ) = v(a)v(b)\ 
v(a + b ) = max{u(a), u(fr)}. 

Now R = {a e L : v(a ) < 1} is a domain and P — {a e L : via) < 1} is a maximal 
ideal in R. We call R/P the residue field of L with respect to the discrete valuation v. A 
local field is a field which is complete with respect to the metric arising from a discrete 
valuation on it, and whose residue field is finite. It turns out that every local field is either 
a finite extension of Q p , the p-adic numbers (which is the fraction field of the p-adic 
integers Z p ) or it is isomorphic to F g [[x]], the ring of formal power series in one variable 
over a finite field F ? . If & is a local field, then Br(k) = H 2 (k s , k x ), where k s /k is the 
maximal separable extension of k in the algebraic closure k. If A is a central simple K- 
algebra, where K is a global field, and if K v is a local field of K. then K, <g>^ A is a 
central simple K v -algebra. The Hasse-Brauer-Noether-Albert theorem states that if A 
is a central simple algebra over a global field K , then A = K if and only if K v <S)k 
A = K v for all associated local fields K v . We merely mention that these results where 
used by C. Chevalley to develop class field theory [the branch of algebraic number theory 
involving Galois extensions (of possibly infinite degree) having abelian Galois groups]. 
See Neukirch-Schmidt-Wingberg, Cohomology of Number Fields. 

For generalizations of the Brauer group [e.g., Br (k), where k is a commutative ring] and 
ties to Morita theory, see Orzech-Small, The Brauer Group of Commutative Rings, and 
Caenepeel, Brauer Groups, Hopf Algebras, and Galois Theory. 

Exercises 


10.65 Show that the structure constants in the crossed product ( E , G, /) are 

a, r _ [/(^ T > if a = err; 

I 0 otherwise. 

10.66 Prove that 1HI HI = Mat 4 (R). 


10.9 Introduction to Spectral Sequences 

The last topic we mention is spectral sequences, whose major uses are in computing ho¬ 
mology groups and in comparing homology groups of composites of functors. This brief 
section merely describes the setting for spectral sequences, in the hope that it will ease the 



894 


Homology Ch. 10 


reader’s first serious encounter with them. For a more complete account, we refer the reader 
to Mac Lane, Homology, Chapter XI, McCleary, User’s Guide to Spectral Sequences, or 
Rotman, An Introduction to Homological Algebra, Chapter 11. 

Call a series of submodules of a module K, 


K = K q d Ki 2 K 2 2 • • • 2 K e = {0}, 


a filtration (instead of a normal series), and call the quotients Kj / K j+ \ the factor modules 
of the filtration. We know that a module K may not be determined by the factor modules of 
a filtration; on the other hand, knowledge of the factor modules does give some information 
about K. For example, if all the factor modules are zero, then K — {0}; if all the factor 
modules are finite, then K is finite (and |X| is the product of the orders of the factor 
modules); or, if all the factor modules are finitely generated, then K is finitely generated. 

Definition. If K is a module, then a subquotient of K is a module isomorphic to S/T, 
where T C S C K are submodules. 

Thus, a subquotient of K is a quotient of a submodule. It is also easy to see that a 
subquotient of K is also a submodule of a quotient (S/T C K/T). 

Example 10.134. 

(i) All the factor modules of a filtration of a module K are subquotients of K. 

(ii) The nth homology group of a complex (C., d,) is a subquotient of C n . < 

A spectral sequence computes a homology group H n in the sense that it computes the 
factor modules of some filtration of H n . In general, this gives only partial information 
about H„, but, if the factor modules are heavily constrained, then they can give much more 
information and, indeed, might even determine H n completely. For example, suppose that 
only one of the factor modules of K is nonzero, say, Ki/Kj+i = A {0}; we claim that 
K = A. The beginning of the filtration is 


K = K 0 2 K x 2 ••• 2 K{. 


Since Kq/Ki — {0}, we have K — Kq — K\. Similarly, K\/Ki = {0} gives K\ — Ky, 
indeed, K — = K{ — ■ ■ ■ = K,. Similar reasoning computes the end of the filtration. 

For example, since K^\/K^ = {0}, we have K\ — K f = {0}. Thus, the filtration is 

K = K 0 = ■ ■ ■ = Kt D K i+] = ... = K l = {0}, 
and so K = K/{ 0} = Kj/K i+1 = A. 

In order to appreciate spectral sequences, we must recognize an obvious fact: very 
general statements can become useful if extra simplifying hypotheses can be imposed. 

Spectral sequences usually arise in the following context. A bigraded module M — M„ 
is a doubly indexed family of modules M pq , where p,q e Z; we picture a bigraded 
module as a collection of modules, one sitting on each lattice point (p, q) in the plane. 
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Thus, there are first quadrant bigraded modules, for example, with M p q = {0} if either 
p or q is negative; similarly, there are third quadrant bigraded modules. A bicomplex is 
a bigraded module that has vertical arrows d' p : M pq —> M pq -1 making the columns 
complexes, horizontal arrows d' pq : M pq -> M p -\ q making the rows complexes, and 
whose squares anticommute; 




u p-l, q 


dp.q 


M 


p.q 


d" 




p.q 

Mp.q— 1 


that is, d'd" + d"d' = 0. The reason for the anticommutativity is to allow us to define the 
total complex. Tot (M), of a bicomplex M: Its term in degree n is: 

Tot(Af)„ = Y M P-fi 

p+q=n 

its differentiation d„ : Tot(M)„ Tot(M)„_i is given by 

d" = 'Yj d p , q 9rd pq . 

p+q=n 

Anticommutativity forces d n _ i d„ — 0: 

dd = (d' + d")(d' + d") = d'd' + {d'd" + d"d') + d"d" = 0; 
thus. Tot (M) is a complex. 

All bigraded modules form a category. Given an ordered pair of integers ( a , b ), a family 
of maps f pq : M pq -> L p+a q+ f, is called a map f ; M.. -» L„ of bidegree {a, b). For 
example, the maps d' and d" above have respective bidegrees (0, —1) and (—1,0). It is 
easy to check that all bigraded modules and all maps having some bidegree form a category. 
One nice feature of composition is that bidegrees add: if / has bidegree {a, b ) and f has 
bidegree {a ', //), then their composite f'f has bidegree {a + a!, b + b'). Maps of bigraded 
modules are used in establishing certain exact sequences. For example, one proof of the 
five term exact sequence on page 882 uses these maps. 

A spectral sequence is a sequence of bicomplexes, E‘ p q , for all r > 2, where each E r p + q 
is a subquotient of E p (we must also specify that the homomorphisms of the bicomplex 
E r jfi q arise from those of E p q ). Most spectral sequences arise from a filtration of Tot(M), 
where M„ is a bicomplex. In particular, there are two “usual” filtrations (if M.. is either 
first quadrant or third quadrant), and the spectral sequences they determine are denoted by 
l E' p q and ll E' p q . 

We say that a spectral sequence E' p converges to a (singly graded) module H,, denoted 
by E 2 p q => H„, if each H n has a filtration with factor modules 

,n. bL[ p —i, . . . , E n Q, 
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and, for all p. q with p + q = n, the factor module E pq is a subquotient of E 2 p . There 
are two steps to establish before using spectral sequences. 

Theorem I. If M„ is a first quadrant or third quadrant bicomplex, then 

l E 2 p q =► ff„(Tot(M)) and lI E 2 p q =► ff„(Tot(M)). 

Thus, for each n, there are two filtrations of Tot(M)„; one whose factor modules are 
subquotients of 1 E 2 , and another whose factor modules are subquotients of 11 E 2 (as 
usual, p + q = n in this context), and both converge to the same thing. 

Theorem II. If M„ is a first quadrant or third quadrant bicomplex, then there are formulas 
for 1 E 2 q and 11 E 2 for every p,q. 

Theorem II offers the possibility that subquotients of E 2 can be computed. 

We illustrate the technique by sketching a proof that Tor,, (A, B) does not depend on the 
variable resolved; that is, the value of Tor,,(A, B), defined as If (Pa <8> B ), where Pa is 
a deleted projective resolution of A, coincides with Tor„(A, B), defined as H n (A <g> Qg), 
where Qg is a deleted projective resolution of B. The idea is to resolve both variables 
simultaneously, using resolutions of each. Define a first quadrant bigraded module M = 
Pa <S> Qg whose p, q term is P p (g) Q q ; make this into a bicomplex by defining vertical 
arrows d' p q — (—l) p l ® d q : P p <g> Q q —*■ P p <g> Q q -i and horizontal arrows d' p q = 
A p ® 1 '■ P p ® Q q P p -1 <8> Q q , where the d„ are the differentiations in Qg and the 
A„ are the differentiations in Pa (the signs force anticommutativity). The formula whose 
existence is stated in Theorem II for the first spectral sequence l E 2 gives, in this case, 

i e 2 = J {°} if q > 0; 

p ' q jff p (P A ®B) if? = 0. 

Since a subquotient of {0} must be {0}, all but one of the factor modules of a filtration of 
H n (Tot(M)) are zero, and so 

H„ (Tot(M)) = H n ( P A ® B). 

Similarly, the formula alluded to in Theorem II for the second spectral sequence gives 

n E 2 = | {0} if P > 0; 

p ’ q |fl,(A®Qg) if p = 0. 

Again, there is a filtration of //„(Tot(M)) with only one possible nonzero factor module, 
and so 

Hn (Tot(M)) = H n {A <8> Qg). 

Therefore, 

H n ( Pa ® B) = H n (Tot(M)) = H„{ P A ® B). 

We have shown that Tor is independent of the variable resolved. 

Here is a cohomology result illustrating how spectral sequences can be used to compute 
composite functors. The index raising convention extends here, so that one denotes the 
modules in a third quadrant bicomplex by M p ' q instead of by M- p - q . 
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Theorem 10.135 (Grothendieck). Let F : B —> C and G : A —»■ B be additive functors, 
where A, B, and C are module categories. If F is left exact and if E injective in A implies 
(R m F)(GE) = {0 } for all m > 0 ( where R m F are the right derived functors of F), then 
for every module A e A, there is a third quadrant spectral sequence 

Ej' q = (R p F)(R q G(A)) => R"(FG)(A). 

For a proof, see Rotman, An Introduction to Homological Algebra, page 350. 

The next result shows that if A is a normal subgroup of a group n, then the cohomology 
groups of N and of n /N can be used to compute the cohomology groups of n. 

Theorem 10.136 (Lyndon-Hochschild-Serre). Let Y1 be a group with normal sub¬ 
group N. For each Yl-module A, there is a third quadrant spectral sequence with 

E[' q = H'fTl/N, H q (N , A)) =$ H n (Tl, A). 

Proof. Define functors G : znMod -> z(n/W)Mod and F: z(n//v)Mod —> Ab by G = 
Hom,v(Z, ) and F = Homn/iv(Z, ). Of course, F is left exact, and it is easy to see 
that FG — Homn(Z, ). A proof that H'"(Yl/N, E ) = {0} whenever E is an injective 
Il-module and m > 0 can be found in Rotman, An Introduction to Homological Algebra, 
page 307. The result now follows from Theorem 10.135. • 

This theorem was found by Lyndon in his dissertation in 1948, in order to compute the 
cohomology groups of finitely generated abelian groups n. Several years later, Hochschild 
and Serre put the result into its present form. 
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11.1 Local and Global 

Quite often, it is easier to examine algebraic structures “one prime at a time.” Let G 
and H be finite groups. If G St //, then their Sylow p-subgroups are isomorphic for 
all primes p; studying G and H locally means studying their p-subgroups. This local 
information is not enough to determine whether G = H\ for example, S 3 and Ig are 
nonisomorphic groups having isomorphic Sylow subgroups. The global problem assumes 
that the Sylow p-subgroups of groups G and H are isomorphic, for all primes p, and 
asks what else is necessary to conclude that G = H. In the case of groups, this leads 
to the extension problem and cohomology of groups (but even this is inadequate to solve 
the global problem: for example, S 3 and E(, have isomorphic Sylow subgroups and the 
same composition factors). An illustration of the success of this technique is provided by 
finite abelian groups. The local problem involves primary components (Sylow subgroups), 
which are direct sums of cyclic groups, and the global problem is solved by the primary 
decomposition: Every finite abelian group is the direct sum of its primary components. In 
this case, the local information is sufficient to solve the global problem. The advantage 
of the local/global approach is that the local problem is simpler than the global and its 
solution is valuable. We begin this section with another group-theoretic illustration of 
local and global investigation, after which we will consider localization of commutative 
rings. 

Definition. Let R be a domain with Q — Frac(/?). If M is an R-module, define 

rank(M) = dimg(Q A/). 

For example, the rank of an abelian group G is defined as dimQ(Q G). 

Recall that if R is a domain, then an /E-module M is torsion-free if it has no nonzero 
elements of finite order; that is, if r e R and m e M are nonzero, then rm is nonzero. 


898 




Sec. 11.1 Local and Global 


899 


Lemma 11.1. Let R be a domain with Q = Frac( R ) and let M be a torsion-free R- 
module. Then M has rank 1 if and only if it is isomorphic to a nonzero R-submodule 

ofQ. 

Proof. If rank(M) = 1, then M f {0}. Exactness of 0 —* R —> Q gives exactness of 
Tor f(Q/R, M ) —» R ®r M -> Q <S>r M. 

By Lemma 10.101(iii). we have Tor ^{Q/R, M) = tM , the torsion submodule of M, and 
soTor f{Q/R,M) = {0} because M is torsion-free. But Proposition 8.86 gives R®rM = 
M. while Q ®r M = Q because M has rank 1. Therefore, M is isomorphic to an R- 
submodule of Q. 

Conversely, if M is isomorphic to an /?-submodule of Q, there is an exact sequence 
0 —> M —> Q. Since Q is a flat R -module, by Corollary 8.103, we have exactness of 
0-> Q <E)r M —> Q ®r Q. This is an exact sequence of vector spaces over Q, with 
Q <8 )r Q = Q being one-dimensional. Therefore, the nonzero subspace Q <S>r M is also 
one-dimensional; that is, rank(M) = 1. • 

Example 11.2. 

The following abelian groups are torsion-free of rank 1: 

(i) The group Z of integers; 

(ii) The additive group Q; 

(iii) the set of all rationals having a finite decimal expansion; 

(iv) the set of all rationals having squarefree denominator. ■* 


Proposition 11.3. Let R be a domain with Q — Frac (A 1 ). Two submodules A and B of 
Q are isomorphic if and only if there is c e Q with B — cA. 

Proof. If B = cA , then A = B via a ca. 

Conversely, suppose that /: A —> B is an isomorphism. We show first that if a e A is 
nonzero, then / is determined by its values on (a): If g: A —> B and g\{a) — f\(a). then 
f — g. If x e A, then there are r, ,v e R with sx = ra e (a) (because A is a submodule of 
Q ), and so 

f(sx) = f (ra) = rf (a ) = rg(a) = g(ra) = g(sx). 

Hence, s(f(x) — g (x )) = 0, and, since B is torsion-free, we have fix) — g(x). 

If f (a) — b , define c = b/a. In order to show that fix) — cx for all x e A, it now 
suffices to prove that f(ra) = c(ra) for all r e R. But 

f{ra) = rf(a) — rb — r(b/a)a = c(ra). 

It follows that fix) — cx for all x e A and that B — cA. • 
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Definition. For each prime p, we define a subring of Q, 


Z {p) = {a/beQ:(b,p) = l}. 


Proposition 11.4. 

(i) For each prime p, the ring Z( p) is a local 1 PID. 

(ii) If G is a torsion-free abelian group of rank 1, then Z (p ) G is a torsion-free 
Z ( p y module of rank 1. 

(iii) If M is a torsion-free Z( p )-module of rank 1, then M = Z ( p ) or M = Q. 

Proof (i) We show that the only nonzero ideals I in Z(p> are (/?"), for n > 0; it will then 
follow that Z( p _) is a PID and that ( p) is its unique maximal ideal. Since 1( P ) C Q, each 
nonzero x e p~ l Z has the form a/b for integers a and b, where (b, p) = 1. But a = p n a ', 
where n > 0 and (a r , p) = 1; that is, there is a unit u e Z (;) ), namely, u = a' /b, with 
x = up n . Let I f {0} be an ideal. Of all the nonzero elements in /, choose x — up n e /, 
where u is a unit, with n minimal. Then I = (. x ) = (p n ), for if ye/, then y = vp m , 
where v is a unit and n < m. Hence, p" \ y and y e (p n ). 

(ii) Since Z( p ) C Q, it is an additive torsion-free abelian group of rank 1, and so it is flat 
(Corollary 9.6). Hence, exactness of 0 —> G —> Q gives exactness of 

0 -> Z( P ) ®z G -> Z ip) ®z Q- 

By Exercise 11.5 on page 920, Z ^ <S>z Q = Q = Frac(Z(p)), so that Z( p ) <S>z G is a 
torsion-free Z (;)) -module of rank 1. 

(iii) There is no loss in generality in assuming that MCQ and that 1 e M. Consider the 

equations p"y„ — 1 for n > 0. We claim that if all these equations are solvable for y„ e M, 
then M = Q. If a/b e Q, then a/b = a/p n b\ where (b\ p) — 1, and so a/b = ( a/b')y n \ 
as a/b' e Zq,), we have a/b e M. We may now assume that there is a largest n > 0 
for which the equation p"y„ — 1 is solvable for y n e M. We claim that M = {y„), 
the cyclic submodule generated by y n , which will show that M = Z( p ). If m e M, then 
m — c/d — p r c'/p s d' = {c'/d'){\/p s - r ), where (c', p) — l — (d\ p). Since c'/d 1 is a 
unit in Z( p) , we have 1 /p s ~' e M, and so s — r < n; that is, s — r = n — £ for some I > 0. 
Hence, 1 /p s ~ r — 1 /p n ~ l = p l /p n — p l y n - and so m = ( c'p l /d')y n e {y n }. • 

Definition. A discrete valuation ring , abbreviated DVR, is a local PID that is not a field. 
For example, Z( P ) is a DVR. 

1 Recall that a local ring is a commutative ring having a unique maximal ideal. Most authors insist that local 
rings are noetherian [%(p) is even a PID]. Other authors allow local rings to be noncommutative, defining a ring 
R to be local if it has a unique maximal left ideal m. In this case, m = J(R), the Jacobson radical, so that it is a 
two-sided ideal. 
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Definition. Two torsion-free abelian groups of rank 1, G and H, are locally isomorphic 
if X( P ) <g>z G = Z ( P ) H for all primes p. 

We have solved the local problem for torsion-free abelian groups G of rank 1; associate 
to G the family Z ( P ) G of Z( p ) -modules, one for each prime p. 

Example 11.5. 

Let G be the subgroup of Q consisting of those rationals having squarefree denominator. 
Then G and Z are locally isomorphic, but they are not isomorphic, because G is not finitely 
generated. ■* 

We now consider the global problem for torsion-free abelian groups of rank 1. 

Definition. Let G be an abelian group. If x € G and p is a prime, we say that x is 
divisible by p" in G if there exists y„ e G with p"y„ — x. Define the p-height of x, 
denoted by h p (x), by 


I oo if x is divisible by p n in G for all n > 0 
k if x is divisible by p in G but not by p + . 

The height sequence (or characteristic ) of x in G, where x is nonzero, is the sequence 

X(x) = Xg(x ) = (h 2 (x), hj,(x), h 5 (x),..., h p (x), ...). 

Thus, x(x) is a sequence (h p ), where h p — oo or h p e N. Let G c Q and let x e G 
be nonzero. If/(x) = ( h p ) and a — p[ l ■ ■ ■ pj" , then ^x e G if and only if f Pi < h Pi for 
i = \,... ,n. 


Example 11.6. 

Each of the groups in Example 11.2 contains x = 1. 

(i) In Z, 

Xz (l) = (0,0, 0,...). 


(ii) In Q, 


Xq(1) = (00,00,00,...). 


(iii) If G is the group of all rationals having a finite decimal expansion, then 


Xg( 1) = (oo, 0, oo, 0, 0,...). 


(iv) If H is the group of rationals having squarefree denominators, then 


xh( 1) = (i.i.i,...). « 
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Different elements in a torsion-free abelian group of rank 1 may have different height 
sequences. For example, if G is the group of rationals having finite decimal expansions, 
then 1 and ^ lie in G, and 

X(l) = (oo, 0, oo,...) and x(^) = (o°, 2, oo, 1,0, 0,...). 

Thus, these height sequences agree for infinite p-heights, but they disagree for two finite 
p -heights. 

Definition. Two height sequences (h 2 ,I 13 ,..., h p ,.. .) and (k 2 , £ 3 ,..., k p ,.. .) are equiv¬ 
alent , denoted by 

(h 2 , h-i, - h p ,...)~ (k 2 , k 2 , - k p ,...), 

if there are only finitely many p for which h p ^ k p and, for such primes p, neither h p nor 
k p is 00 . 

It is routine to see that equivalence is, in fact, an equivalence relation. 

Lemma 11.7. If G is a torsion-free abelian group of rank 1, and if x, y e G are nonzero, 
then their height sequences x (at) and x (y) are equivalent. 

Proof. We may assume that G C Q. If b = p j 1 • • • p’ 1 , then it is easy to see that 
hp(bx) — hp(x) for all p {pi,..., p„}, while 

h Pi (bx) = ei + h Pi (x) 

for i = 1 ,... ,n (we agree that ei+oo — 00 ). Hence, x (x) ~ x (bx). Since x, y e G C Q, 
we have x/y = a/b for integers a,b, so that bx — ay. Therefore, x(x) ~ x(bx) = 

x(ay) ~ x(y)- • 

Definition. The equivalence class of a height sequence is called a type. If G is a torsion- 
free abelian group of rank 1, then its type, denoted by r (G), is the type of a height sequence 
X(x), where x is a nonzero element of G. 

Lemma 11.7 shows that r(G) depends only on G and not on the choice of nonzero 
element x e G. We now solve the global problem. 

Theorem 11.8. If G and H are torsion-free abelian groups of rank 1, then G = H if and 
only ifr(G) = r (H). 

Proof. Let (p: G -> H be an isomorphism. If x e G is nonzero, it is easy to see that 
X(x) = X (tp(x)), and so r(G) = r(H). 

For the converse, there is no loss in generality in assuming that both G and H are 
subgroups of Q. Choose nonzero x e G and y e H. By the definition of equivalence, 
there are primes p\,..., p n ,qi,..., q m with h pi (x) < h pi (y) < 00 , with 00 > h qj (x) > 

h qj (>’), and with h p (x) — h p (y) for all other primes p. Define b = Y\p i P '^ ^ Then 



Sec. 11.1 Local and Global 


903 


bx e G and h Pi (bx) = (h Pl (y) — h Pi (x)) + h Pi (x) — h Pi (y). A similar construction, 

hq ■ (x)—h q . (j) 

using a — q ■ 1 1 , gives y (bx) = y (ay). We have found elements x — bx e G 

and y' = ay e H having the same height sequence. 

Define tp: G -> Q by tp(g) = ^ T g . It is obvious that tp is an injective homomorphism. 
We claim that im tp C H. Since every g e G can be written as g — ^x', it suffices to show 
that if *x' e G, then tpO/x') = e H. But if c = p(' ■ ■ ■ pj ', then \ x' e G if and 
only if f p < h p (x'). Since y(x') = y(y'), \y’ e H if and only if f p < h p (y') = h p (x'). 
Thus, we may view q> as a map G —> H. Finally, to see that cp is a surjection, note that its 
inverse is given by h yh, which is a map H —*■ G. • 

The uniqueness theorem just proved is complemented by an existence theorem. 

Proposition 11.9. Given a height sequence (kj, ky ...., k p ....), where 0 < k p < oo, 
there exists a unique subgroup GcQ containing 1 with h p ( 1) = k p for all p. Thus, given 
any type r, there exists a torsion-free abelian group G of rank 1, unique to isomorphism, 
with x (G) = r. 

Proof. Define 


D = [a e Z : a — ]~[ pf with 0 < e, < k Pi for all i } 

(if k Pi — oo, then 0 < e,- < k pj means that e; e N), and define 
G = {m/a e Q : m e Z and a e D}. 

To see that G is a subgroup of Q, it suffices to prove that it is closed under addition. Let 
m/a and n/b be in G, where a — Y\p e ;',b = J”[ pf' , and max{e,-, f] < k p y, that is, 
[a, b] e D. Now 

m n ma' + nb' 
a b [a, b] 

where [a,b] — lcm{a,/7} = fl p™ m ^ e '’I‘\ a’ — [a,b]/a , and b’ = [a,b\/b. Since 
[a,b] e D, we have m/a + n/b e G. It is clear that h p ( 1) = k p for all p , and so 
r(G) = r. 

Let us now prove uniqueness. Let G and H be subgroups of Q containing 1 with 
X/y (1) = xg(!)• Suppose that m/d e H is in lowest terms—(m, d) = 1. Then there are 
integers s and t with 1 = sm + td, so that l/d — s (m/d) + td/d e H. On the other hand, 
1/d e G, by definition of height sequence. It follows that H C G, for H is generated by 
all elements of the form 1 /d. The reverse inclusion is proved similarly, and so G = H. • 

Corollary 11.10. 

(i) There are uncountable many nonisomorphic subgroups of Q. 

(ii) If R is a subring of Q, then the height sequence of 1 consists of 0 ’s and oo ’s. 
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(iii) There are uncountably many nonisomorphic subrings ofQ. In fact, distinct subrings 
of Q are not isomorphic as rings. 

Proof (i) Given any type r, Proposition 11.9 provides a torsion-free abelian group G of 
rank 1 with r(G) — r. But there are uncountably many types; for example, two height 
sequences of 0’s and oo’s are equivalent if and only if they are equal. 

(ii) If hp(l) > 0, then ^ e R. Since R is a ring, (Jj J = -^ e R for all n > 1, and so 
hpi 1) = oo. 

(iii) If R and 5 are distinct subrings of Q, then the height sequences of 1 are distinct, by 
part (ii). Both statements follow from the observation that two height sequences whose 
only terms are 0 and oo are equivalent if and only if they are equal. • 

A. G. Kurosh classified torsion-free abelian groups G of finite rank n with invariants 
n = rank(G), dim(F p <g> G) for all primes p, and an equivalence class of sequences (M p ), 
where M p is an n x n nonsingular matrix over the p-adic numbers Q p (this theorem is not 
easy to use, for it is almost impossible to determine whether two groups have equivalent 
matrix sequences). It is easy to see that every such group G is a direct sum of indecom¬ 
posable 2 groups; however, there is virtually no uniqueness for such a decompostion. For 
example, there exists a group G with 

G — A\ ® A 2 — B\ ® B 2 © B 3 , 

with all the summands indecomposable, with rank(Ai) = 1, rank(A 2 ) = 5, and with 
rank(Bj) = 2 for j = 1,2,3. Thus, the number of indecomposable summands in a 
decomposition is not uniquely determined by G, nor is the isomorphism class of any of 
the indecomposable summands. Here is an interesting theorem of A. L. S. Corner (that 
can be used to produce bad examples of torsion-free groups such as the group G just 
discussed). Let R be a ring whose additive group is countable, torsion-free, and reduced (it 
has no nonzero divisible subgroups). Then there exists an abelian group G, also countable, 
torsion-free, and reduced, with End(G) = R. Moreover, if the additive group of R has 
finite rank n, then G can be chosen to have rank 2 n. For a proof, see Fuchs, Infinite 
Abelian Groups II, page 231. 

The local approach to commutative rings generalizes the construction of the local rings 
Z( P ) from Z. Given a subset S of a commutative ring R closed under multiplication, most 
authors construct the localization S~ l R by generalizing the (tedious) construction of the 
fraction field of a domain R. They define a relation on R x S by (r, a) = (r', o') if there 
exists a" e S with a”(res' — r' 0 ) — 0 (this definition reduces to the usual definition involv¬ 
ing cross multiplication when R is a domain and S is the subset of all nonzero elements). 
After proving that this is an equivalence relation, S~ 1 R is defined to be the set of all equiva¬ 
lence classes, addition and multiplication are defined and proved to be well-defined, all the 
/^-algebra axioms are verified, and the elements of S are shown to be invertible. In other 

2 An abelian group G is indecomposable if there do not exist nonzero groups A and B with G = A © B. 
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words, we regard the elements of S~ l R as fractions with denominators in S. We prefer to 
develop the existence and first properties of S~ l R in another manner, which is less tedious 
and which will show how the equivalence relation generalizing cross multiplication arises. 

Definition. Let R be a commutative ring and let S be any subset of R. A localization 
of R is an /^-algebra S~ x R and an /^-algebra map li: R —*■ S~ l R, called the localization 
map , such that h(s) is invertible in S~ l R, for every ,v e S , and S~ l R is a solution to the 
following universal mapping problem. 



R' 


If R' is a commutative /^-algebra and ip: R —»• R' is an /^-algebra map for which <p{s) is 
invertible in R' for all s e S, then there exists a unique R -algebra map ip: S~ l R R' 
with <ph — <p. 

The localization 5“ 1 R, as any solution to a universal mapping problem, is unique up to 
isomorphism if it exists. 

Theorem 11.11. For every subset S of a commutative ring R, the localization S~ l R 
exists. 

Proof. Let X — {xj : s e S} be a set with x s i-> sa bijection X -> S. and let /?[X] be 
the polynomial ring over R with variables X. Define 

S~ l R = R[X]/I, 

where I is the ideal generated by \sx s — 1 : s e S}, and define h: R —> S~ l R by 
h: r i-^ r + /, where r is a constant polynomial. It is clear that S~ l R is an /^-algebra, 
that h is an R -algebra map, and that each h(s) is invertible. Assume now that R' is an 
/^-algebra, and that <p: R —> R' is an /^-algebra map with <p(s) invertible for all s e S. 
Consider the diagram in which the top arrow i: R -> R [X] sends each r e R to the 
constant polynomial r and v: R\X] —> S~ l R is the natural map. 



The top triangle commutes because both h and vi send r e R to r + I. Since /^l X] is 
the free commutative /^-algebra on X , there is an /^-algebra map (po: R\ X] —> R' with 
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(po(x s ) = <p(s)~ l for all s e S. Clearly, I C ker^o, for (po(sx s — 1) = 0, and so there is 
an /^-algebra map <p: S~ l R — R[X]/I —> R' making the diagram commute. That <p is 
the unique such map follows from S~ l R being generated by im h U {h(s)~ l : s e .S’} as an 
/^-algebra. • 

The next definition is natural in this context, for if s, s' are invertible elements in some 
commutative ring, then their product ss' is also invertible. 

Definition. A subset S of a commutative ring R is multiplicatively closed if 1 e S and 
s,s' e S implies ss' e S. Every commutative ring is a multiplicative monoid. If S is any 
(possibly empty) subset of R , then 

S — the submonoid of R generated by S. 

We call 5 the multiplicatively closed subset generated by S. 

Exercise 11.8 on page 920 says that (S)~ l R = S~ l R. 

Example 11.12. 

(i) If R is a commutative ring and s e R, then (,v} = {1, s, s 2 , s 2 , ...} is the multiplicatively 
closed set generated by the element s. 

(ii) If p is a prime ideal in R , then a £ p and h f p imply ab ^ p. In other words, the 
complement R — p is multiplicatively closed. 

(iii) Let P be the set of all primes in Z. If Sc. P, then 

S = [p\ l ■ ■ ■ pi" : Pi e S and a > 0}. ◄ 

We now describe the elements in S~ l R. 

Proposition 11.13. If S is a subset of a commutative ring R, then each y e W 1 R has a 
(not necessarily unique) factorization 

y — h(r)h(o )~ l , where r e R and a e S. 

Proof. The existence theorem constructs S~ [ R as R\X]/1, where X = {x s : ,y e .S - | and 
I — — 1 : s e S}. Thus, each v e S~ l R has the form y — f(x i,..., x„) + /, where 

Xi = x Sj for some s, e S. The proposition is proved by induction on n > 0. If n — 0, 
then / e R and y = h(f). For the inductive step, let y = f(x i,..., x n ) + I . Write 
(xi,, x n -i) — X, x„ — x. and 

f(X, x„) = g 0 (2f) + g\(X)x H-h g m (X)x m , 

where gi(X) e /?[X], In .S’~ 1 R. we have x = h(s)~ l for some s e S and, by induction, 
gi(X) = h(rj)h(cri)~ 1 , where r, e R and a/ e S. Therefore, 

y = h(r 0 )h(ao)~ l + h(n)h(o\)~ l h(s)~ l H-h h(r m )h(o m )~ x h(s)~ m 

= h(s)-'"(h(r Q )h(o Q r l h(s) m + h(ri)h(ai)~ l h(s) m ~ l H-1- h(r m )h(o m )~ l ) 

— h(r')h(cr)~ l i 
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where r' e R and o — oqo\ ■ ■ ■ o m s m e S. Therefore, y — h(r')h(o) 1 . • 

In light of Proposition 11.13, the elements of S~ l R can be regarded as “fractions” 
h(r)h(o)~ l , where r e R and o e S. 

Notation. Let h: R —> S~ 1 R be the localization map. If r e R and n e S, define 

r/o = h(r)h(p)~ l . 


In particular, r/1 = h(r). 

Is the localization map h: r i-> r/1 an injection? The easiest example in which h has a 
kernel occurs if 0 e S (after all, S is allowed to be any subset of R ). If 0 is invertible, then 
0 = 00“ 1 = 1, and so S~ l R is the zero ring. Thus, h: R -> S~ l R is the zero map, and 
hence it is not injective unless R is the zero ring. The next lemma investigates ker/z. 

Proposition 11.14. If S is a subset of a commutative ring R, and ifh: R -* S -1 R is the 
localization map, then 


ker/z = [r e R : err = 0 for some o e 5}. 

Proof. If or — 0, then 0 = h(o)h{r) in S~ l R. Since h(o) is a unit, we have 0 = 
h(o)~ l h(o)h(r) = h(r ), and so r e ker/z. 

Conversely, suppose that h(r) = 0 in .S’~ 1 R. Since S~ l R = R[X]/I , where I = 
(sx s — 1 : s € S), there is an equation r = fi(X)(siX Sj — 1) in /?[X], If Sq — 

{si,..., s„}U{nonzero coefficients of all fi(X)} and Iiq: R (So) -1 R is the localization 
map, then r e ker/zo- In fact, if .v = si - ■ - s n and It: R —» {.v 1 R is the localization map, 
then every h'(si) is invertible, for si -1 = • • • j} • • • s„. Now {s} -1 R — /?[x]/(sa' — 1), 

so that r e ker/z' says that there is f(x) = Y17=o a ’ x ‘ with 

m m 

r — f(x)(sx — 1) = ajX 1 )(sx — 1) = y~'(zza,x i+1 — a,x ! ) in /?[x], 
i=0 !=0 

Expanding and equating coefficients of like powers of x gives 

v — ao, sgq — , ..., saffi —j — ciffi ^ sciyyi — 0. 

Hence, sr = —sao = —a\, and, by induction, s l r = —ai for all i. In particular, s m r = 
—a m , and so s m+l r — —sa m — 0, as desired. • 

When are two ‘fractions” r/o and f jo’ equal? 

Corollary 11.15. Let S be a subset of a commutative ring R. Ifr/o, r’/o' e S 1 R, where 
er, o' € S, then r/o — r'/o' if and only if there exists o" e S with o"(ro' — r'o) — 0 
in R. 
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Remark. If S contains no zero divisors, then a"(rex'—r'a) — 0 if and only if ra’ — r'a = 
0, because a" is a unit, and so ro’ — r'a. ◄ 

Proof. If r/a = r'/a' , then multiplying by oo' gives (ra'—r'a)/ 1 = 0 in S~ l R. Hence, 
ra' — r'a e ker/z, and Proposition 11.14 gives a" e S with a"(ra' — r’a) — 0 in R. 

Conversely, if a"(ra' — r'a) = 0 in R for some a" e S , then h(a")h(ra’ — r'a) = 0 
in S _l R. As h(a") is a unit, we have h(r)h(a') — h(r')h(a); as h(a) and h(a') are units, 
h(r)h(a)~ l — h(r')h(a')~ l ; that is, r/a — r'/a'. • 

Corollary 11.16. Let S be a subset of a commutative ring R. 

(i) If S contains no zero divisors, then the localization map h : R —> S -1 R is an injec¬ 
tion. 

(ii) If R is a domain with Q = Frac( R), then S~ l R C Q. Moreover, if S = R — {0}, 
then S~ l R — Q. 

Proof, (i) This follows easily from Proposition 11.14. 

(ii) The localization map h: R —> S~ l R is an injection, by Proposition 11.14. Consider 
the diagram 



Q 


where tp is the inclusion. If lp(h(r)h(a)~ l ) — 0, then Ip(h(r )) = 0, because h(a) is a unit 
in S~ l R. But commutativity of the diagram gives cph(r) — <p(r). As cp is an injection, 
r — 0; hence, h(r)h(a)~ l — 0, and so <p is an injection. • 

As a consequence of Corollary 11.16(ii), when R is a domain and S is a multiplicatively 
closed subset not containing 0, then S~ i R consists of all elements a/s e Frac( R) with 
a e R and s e S. 

Let us now investigate the ideals in S~ l R. 

Definition. If S is a subset of a commutative ring R, and if I is an ideal in R, then we 
denote the ideal in S~ l R generated by h(I) by S~ l I. 


Example 11.17. 

(1) If S is a subset of a commutative ring R, and if I is an ideal in R containing an element 
a e S — that is, I (T S 0, then S~ l I contains a/a — 1, and so S~ { I — S~ l R. 

(ii) Let S consist of all the odd integers [that is, S is the complement of the prime ideal 

(2) ], let I — (3), and let I' — (5). Then S -1 / = S~ l Z — S~ l I'. Therefore, the function 
from the ideals in Z to the ideals in S~ l Z — Z( 2 ), given by / i-> S~ l I, is not injective. 
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In the next corollary, we will see an improvement when we restrict our attention to prime 
ideals contained in (2). < 

Corollary 11.18. Let S be a subset of a commutative ring R. 

(i) Every ideal J in S _l R is of the form S~ l I for some ideal I in R. In fact, if R is a 
domain and I = J D R, then J = S~ l I\ in the general case, if I = h~ l (h(R) fl J), 
then J — S~ l I. 

(ii) If I is an ideal in R, then S~ l I — S~ l R if and only if I fl S ^ 0. 

(iii) If q is a prime ideal in R with q fl S = 0, then S _1 q is a prime ideal in S~ 1 R. 

(iv) The function q i-> 5 _l c| is fl bijection from the family of all prime ideals in R that 
are disjoint from S to Spec (S~ i R). 

(v) If R is noetherian, then S~ l R is also noetherian. 

Proof, (i) Let J = (jx : X e A). By Proposition 11.14, we have jx — h(rx)h(ox)^ ] , 
where r\ e R and o, e S. Define I to be the ideal in R generated by \r-, : X e A}; that is, 
I — h~ 1 (h(R) fl J). It is clear that S~ l I — /; in fact, since all <j\ are units in S~ l R, we 
have J = (ji{rx) '■ k £ A). 

(ii) If a e I fl S, then a/1 e S~ l I. But a/1 is a unit in S~ l R, and so S~ l I — S~ l R. 
Conversely, if S~ 1 / = S~ l R, then h(a)h(o )~ 1 = 1 for some a e I and n e S. Therefore, 
a — a e ker/z, and so there is o" e S with o "(a — a) — 0. Therefore, a"a = o"a e I. 
Since S is multiplicatively closed, a"a e I fl S. 

(iii) Suppose that q is a prime ideal in R. First, S -1 q is a proper ideal, for q fl S = 0. 
If {a/o){b/r) = q/a), where a,b e R and o,r,co e S, then there is a" e S with 
a"(toab — arq) = 0. Hence, o"coab e q. Now a"a> f. q (because o"a> e S and 
S D q = 0); hence, ab e q (because q is prime). Thus, either a or b lies in q, and either 
a/ct or b /r lies in S~* q. Therefore, S -1 q is a prime ideal. 

(iv) Suppose that p and q are prime ideals in R with S -1 p = S -1 q; we may assume that 
pfiS = 0 = qnS. If a e p, then there is b e q and a e S with a/1 = b/o. 
Hence, a a — b e ker/z, where h is the localization map, and so there is o' e S with 
o' oa — o'b e q. But o'a e S, so that o' a f. q. Since q is prime, we have a e q; that is, 
pc q. The reverse inclusion is proved similarly. 

Let be a prime ideal in S~ l R. By part (i), there is some ideal I in R with = S~ l I. 
We must show that I can be chosen to be a prime ideal in R. Now hiR) fl 'ip is a prime 
ideal in h(R), and so p = h~ [ (h(R) fl (P) is a prime ideal in R. By part (i), *p = S -1 p. 

(v) If / is an ideal in .W 1 R. then part (i) shows that / = W 1 / for some ideal / in R. Since 
R is noetherian, we have I — ir\,..., r„), and so J = (r\/l,... ,r n /\). Hence, every 
ideal in S~ 1 R is finitely generated, and so S~ l R is noetherian. • 


Definition. If p is a prime ideal in a commutative ring R, then the complement S = R — p 
is multiplicatively closed, and S~ l R is denoted by R p . 
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Example 11.19. 

If p is a prime in Z, then p = ( p ) is a prime ideal, and Z p = Z( P ). ◄ 

Proposition 11.20. If R is a domain, then f~j m R m = R, where the intersection is over 
all the maximal ideals m in R. 

Proof. Since R is a domain, R m C Frac ( R ) for all m, and so the intersection in the 
statement is defined. Moreover, it is plain that R C R m for all m, so that R C p| m R m . 
For the reverse inclusion, let a e f~j R m . Define 

I = (R : a) = {r e R : ra e R}. 

If I — R, then 1 e /, and a = la e R, as desired. If I is a proper ideal, then there 
exists a maximal ideal m with / Cm. Now a/1 e R m , so there is r e R and of m with 
a/1 = r/er; that is, oa = r e R. Hence, a e I C m, contradicting o f m. Therefore, 

R — rim Rm- • 

The next proposition explains why S~ l R is called localization. 

Proposition 11.21. If p is a prime ideal in a commutative ring R, then Rp is a local ring 
with maximal ideal p/? p = {r/s : r e p and s f p}. 

Proof If x e Rp, then x = r/s, where r e R and s f p. If r f p, then r/s is a unit in 
Rp ; that is, all nonunits lie in pRp. Hence, if I is any ideal in R p that contains an element 
r/s with r f p, then I = Rp. It follows that every proper ideal in Rp is contained in p Rp, 
and so Rp is a local ring with unique maximal ideal p Rp. • 

The fundamental assumption underlying the local/global strategy is that the local case 
is simpler than the global. The structure of projective modules over a general ring can be 
quite complicated, but the next proposition shows that projective modules over local rings 
are free. 

Lemma 11.22. Let R be a local ring with maximal ideal m. An element r e R is a unit 
if and only if r f m. 

Proof. It is clear that if r is a unit, then r f m, for m is a proper ideal. Conversely, assume 
that r is not a unit. By Zorn’s lemma, there is a maximal ideal containing the principal ideal 
(r). Since R is local, there is only one maximal ideal, namely, m, and so r e m. • 

Proposition 11.23. If R is a local ring, then every finitely generated 3 projective R- 
module B is free. 

Proof. Let R be a local ring with maximal ideal m, and let {Z?i, be a minimal set 

of generators of B\ that is, B cannot be generated by fewer than n elements. Let F be the 

3 It is a theorem of Kaplansky that the finiteness hypothesis can be omitted: Every projective module over a 
local ring is free. He even proves freeness when R is a noncommutative local ring. 
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free R -module with basis x\,... ,x„, and define <p: F —>• B by ip(xi) — /?, for all i. Thus, 
there is an exact sequence 


0-> K 


B -* 0, 


( 1 ) 


where K — ker q>. 

We claim that K C mf. If, on the contrary, K C m F, there is an element y = 
Y^l= i r i x i e K which is not in mf; that is, some coefficient, say, r\ f. m. Now r\ 
is a unit, by Lemma 11.22. Now y e K — ker q> gives = 0. Hence, b\ — 

—r j _1 (Y^l = 2 r ibi), which implies that B = (£> 2 , ■.., b n ), contradicting the minimality of 
the original generating set. 

Returning to the exact sequence (1), projectivity of B gives F — K © B' , where B' is 
a submodule of F with B' = B. Hence, m F = m K © m B'. Since m/if C K C m F, 
Corollary 7.18 gives 

K = mK © (KHmBf. 

But K D m B' C K (T B' = {0}, so that K — mK. The submodule K is finitely generated, 
being a summand (and hence a homomorphic image) of the finitely generated module F, so 
that Nakayama’s lemma (Corollary 8.32) gives K = {0}. Therefore, (p is an isomorphism 
and B is free. • 

Having localized a commutative ring, we now localize its modules. If M is an //-module 
and .s' e R, let // A denote the multiplication map M -> M defined by m ho* sm. Note that 
if 5 is a subset of R. then /i v : M —> M is invertible for every ,v e S if and only if M is an 
S ~ 1 R-module. 

Definition. Let R he a commutative ring and let S be any subset of R. A localization of 
an //-module M is an S -1 //-module S~ l M (i.e.., ji s : S~ l m —»• S~ l M is invertible for all 
s e S) and an //-map h m : M -> S~ l M , called the localization map , which is a solution 
to the following universal mapping problem: 

M - - - S~ l M 

N *\ A * 

M' 


If (p: M —> M' is an //-map, where M' is an S 1 //-module, then there is a unique S 1 R- 
map ip: S~ l M —> M' making the diagram commute. 

The obvious candidate for S~ l M —namely, .S’~ 1 // M —is, in fact, its localization. 

Proposition 11.24. Let R be a commutative ring, let S be any subset of R, and let M 
be an R-module. Then S~ l R <Sir M and the R-map h: M —> S~ [ R M, given by 
m (->■ 1 ® m, is a localization of M. 
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Proof. Let ip: M —»■ M' be an R-map, where M' is an S _l R-module. The function 
S~ l R x M —> M', defined by (r/cr, m) i-> (r/a)(p(m), where r e R and o e 5, is 
easily seen to be R -bilinear. Hence, there is a unique R-map <p : 5 _l R ®r M —> M' with 
<ph — (p. Since h(M) generates S~ l R M, <p is the unique R-map making the diagram 
commute. We let the reader check that <p is an R-map. • 

One of the most important properties of S _l R is that it is flat as an R-module. To prove 
this, we first generalize the argument in Proposition 11.14. 

Proposition 11.25. If S is a subset of a commutative ring R, if M is an R-module, and if 
h m : M —> S~ l M is the localization map, then 

kerliM — {m e M : am = 0 for some a e S}. 

Proof Denote [m e M : am = 0 for some a e S} by K. If am — 0, for m e M 
and a e S, then — (l/cr)hM(am) = 0, and so K C ker/z*/- For the reverse 

inclusion, proceed as in Proposition 11.14: If m e K , there is a e S with am = 0. 
Reduce to the case S — {it} for some a e S, so that S~ { R — R[x]/(ax — 1). Now 
R[x]<S)rM = Rx‘ ® R M, because R[.r] is the free R-module with basis {1, x, x 2 ,...}. 
Hence, each element in R[x]®rM has a unique expression of the form JT x l ®nii, where 
nij e M. In particular, if m e kcr/zv/, then 

n n 

0 = 1 <g> m — (ax — 1) ® nij = ^^(ctx' + 1 ® m,- — x l <8> m,). 

1=0 1=0 

The proof now finishes as the proof of Proposition 11.14. Expanding and equating coeffi¬ 
cients gives equations 


1 ® m = — 1 ® iuq, x ® aniQ = x ® ni{, ..., 

x n ® am ,,-1 — x n ® m n , x " +1 ® am,, = 0 . 


It follows that 


m =—mu, erm 0 =: mi, ... am„-\—m„, am n =0. 

Hence, am — —aniQ — —mi, and, by induction, a‘m — —nij for all i. In particular, 
a n m = —m n and so a" +l m = —am,, = 0 in M. Therefore, ker/?,v/ c K, as desired. • 


Corollary 11.26. Let S be a subset of a commutative ring R and let M be an R-module. 

(i) Every element u e S~ l M has the form u — a~ 1 mfor some a e S and some m e M. 

(ii) if'mi = if *; 7 Z 2 in S~^M if and only ifa(sf l m\ — i7*?zz2) in M for some a e S. 
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Proof, (i) If u e S 1 M, then u = ^,(r,/cr,)w ; -, where r, e R, o, e S, and m, e M. If 
we define o — ]~[ oy and ay = n jjti °j^ then 

u — y^(l /(Ti)rjmj 

— (l/g) y ^CTinnij 

— (1 /a)m, 

where m — ^ ayrymy e M. 

(ii) If a e 5 with afomi — si»! 2 ) = 0 in M, then (a/l)(s 2 »M — simf) — 0 in S~ l M. As 
a/1 is a unit, S 2 »M — Sim 2 = 0, and so sj -1 ;;!] = sf l ni 2 - 

Conversely, if sf x m\ — sf l mi in S~ l M. then (1 /s\S 2 )(s 2 >n\ — s\ni 2 ) — 0. Since 
l/siS 2 is a unit, we have (s 2 m\ — s\ni 2 ) — 0 and S 2 >n\ — s\ni 2 e ker Iim- By Proposi¬ 
tion 11.25, there exists o e S with o{s 2 m\ — s\ni 2 ) = 0 in M. • 

Corollary 11.27. Let S be a subset of a commutative ring R. If A is an S 1 R-module, 
then A = S -1 A. 

Proof. Define <p: A —>• S _I A by a i-> 1 ® a. If <p(a) = 0, then there is a e S with 
a a = 0. Since a is a unit in S -1 R and A is an S -1 /?-module, the equation a — cr~ l oa — 
0 makes sense in A. Hence, <p is an injection. To see that q> is a surjection, note that 
(1/a) ® a — (p(o~ l a). • 

Theorem 11.28. If S is a subset of a commutative ring R, then S~ l R is aflat R-module. 
Proof. We must show that if 0 -> A —B is exact, then so is 

0 -> S~ 1 R®r A l -H S~ 1 R®r B. 

Let u e ker(l ® /); by Corollary 11.26, it = a -1 <g> a for some a e S and a e A. Now 
0 = (1 <g> f)(u) = a^ 1 <g> /(a), so that f(a) e ker Iim- By Proposition 11.25, there is 
r e S with 0 = r fa = f(ra). Thus, r a e ker / = {0}, because / is an injection. 
Therefore, 0 = 1 <g> ra — r(l ® a) = ru. Finally, u = 0, because r is a unit. Therefore, 
1 ® / is an injection, and so S~ l R is a flat /?-module. • 

Corollary 11.29. If S is a subset of a commutative ring R, then localization M i—>- 
S~ l M — S~ l R (S)r M defines an exact functor y?Mod —*■ 5 i^Mod. 

Proof. Localization is the functor S~ l R<8 ir , and it is exact because S~ l R is a flat R- 
module. • 

Notation. In the special case S = R — p, where p is a prime ideal in R. we write 

S~ l M = M p . 
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If /: M —>■ N is an /Nmap, write / p : M p -> Np. where / p = 1 ® /. 

We restate Corollary 11.18(iv) in this notation. The function q i-> q p is a bijection from 
the family of all prime ideals in R that are contained in p to Spec(/? p ) (see Exercise 6.67 
on page 398). 

Here are some globalization results. 

Proposition 11.30. Let I and J be ideals in a domain R. If I m — J m for every maximal 
ideal m, then I — J. 

Proof Take be J, and define 

(/ : b) = {r e R : rb e I). 

Let m be a maximal ideal in R. Since I m — J m . there are a e I and s f m with b/l — a/s. 
As R is a domain, sb = a e I, so that s e (/ : b); but s / m, so that (I : b) C m. Thus, 
(/ : b ) cannot be a proper ideal, for it is not contained in any maximal ideal. Therefore, 
(/ : b) = R ; hence, 1 e (/ : b) and b = lb e I. We have proved that / C /, and the 
reverse inclusion is proved similarly. • 

Proposition 11.31. Let R be a commutative ring. 

(i) If M is an R-module with M m — {0 } for every maximal ideal m, then M = {0}. 

(ii) If f : M -> N is an R-map and f m : M m -> N m is an injection for every maximal 
ideal m, then f is an injection. 

(iii) If f : M -> N is an R-map and f m : M m —>• N m is a surjection for every maximal 
ideal m, then f is a surjection. 

(iv) If f : M —> N is an R-map and f m : M m —» N m is an isomorphism for every 
maximal ideal m, then f is an isomorphism. 

Proof, (i) If M ^ {0}, then there is m e M with m f 0. It follows that the annihilator 
I = {r e R : rm — 0} is a proper ideal in R. for 1 j /, and so there is some maximal 
ideal m containing I. Now 1 ® m — 0 in M m , so that m e ker /; v/ • Proposition 11.25 gives 
s f. m with sm — 0 in M. for R — mis multiplicatively closed. Hence, self m, and 
this is a contradiction. Therefore, M — {0}. 

(ii) There is an exact sequence 0 K -+ M —s N. where K = ker /. Since localization 
is an exact functor, there is an exact sequence 

0 -* K m -> M m N m 

for every maximal ideal m. By hypothesis, each f m is an injection, so that K m — {0} for 
all maximal ideals m. Part (i) now shows that K — {0}, and so / is an injection. 

/ 

(iii) There is an exact sequence M —> N —> C —>■ 0, where C = coker / = N / im/. 
Since tensor product is right exact, C m = {0} for all maximal ideals m, and so C — {0}. 
But / is surjective if and only if C — coker / = {0}. 

(iv) This follows at once from parts (ii) and (iii). • 
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We cannot weaken the hypothesis of Proposition 11.31 (iv) to M m = N m for all maximal 
ideals m; we must assume that all the local isomorphisms arise from a given map /: M —* 
N. If G is the subgroup of Q consisting of all a/b with b squarefree, then we saw, in 
Example 11.5, that G( P ) = Z( p ) for all primes p, but G Z. 

Exercises 11.20 and 11.22 on page 921 show that localization preserves projectives and 
flats; that is, if A is a projective /^-module, then S ~ 1 ,4 is a projective (S ~ 1 R (-module, and 
if B is a flat R -module, then S~ l B is a flat (.S’ -1 A 1 (-module. Preserving injectivity is more 
subtle. 


Lemma 11.32. Let S be a subset of a commutative ring R, and let M and A be R-modules 
with A finitely presented. Then there is a natural isomorphism 

r A : S~ l Hom R (A.M) Hom r i R (S~ l A, S~ l M). 

Proof. It suffices to construct natural isomorphisms 

9 a : Hom s (A, S~ l M ) -> Hom r i R (S~ l A. S~ l M ) 

and 

q> A : S -1 Homs(A, M) -> Hom«(A, 
for then we can define r A = 9 A (Pa- 

Assume first that A = R" is a finitely generated free /^-module. If «i. .... a n is a basis 
of A, then a\/\, .. ., a n /\ is a basis of 5 _1 A = S~ l R <g) R R". The map 

9 R n: Horn*(A, S~ l M ) -» Hom s -i R (S~ l A, S~ l M), 

given by / i-^ /, where f(ai/o) = fiafl/a, is easily seen to be a well-defined R- 
isomorphism. 

If, now, A is a finitely presented /^-module, then there is an exact sequence 

A -> 0. (2) 

Applying the contravariant functors Horn/A ,M') and Hom 5 -i R ( . M'), where M' = 
S~ l M is first viewed as an R -module, gives a commutative diagram with exact rows 


0->■ Homs (A, M') 

0 A \ 


> Hom^jk", M') 

e R n I 


5 - Homsl/^, M') 


V Y V 

0 Hom s -i s (5 _1 A, M') Hom^i^ir 1 /?)", M') ^ Hom^i^CCS- 1 /?)', M'). 

Since the vertical maps 0 R n and 0 R t are isomorphisms, there is a dotted arrow 9 a which 
must be an isomorphism, by Proposition 8.94. If ft e Homs(A, M), then the reader may 
check that 


9a(P) — P- a/a f{a)/o, 
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from which it follows that the isomorphisms 9a are natural. 

Construct cpA'- S -1 Hom«(A, M) —» Hom^(A, S~ l M ) by defining cpA : g/o i-> g , T , 
where g„ (a) — g(a)/o. Note that <pa is well-defined, for it arises from the R -bilinear 
function S _1 /fxHom* (A, M) -> Hom^(A, S~ l M) given by (r/er, g) i-> rg a (remember 
that S -1 Hom^(A, M) — S~ l R (S)r Homs (A, Mj). Observe that ipA is an isomorphism 
when A is finitely generated free, and consider the commutative diagram 

0- S~ l Hom«(A, M) -> S~ l Horn R (R n , M) ->- S~ l Horn R (R r , M) 


(PA 

<PR " 




' Y 


0-> Hom^(A, S~ l M) -Horn r (R", S~ l M) --Horn r (R', S~ l M). 

The top row is exact, for it arises from Eq. (2) by first applying the left exact contravari- 
ant functor Hom/?( , M), and then applying the exact localization functor. The bottom 
row is exact, for it arises from Eq. (2) by applying the left exact contravariant functor 
Horn/A , S~ l M). The five lemma. Exercise 8.52 on page 604, shows that q),\ is an isomor¬ 
phism. • 

Example 11.33. 

Lemma 11.32 can be false if A is not finitely presented. For example, let R — Z and 
S~ 1 R — Q. We claim that 

Q Hom z (Q, Z) ^ HomQ(Q 0 Z Q, Q Z). 

The left-hand side is {0} because HomyAQ, Z) = {0}. On the other hand, the right-hand 
side is Hom^CQ, Q) = Q. ◄ 


Proposition 11.34. If S is a subset of a commutative noetherian ring R, and if E is an 
injective R-module, then S~ l E is an injective (S -1 R)-module. 

Remark. This result can fail if R is not noetherian. If A: is a field and R = k[X], where X 
is an uncountable set of variables, then there exists an injective R -module E and a subset 
S of R such that S~ l E is not an injective (.S’~ 1 R (-module (see E. C. Dade, “Localization 
of Injective Modules,” Journal of Algebra 69 (1981), 416—425). ◄ 

Proof. By the Baer criterion. Theorem 7.68, it suffices to prove that 

i*: Horn s - lR (S~ l R. S~ l E) -> Hom s _i s (/, S~ l E ) 

is surjective for every ideal J in S~ 1 /(, where i: J -> S~ l R is the inclusion. Now every 
ideal J in S~ { R has the form J = S~ l I, by Corollary 11.18, where I is an ideal in R. 
Since R is noetherian, every ideal is a finitely presented /^-module, and so Lemma 11.32 
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applies to give a commutative diagram whose vertical arrows are isomorphisms 


S~' Horn r (R, E) ->■ S~ l Hom s (7, E) 


"i 

Hom s -i s (5 -1 /?, S~ l E) 


i" 

Hom s -i^(5 -1 /, S~'E). 


Injectivity of E implies that Hornet R, E) —» Hom^(7, E) is surjective (where the arrow 
is induced from the inclusion 7 —> R), so that exactness of localization shows that the 
arrow in the top row is surjective. Since the vertical arrows are isomorphisms, the arrow in 
the bottom row is surjective. Therefore, J — S~ [ I is an injective (S~ 1 7?)-module. • 

Localization commutes with Tor, essentially because S~ 1 R is a flat 7?-module. 


Proposition 11.35. If S is a subset of a commutative ring R, then there are isomorphisms 
S _1 Tor,f(A, B) = Tor S~ l B) 
for all n > 0 and for all R-modules A and B. 

Proof. First consider the case n — 0. For fixed 7?-module A , there is a natural isomor¬ 
phism 

r B : S _1 (A ® R B) -> S _1 A ® s -i R S~ l B, 
for either is a solution U of the universal mapping problem 

S~ l A x S~ l B - U, 

/ f 

M 

where M is an (.S’ - 1 R (-module, / is (S~ 1 R (-bilinear, and / is an (S~ 1 R )-map. 

If Pb is a deleted projective resolution of 11, then exactness of localization, together with 
localization preserving projectives, show that S~ l (P B ) is a deleted projective resolution of 
S~ 1 II. Naturality of the isomorphisms ta gives an isomorphism of complexes 

S~ l (A ® R P B ) = S~ l A ® s -i R S~\ Pb), 

so that their homology groups are isomorphic. Since localization is an exact functor. Propo¬ 
sition 10.38 applies, and 

H n (S~\A ® R Pb)) = S~ l H„(A ® R P B ) = S” 1 Tor*(A, B). 

On the other hand, since S~ 1 (Pb ) is a deleted projective resolution of S~ 1 B, the definition 
of Tor gives 

H n (S~ l A ® s - lR S _1 (Pb)) = Torf ’ >R (S~ l A, S~ l B). • 
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Corollary 11.36. Let A be an R-module over a commutative ring R. If A m is a flat 
R m -module for every maximal ideal m, then A is aflat R-module. 

Proof The hypothesis, together with Proposition 10.96, give Tor,, m (,4 m , B m ) = {0} for 
all n > 1, for every A 1 -module B, and for every maximal ideal m. But Proposition 11.35 
gives Tor,f (A, B) m = {0} for all maximal ideals m and all n > 1. Finally, Proposi¬ 
tion 11.31 shows that Tor,f (A, B ) = {0} for all n > 1. Since this is true for all R -modules 
B , we have A flat. • 

We must add some hypotheses to get a similar result for Ext (see Exercise 11.23 on 
page 921). 

Lemma 11.37. If R is a left noetherian ring and A is a finitely generated left R-module, 
then there is a projective resolution P. of A in which each P n is finitely generated. 

Proof. Since A is finitely generated, there exists a finitely generated free left R -module 
P 0 and a suijective A 1 -map e: Pq -* A. Since R is left noetherian, kere is finitely gener¬ 
ated, and so there exists a finitely generated free left P-module Pi and a surjective P-map 
di: P\ — > kere. If we define l)\ : P\ —> Pq as the composite id\ , where i: kere —> Pq is 
the inclusion, then there is an exact sequence 

0 —> ker P»i -> Pi P 0 —% A -» 0. 

This construction can be iterated, for kerZ>i is finitely generated, and the proof can be 
completed by induction. (We remark that we have, in fact, constructed a free resolution 
of A.) • 

Proposition 11.38. Let S be a subset of a commutative noetherian ring R. If A is a 
finitely generated R-module, then there are isomorphisms 

S~ l Ext'jj(A, B ) = Ext n s _ lR (S~ l A, S~ l B) 

for all n > 0 and for all R-modules B. 

Proof. Since R is noetherian and A is finitely generated. Lemma 11.37 says there is a 
projective resolution P of A each of whose terms is finitely generated. By Lemma 11.32, 
there is a natural isomorphism 

r A : S~ l Hom^(A, B) -+ Hom s -i R (S~'A, S~ l B) 

for every R -module B (a finitely generated module over a noetherian ring must be finitely 
presented). Now t a gives an isomorphism of complexes 

S-'Olom^PA, B )) = Hom s -i s (S- 1 (P A ), S~ l B). 

Taking homology of the left hand side gives 

//„(£“‘(Horn* (Pa, B ))) = S~- 1 H n (Hom s (P A , B)) = S~ x Ext^(A, B ), 
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because localization is an exact functor (Proposition 10.38). On the other hand, homology 
of the right hand side is 

H n (Hom s - lR (S- l (P A ),S- l B)) = Exf s _ lR (S- l A,S- l B), 
because S - 1 (Pa) is an (S _1 /?)-projective resolution of S _1 A. • 

Remark. An alternative proof of the Proposition 11.38 can be given using a deleted 
injective resolution Eg in the second variable. We must still assume that A is finitely 
generated, in order to use Lemma 11.32, but now we use the fact, when R is noetherian, 
that localization preserves injectives. 4 

Corollary 11.39. Let A be a finitely generated R-module over a commutative noetherian 
ring R. Then A m is a projective R m -niodule for every maximal ideal m if and only if A is 
a projective R-module. 

Proof. Sufficiency is Exercise 11.20 on page 921, and necessity follows from Proposi¬ 
tion 11.38: For every R -module B and maximal ideal m, we have 

Extg(A, B ) m = Extg m (A m , B m ) — {0}, 

because A m is projective. By Proposition 11.31, ExtJjfA, B ) = {0}, which says that A is 
projective. • 

Exercises 


11.1 Prove that Z( p ) Q as Z( p )-modules. 

11.2 If R is a domain with Q = Frac (R), prove that every /Lsubalgebra A of Q is a localization 
of R. 

Hint. Define S = {b e R : 1/b e A). 

11.3 Prove that the following statements are equivalent for a torsion-free abelian group G of rank 1. 

(i) G is finitely generated. 

(ii) G is cyclic. 

(iii) If x S G is nonzero, then h p (x) — 0 for almost all p and h p (x) oo for all primes p. 

(iv) t(G) = t(Z). 


11.4 (i) If G is a torsion-free abelian group of rank 1, prove that the additive group of End(G) 

is torsion-free of rank 1. 

(ii) Let x e G be nonzero with x(-*) — feW. h$(x ),..., h p {x ), ...), and let R be the 
subring of Q in which x(l) = (kj, kj, ... , k p , ...) and 

loo ifh p (x) = oo 
jo if h p (x) is finite. 

Prove that End(G) = R. Prove that there are infinitely many G with Aut(G) = Ho. 
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11.5 Let G and H be torsion-free abelian groups of rank 1. 

(i) Prove that G ®i H is torsion-free of rank 1. 

(ii) If (/ip) is the height sequence of a nonzero element x e G, and if ( k p ) is the height 
sequence of a nonzero element y e H, prove that the height sequence of x ® y is ( m p ), 
where m p = h p + k p (we agree that oo + k p = oo). 

11.6 Let T be the set of all types, and define r < x', for x, x' e T, if there are height sequences 
( k p ) e r and (kp) e x' with k p < k' p for all primes p. 

(i) Prove that < is a partial order on T. 

(ii) Prove that if G and G' are torsion-free abelian groups of rank 1, then x(G) < x(G') if 
and only if G is isomorphic to a subgroup of G'. 

(iii) Prove that T is a lattice, and show that if r = x(G) and x' = x(G'), then r A x' = 
x(G n G') and r V x' = x(G + G'). 

(iv) If G and G' are torsion-free abelian groups of rank 1, prove that Hom(G, G') ^ {0} if 
and only if x(G) < x(G'). 

11.7 If G is a p-primary abelian group, prove that G is a Z( p ) -module. 

11.8 If S is a subset of a commutative ring R, and if 5 is the multiplicatively closed subset it 
generates, prove that (S)~ 1 R = S -1 R. 

11.9 If 5 = {sj,..., s n ] is a finite nonempty subset of a commutative ring R. prove that S -1 R = 

R, where s = s± - ■ ■ s n . 

Hint. If.? -1 exists, then so does s 1 (si • • ■'s) ■ ■ ■ s n ) = s- 1 . 

11.10 Prove that every localization of a PID is a PID. Conclude that if p is a prime ideal in a PID R, 
then Rp is a DVR. 

11.11 If R is a Boolean ring and m is a maximal ideal in R , prove that R m is a field. 

11.12 Let 5 be a subset of a commutative ring R , and let I and J be ideals in R. 

(i) Prove that 5“ 1 (77) = (S~ l I)(S~ l J). 

(ii) Prove that 5 _1 (7 : J) = (S -1 / : S~ l J). 

11.13 A domain R is a valuation ring if, lor all a. b e R, either a \ b or b \ a. 

(i) Prove that every DVR is a valuation ring. 

(ii) Let R be a domain with F = Frac(R). Prove that R is a valuation ring if and only if 
a e R or a -1 e R for each nonzero a e F. 

11.14 (i) Prove that every finitely generated ideal in a valuation ring is principal. 

(ii) Prove that every finitely generated ideal in a valuation ring is projective. 

11.15 An abelian group T is ordered if it is a partially ordered set in which a + b < a' + b' whenever 
a < a' and b < b’\ call V a totally ordered abelian group if the partial order is a chain. A 
valuation on a field k is a function v: k x -*■ T, where T is a totally ordered abelian group, 
such that 


v(ab ) — v(a ) + v(b)\ 
v(a + b) > min{u(fl), v(b)). 

(i) If a/b e Q is nonzero, write a — p"’a' and b — p n b', where m, n > 0 and (a' , p) = 
1 = (b 1 , p). Prove that v: Q x —> Z, defined by v(a/b) = m — n, is a valuation. 
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(ii) If v: k x —>■ T is a valuation on a field k, define R = {0} U (a e k x : v{a) > 0). 
Prove that R is a valuation ring. (Every valuation ring arises in this way from a suitable 
valuation on its fraction field. Moreover, the valuation ring is discrete when the totally 
ordered abelian group F is isomorphic to Z.) 

(iii) Prove that a e R is a unit if and only if v(a) — 0. 

(iv) Prove that every valuation ring is a (not necessarily noetherian) local ring. 

Hint. Show that m = [a e R : v(a ) > 0) is the unique maximal ideal in R. 

11.16 Let T be a totally ordered abelian group and let k be a field. Define &|T] to be the group 
algebra (consisting of all functions /: T —>■ k almost all of whose values are 0). As usual, if 
f(y) = r Y , we denote / by £ ye r r y y. 

(i) Define the degree of f — Ylyer r yY to be a if or is the largest index y with r y ^ 0. 
Prove that fc[F] is a valuation ring, where v{f) is the degree of /. 

(ii) Give an example of a non-noetherian valuation ring. 

11.17 A subset S of a commutative ring R is saturated if it is multiplicatively closed and ab e S 
implies a e S and b e S. 

(i) Prove that U(R), the set of all units in R. is a saturated subset of R. 

(ii) An element r e R is a zero divisor on an IGmodule A if there is some nonzero a e A 
with ra = 0. Prove that Z(A), the set of all zero divisors on an IGmodule A, is a 
saturated subset of R. 

(iii) If S is a multiplicatively closed subset of a commutative ring R, prove that there exists a 
unique smallest saturated subset S' containing S (we call S' the saturation of 5). Prove 
that (S')~ l R = S~ l R. 

(iv) Prove that a multiplicatively closed subset S is saturated if and only if its complement 
R — S is a union of prime ideals. 

11.18 Let S be a subset of a commutative ring R , and let M be a finitely generated .R-module. Prove 
that S~ l M = {0} if and only if there is a e S with crM = {0j. 

11.19 Let 5 be a subset of a commutative ring R , and let A be an /{-module. 

(i) If A is free, prove that S~ 1 A is a free (S _1 IG-module. 

(ii) If A is finitely generated, prove that S~ 1 A is a finitely generated (S -1 /{(-module. 

(iii) If A is finitely presented, prove that 5“ 1 A is a finitely presented (S~ 1 f?)-module. 

11.20 If A is projective, prove that 5 1 A is a projective (5 1 f?)-module. 

11.21 If p is a prime ideal in a commutative ring R and if A is a projective ^-module, prove that Ap 
is a free Rp -module. 

11.22 If B is a flat ^-module, where R is a commutative ring, prove that the localization S -1 B is a 
flat (S -1 ^(-module. 

Hint. The composite of exact functors is exact. 

11.23 (i) Give an example of an abelian group B for which Extg(Q, B ) ^ {0}. 

(ii) Prove that Q Ext^(Q, B) ^ {0} for the abelian group B in part (i). 

(iii) Prove that Proposition 11.38 may be false if R is noetherian but A is not finitely gener¬ 
ated. 

11.24 Let R be a commutative /.'-algebra, where k is a commutative ring, and let M be a ^-module. 
Prove, for all n > 0, that 

R® k /\\M) = /\\R® k M) 
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(of course, /\" (R <S>k Af) means the nth exterior power of the ^-module R ®£ M). Conclude, 
for all maximal ideals m in k, that 

Hint. Show that R /\” (M) is a solution to the universal mapping problem for alternating 
7i-multilinear ^-functions. 

11.25 Let R be a commutative noetherian ring. If A and B are finitely generated ^-modules, prove 
that Tor^(A, B ) and Ext^(A, B) are finitely generated R -modules for all n. 


11.2 Dedekind Rings 

A Pythagorean triple is a triple (a, b, c) of positive integers such that a 2 + b 2 — c 2 . Exam¬ 
ples of Pythagorean triples are (3, 4, 5), (5, 12, 13), and (7, 24, 25), and all Pythagorean 
triples are classified in Exercise 1.23 on page 13 (there is an elegant geometric proof of 
this by Diophantus, ca. 100 AD). P. Fermat proved that there do not exist positive integers 
(a, b, c ) with a 4 + b 4 = c 4 and, in 1637, he wrote in the margin of his copy of a book 
by Diophantus that he had a wonderful proof that there are no positive integers (a, b, c ) 
with a" + b n — c n for any n > 2. Fermat’s proof was never found, and his remark 
(that was merely a note to himself) became known only several years after Fermat’s death, 
when Fermat’s son published his father’s works. There were other such statements left by 
Fermat, many of them true, some of them false, and this statement, the only one unre¬ 
solved by 1800, was called Fermat’s last theorem, perhaps in jest. It remained one of the 
outstanding challenges in number theory until 1995, when A. Wiles proved Fermat’s last 
theorem. 

Every positive integer n > 2 is a multiple of 4 or of some odd prime p. Thus, if there do 
not exist positive integers (a, b, c ) with a p +b p = c p for every odd prime p, then Fermat’s 
last theorem is true [if n = pm, then a n + b n = c n implies ( a m ) p + (b"') p — (c m ) p ]. 
Over the centuries, there were many attempts to prove it. For example, L. Euler published 
a proof (with gaps, later corrected) for the case n — 3, G. P. L. Dirichlet published a proof 
(with gaps, later corrected) for the case n — 5, and G. Lame published a correct proof for 
the case n — 7. 

The first major progress (not dealing only with particular primes p) was due to E. Rum¬ 
mer, in the middle of the 19th century. If a p + b p = c p , where p is an odd prime, then a 
natural starting point of investigation is the identity 

c p — a p + b p = {a + b)(a + t;b)(a + t; 2 b ) • • • {a + t; p ~ l b ), 

where £ = t, p is a primitive /;th root of unity. Rummer proved that if Z[t; p ] is a UFD, 
where Z[g p ] = {f(t, p ) : f(pc) e Z[x]}, then there do not exist positive integers a, b, c 
with a p + b p = c p . On the other hand, he showed that there do exist primes p for which 
Z[£ p ] is not a UFD. To restore unique factorization, he invented “ideal numbers” that he 
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adjoined to Z[£ p ], Later, R. Dedekind recast Rummer’s ideal numbers into our present 
notion of ideal. Thus, Fermat’s last theorem has served as a catalyst in the development of 
both modern algebra and of algebraic number theory. Dedekind rings are the appropriate 
generalization of rings like Z[g p ], and we will study them in this section. 


Integrality 

The notion of algebraic integer is a special case of the notion of integral element. 

Definition. A ring extension R*/R is a commutative ring R* containing R as a subring. 
If R*/R is a ring extension, then an element a e R* is integral over R if it is a root of 
a monic polynomial in R [x ]. A ring extension R*/R is an integral extension if every 
a e R* is integral over R. 

Example 11.40. 

The Noether Normalization Theorem is often used to prove the Nullstellensatz. It states 
that if k is a field and A is a finitely generated k-algebra, then there exist algebraically inde¬ 
pendent elements a\,..., a n in A so that A is integral over k[a\,... ,a n ]. See Matsumura, 
Commutative Ring Theory, page 262. ◄ 

Recall that a complex number is an algebraic integer if it is a root of a monic polynomial 
in Z[x], so that algebraic integers are integral over Z. The reader should compare the next 
lemma with Proposition 7.24. 

Lemma 11.41. If R*/R is a ring extension, then the following conditions on a nonzero 
element u e R* are equivalent. 

(i) u is integral over R. 

(ii) There is a finitely generated R-submodule B of R* with uB C B. 

(iii) There is a finitely generated faithful R-submodule B of R* with uB C B\ that is, if 
dB — {0 } for some d € R, then d = 0. 

Proof, (i) =>■ (ii). If u is integral over R, there is a monic polynomial fix) e R[x] with 
f{u) — 0; that is, there are r; e R with u n — Xw=o r > u ' ■ Define B — (1 ,u, m 2 , ..., 

It is clear that uB C B. 

(ii) =>■ (iii). If B = (bi,..., b m ) is a finitely generated R-submodule of R* with uB C B, 
define B' — ( \ ,h\,..., b m ). Now If is finitely generated, faithful (because 1 e If), and 
uB' c B'. 

(iii) =>■ (i). Suppose there is a faithful R-submodule of R *, say, B — [bj,..., b „), with 
uB C B. There is a system of n equations ubj = ^" =1 Pijbj with ptj e R. If P — [pij] 

and if X — (b\ . b,,)' is an n x 1 column vector, then the n x n system can be rewritten 

in matrix notation: {ul — P)X = 0. Now 0 = (adj (uI — P))(ul — P)X = dX, where 
d = det (ul — P), by Corollary 9.161. Since dX — 0, we have dbj = 0 for all i, and so 
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dB = {0}. Therefore, d — 0, because B is faithful. On the other hand. Corollary 9.154 
gives d = f (m), where fix) e /?[*] is a monic polynomial of degree n; hence, u is 
integral over R. • 

Being an integral extension is transitive. 

Proposition 11.42. If 7 C S C R are commutative rings with S integral over T and R 
integral over S, then R is integral over T. 

Proof. If r e R, there is an equation r" + ,v„_ i r''~ 1 + ■ ■ ■ + ro = 0, where s,- e S for all 
i. By Lemma 11.41, the subring 5' = T[s n -\ , ..., so] is a finitely generated T -module. 
But r is integral over S' , so that the ring S'[r] is a finitely generated ^'-module. Therefore, 
5'[r] is a finitely generated T -module, and so r is integral over T. • 

Proposition 11.43. Let E/R be a ring extension. 

(i) If u, v € E are integral over R, then both uv and u + v are integral over R. 

(ii) The commutative ring Oe/r, defined by 

Oe/r — {u e E : u is integral over R}, 
is an R-subalgebra of E. 

Proof, (i) Since u and v are integral over R. Lemma 11.41(h) says there are /Nsubmodules 
B — (bi,... ,b n ) and C = {ci,..,, c m ) of E with uB c B and tC c C; that is, ubi e B 
for all i and vcj e C for all j. Define BC to be the R -submodule of E generated by all 
biCj\ of course, BC is finitely generated. Now uvBC C BC, for uvbicj = (ubfivcj) is 
an R -linear combination of b^ci s, and so uv is integral over R. Similarly, u + v is integral 
over R, for (u + v)btCj — (ubi)cj + (vcj)bj e BC. 

(ii) Part (i) shows that Oe/r is closed under multiplication and addition. Now R C Oe, 
for if r e R, then r is a root of x — r. It follows that 1 e Oe/r and that Oe/r is an 
/Nsubalgebra of E. • 

Here is a second proof of Proposition 11.43(i) for a domain E which uses tensor prod¬ 
ucts and linear algebra. Let fix) e /?[x] be the minimal polynomial of it, let A be the com¬ 
panion matrix of fix), and let y be an eigenvector [over the algebraic closure of Frac( E)]: 
Ay = uy. Let g(x) be the minimal polynomial of v, let B be the companion matrix of 
g(x), and let Bz = vz. Now 

(A <g> B)(y ® z) = Ay <g> Bz = uy <g> vz = uvfy ® z). 

Therefore, mu is an eigenvalue of A ® B\ that is, uv is a root of the monic polynomial 
det(v/ — A ® B), which lies in R[x] because both A and B have all their entries in R. 
Therefore, uv is integral over R. Similarly, the equation 

(A ® / + / (g> B)iy ® z) — Ay ® z + y <E> Bz = (u + v)y ® z 


shows that u + v is integral over R. 
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Definition. Let E/R be a ring extension. The R -subalgebra Oe/r of E, consisting of 
all those elements integral over R, is called the integral closure of R in E. If Oe/r = R, 
then R is called integrally closed in E. If R is a domain and R is integrally closed in 
F — Tract R ), that is, Oe/r — R, then R is called integrally closed. 

Thus, R is integrally closed if a e Traci R) and a is integral over R, then a e R. 

Example 11.44. 

The ring Oq/% = Z, for if a rational number a is a root of a monic polynomial in Z[x], 
then Theorem 3.43 shows that a e Z. Hence, Z is integrally closed. 4 


Proposition 11.45. Every UTD R is integrally closed. In particular, every PID is inte¬ 
grally closed. 

Proof. Let F = Traci W), and suppose that it e F is integral over R. Thus, there is an 
equation 

u n -\-r n —\u n + • • • + r\u + ro = 0, 

where r, e R. We may write u — b/c, where b, c e R and (b, c) — 1 (gcd’s exist because 
R is a UTD, and so every fraction can be put in lowest terms). Substituting and clearing 
denominators, 

b" + r n -ib n ~ l c + ■ ■ ■ + r\bc n ~ l + roc" = 0. 

Hence, b" — — c(r„-ib"~ l + ■ ■ ■ + r\bc n ~ 2 + roc" -1 ), so that c | b n in R. But ib , c) = 1 
implies (b", c) — 1, so that c must be a unit in R ; that is, c -1 e R. Therefore, u — b/c = 
bc~ l e R. and so R is integrally closed. • 

We now understand Example 6.21. If k is a field, the subring R of k[x], consisting of all 
polynomials f(x) e k[x] having no linear term, is not a UTD because it is not integrally 
closed. It is easy to check that Traci/s') = k(x), for x = x 3 /x 2 e Trac(/?). But x e k(x) is 
a root of the monic polynomial t 2 — x 2 € /?[/], and x f R. 

Definition. An algebraic number field is a finite field extension of Q. If E is an algebraic 
number field, then Oe /z is usually denoted by Oe instead of by Oe /z> an d it is called the 
ring of integers in E. 

Because of this new use of the word integers , algebraic number theorists often speak of 
the ring of rational integers when referring to Z. 

Proposition 11.46. Let E be an algebraic number field and let Oe be its ring of integers. 

(i) If a e E, there there is a nonzero integer m with mu e Oe- 

(ii) Trac(0£) = E. 

(iii) Oe is integrally closed. 
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Proof, (i) If a & E, then there is a monic polynomial f(x) e Q[jc] with f(a) = 0. 
Clearing denominators gives an integer m with 

ma" + c n -\a n ~ { + c n - 2 a n ~ 2 + ■ ■ ■ + c\a + co = 0, 

where all Cj e Z. Multiplying by m"~ l gives 

(, ma) n + c n -i(ma) n ~ l + mc n - 2 {ma) n ~ 2 + ■ ■ ■ + c\m n ~ 2 {ma) + m n ~ l co — 0. 

Thus, ma e Oe- 

(ii) It suffices to show that if a e E, then there are a, b e Oe with a = a/b. But 
ma e Oe, by part (i), me Z C Oe, and a = ( ma)/m. 

(iii) Suppose that a e FracfO^) = E is integral over O p - By transitivity of integral 
extensions. Proposition 11.42, we have a integral over Z. But this means that a e Oe 
which is, by definition, the set of all those elements in E that are integral over Z. Therefore, 
Oe is integrally closed. • 

Example 11.47. 

We shall see, in Proposition 11.76, that if E = Q(i), then Oe — Z[i], the Gaussian 
integers. Now Z[i\ is a PID, because it is a euclidean ring, and hence it is a UFD. The 
generalization of this example which replaces Q (i) by an algebraic number field E is more 
subtle. It is true that Oe is integrally closed, but it may not be not true that the elements 
of Oe are Z-linear combinations of a. Moreover, the rings Oe may not be UFDs. We will 
investigate rings of integers at the end of this section. 4 

Given a ring extension R*/R, what is the relation between ideals in R* and ideals in R‘! 

Definition. Let R*/R be a ring extension. If I is an ideal in R, define its extension I e 
to be R*I , the ideal in R* generated by I. If I* is an ideal in R*. define its contraction 
I* c = R n I*. 

Remark. The definition can be generalized. Let h : R -» R* be a ring homomorphism, 
where R and R* are any two commutative rings. Define the extension of an ideal I in R 
to be the ideal in R* generated by /;(/); define the contraction of an ideal I* in R* to be 
/z~' (/*). If R*/R is a ring extension, then taking h : R —> R* to be the inclusion gives the 
definition above. Another interesting instance is the localization map h: R —»■ S~ l R. ◄ 

Example 11.48. 

(i) In general, the contraction function c: Spec {R*) —> Spec (A 1 ) is neither an injec¬ 
tion nor a surjection. For example, c: Spec(Q) —> Spec(Z) is not surjective, while 
c: Spec(Q[v]) —>• Spec(Q) is not injective. 

(ii) It is easy to see that if R*/R is a ring extension and p* is a prime ideal in R*, then its 
contraction p* fl R is also a prime ideal. If a, b e R and ab e p* fl R C p*, then p* prime 
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gives ciep* orie p*; as a. b e R , either a e p* fl R or b e p* ft R. Thus, contraction 
defines a function c: Specf R*) —*■ Spec(/?). 

(iii) The contraction of a maximal ideal, though necessarily prime, need not be maximal. 
For example, if R* is a field, then {0}* is a maximal ideal in R*, but if R is not a field, then 
the contraction of {0}*, namely, {0}, is not a maximal ideal in R. < 


Example 11.49. 

(i) LetX(R) denote the family of all the ideals in a commutative ring R. Extension defines 
a function e \ T(R ) —>• X(R*); in general, it is neither injective nor surjective. If R* is a 
field and R is not a field, then e: X(R) -* X(R*) is not injective; if R is a field and R* is 
not a field, then e: X(R) —» X(R*) is not surjective. 

(ii) If R*/R is a ring extension and p is a prime ideal in R. then its extension R* p need not 

be a prime ideal. Observe first that if (a) = Ra is a principal ideal in R. then its extension 
is the principal ideal R*a in R* generated by a. Now let R — M[x] and R* = C[x]. The 
ideal ( x 2 + 1) is prime, because x 2 + 1 is irreducible in M[x], but its extension is not prime 
because x 2 + 1 factors in C[x], ◄ 

There are various elementary properties of extension and contraction, such as I* ce c I* 
and I ec O /, that are collected in Exercise 11.28 on page 930. 

Is there a reasonable condition on a ring extension R*/R that will give a good rela¬ 
tionship between prime ideals in R and prime ideals in R * ? This question was posed and 
answered by I. S. Cohen and A. Seidenberg. We say that a ring extension R*/R satis¬ 
fies lying over if, for every prime ideal p in R. there exists a prime ideal p* in R* with 
p* n R — p. We say that R*/R satisfies going lip if p C q are prime ideals in R and if p* 
lies over p, then there exists a prime ideal q* 3 p* which lies over q. 



Lying over Going Up 

We are going to see that extension and contraction are well-behaved in the presence of 
integral extensions. 

Lemma 11.50. Let R* be an integral extension of R. 

(i) Ifp is a prime ideal in R and if p* lies over p, then R*/ p* is integral over R/p. 

(ii) Is S is a subset of R, then S~ l R* is integral over S~ 1 R. 
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Proof, (i) First, the second isomorphism theorem allows us to regard R/p as a subring of 
R*/p*: 

R/p = R/(p* n R) = (R + p*)/p* c R*/p*. 

Each element in R*/p* has the form a + p*, where a e R*. Since R* is integral over R, 
there is an equation 

a n + r„_icn" _1 H-F ro = 0, 

where r/ e R. Now view this equation mod p* to see that a + p* is integral over R/p. 

(ii) If a* e S~ l R*. then a * = a/a, where a e R* and a e S. Since R* is integral over 
R, there is an equation a" + r„_ia" _1 + • • • + ro = 0 with r, e R. Multiplying by 1 /a" 
in S~ l R* gives 


(a/a) n + (r n -\/a)(a/a) n 1 H-F r 0 /a n — 0, 

which shows that a/a is integral over S~ l R. • 

When R*/R is a ring extension and R is a field, every proper ideal in R* contracts to {0} 
in R. The following proposition eliminates this collapse when R* is an integral extension 
of R. 

Proposition 11.51. Let R* /R be a ring extension of domains with R* integral over R. 
Then R* is afield if and only if R is afield. 

Proof. Assume that R* is a field. If u e R is nonzero, then m -1 e R*. and so u~ l is 
integral over R. Therefore, there is an equation (m -1 )" + + • • • + ro = 0, 

where the r, e R. Multiplying by u n gives m -1 = —u(r n -\ + • • • + row" -1 ). Therefore, 
m - 1 e R and R is a field. 

Conversely, assume that R is a field. If a e R* is nonzero, then there is a monic 
fix) e R[x) with /(a) = 0. Thus, a is algebraic over R, and so we may assume that 
f{x) — irr(a, R): that is, f(x) is irreducible. If f(x) — ^”_gr,x ! , then 

a(a n ~ [ + r„_ia" _1 H- F n) = -r 0 . 

Irreducibility of fix) gives rg f 0, so that a -1 lies in R*. Therefore, R* is a field. • 


Corollary 11.52. Let R*/R be an integral extension. Ifp is a prime ideal in R and p* is 
a prime ideal lying over p, then p is a maximal ideal if and only ifp* is a maximal ideal. 

Proof. By Lemma 11.50(i), the domain R*/p* is integral over the domain R/p. But now 
Proposition 11.51 says that R*/p* is a field if and only if R/p is a field; that is, p* is a 
maximal ideal in R* if and only if p is a maximal ideal in R. • 
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Corollary 11.53. If E is an algebraic number field, then every nonzero prime ideal in 
Oe is a maximal ideal. 

Proof. Let p be a nonzero prime ideal in Oe- If p fl Z f {0}, then there is a prime p with 
p fl Z = (p), by Example 11.48(i). But (p) is a maximal ideal in Z, so that p is a maximal 
ideal, by Corollary 11.52. It remains to show that p fl Z ^ {0}. Let a e p be nonzero. 
Since a is integral over Z, there is an equation 

a" + c„_ia" -1 H-b c\a + co = 0, 

where c,- e Z for all i. If we choose such an equation with n minimal, then co f 0. Since 
a e p, we have co = — ch(qi„_i +c„_ia" _2 + - • - + ci) e pflZ, so that pflZ is nonzero. • 

Corollary 11.54. Let R* be integral over R, let p be a prime ideal in R, and let p* and 
q* be prime ideals in R* lying over p. Ifp* C q*, then p* = q*. 

Proof. Lemma 11.50(h) and Corollary 11.18(iii) show that the hypotheses are preserved 
by localizing at p; that is, Rp is integral over Rp and p* R* C q* R* are prime ideals. 
Hence, replacing R* and R by their localizations, we may assume that R* and R are local 
rings and that p is a maximal ideal in R (by Proposition 11.21). But Corollary 11.52 says 
that maximality of p forces maximality of p*. Since p* C q*, we have p* = q*. • 

Here are the theorems of Cohen and Seidenberg. 

Theorem 11.55 (Lying Over). Let R*/R be a ring extension with R* integral over R. If 
p is a prime ideal in R, then there is a prime ideal p* in R* lying over p; that is, p*fi/? = p. 

Proof. There is a commutative diagram 

R =-^>- R*, 

h h* 

Y 

tfp-^ S~ l R* 

j 

where h and h* are localization maps and i and j are inclusions. If S = R — p, then .S’~ 1 IP' 
is an extension of R p (since localization is an exact functor, R contained in R* implies R p 
contained in S - */?*); by Lemma 11.50, S~ l R* is integral over R p . Choose a maximal 
ideal m* in S~ l R*. By Corollary 11.52, m* fl R p is a maximal ideal in R p . But Rp is 
a local ring with unique maximal ideal p Rp, so that m* fl R p — pR p . Since the inverse 
image of a prime ideal (under any ring map) is always prime, the ideal p* = (/!*) _1 (m*) 
is a prime ideal in R*. Now 

(/t*i) _1 (m*) = i~ l (h*)- l (m*) = ; -1 (p*) = p* n R, 

while 

(jh)~ l (m*) = h~ 1 j~ 1 (m*) = h~ l ( m* n R p ) = h~ l (pR p ) = p. 

Therefore, p* is a prime ideal lying over p. • 
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Theorem 11.56 (Going Up). Let R*/R be a ring extension with R* integral over R. If 
p C q are prime ideals in R, and ifp* is a prime ideal in R* lying over p, then there exists 
a prime ideal q* lying over q with p* C q*. 

Proof. Lemma 11.50 says that (R*/p*)/(R/p) is an integral ring extension, where R/p 
is imbedded in R*/p* as (R + p*)/p*. Replacing R* and R by these quotient rings, we 
may assume that both p* and p are {0}. The theorem now follows at once from the lying 
over theorem. • 

There is also a going down theorem, but it requires an additional hypothesis. 

Theorem (Going Down). Let R* /R be an integral extension and assume that R is in¬ 
tegrally closed. If pi 2 p2 2 • • • ^ Pn is a chain of prime ideals in R and if p* 3 p* D 
• • • 3 p* , far m < n is a chain of prime ideals in R* with each p* lying over p,-, then the 
chain in R* can be extended to p* 3 p* D • • • D p* with p* lying over p; for all i < n. 

Proof. See Atiyah-Macdonald, Introduction to Commutative Algebra, page 64. • 


Exercises 

11.26 If R is an integrally closed domain and 5 is a multiplicatively closed subset of R not containing 
0, prove that 5 “ 1 R is also integrally closed. 

11.27 Prove that every valuation ring is integrally closed. 

11.28 Let R*/R be a ring extension. If I is an ideal in R, denote its extension by I e ; if I* is an ideal 
in R*. denote its contraction by I* c . Prove each of the follow assertions. 

(i) Both e and c preserve inclusion: If / C J, then I e C J e \ if /* C J *, then I* c C J* c . 

(ii) I* ce C I* and I ec 2 /. 

(iii) I* cec = I* c and I ece = I e . 

(iv) (I* + J*) c D I* c + J* c and (/ + J f = I e + J e . 

(v) (I* n J*) c = i* c n j* c and (/ n J) e C I e n j e . 

(vi) (I*J*) C D i*cj*c and (/ jy = jeje 

(vii) (s/I*) c = x/7*^ and (SJ) e C VF. 

(viii) ( J* : I*) c C (J* c : I* c ) and (I : J) e C ( I e : J e ). 

11.29 If A is the field of all algebraic numbers, then is the ring of all algebraic integers. Prove 
that 

c>a n q = z. 

Conclude, for every algebraic number field E, that Oe n Q = Z. 

11.30 Let R* /R be an integral ring extension. 

(i) If ci e R is a unit in R*, prove that a is a unit in R. 

(ii) Prove that J(R) = R fl J(R*), where J(R) is the Jacobson radical. 

11.31 Let R* / R be an integral extension. If pj C p 2 C • • • C p„ is a chain of prime ideals in R and 
if P* Q P2 — ’ ’ ’ — Pm ’ f° r m < n is a chain of prime ideals in R* with each p* lying over 
pi, then the chain in R* can be extended to p* C p* C • • • C p* with p* lying over p,- for all 
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i < n. (The going up theorem is so called because the chain of ideals in R* is ascending, in 
contrast to the going down theorem in which the chain of ideals in R* is descending.) 

11.32 Let R*/R be an integral extension. If every nonzero prime ideal in R is a maximal ideal, prove 
that every nonzero prime ideal in R* is also a maximal ideal. 

Hint. See the proof of Corollary 11.53. 

11.33 Let a be algebraic over Q, let E/Q be a splitting field, and let G = Gal(£'/Q) be its Galois 
group. 

(i) Prove that if a is integral over Z, then, for all a € G, a (a) is also integral over Z. 

(ii) Prove that a is an algebraic integer if and only if irr(a, Q) e Z[x], Compare this proof 
with that of Corollary 6.29. 

(iii) Let E be an algebraic number field and let R C E be integrally closed. If a e E. prove 
that irr(a, Frac(/?)) e /?[*]. 

Hint. If E is a Galois extension of Frac(W) containing a , then G = Gal(£/FracG?)) 
acts transitively on the roots of a. 


Nullstellensatz Redux 

In this subsection, we will prove the Nullstellensatz for arbitrary algebraically closed fields 
(recall that our proof in Chapter 6 assumed that k is uncountable). There are different 
proofs of this result, and we present the proof discovered by O. Goldman, as expounded in 
Kaplansky, Commutative Rings. 

Definition. A ring extension A/R is finitely generated if there is a surjective /^-algebra 
map <p: R[x i,..., x„] -* A. If (p (x,-) = a,, then we write 

A — R[a i, ..., a n ]. 

If I is an ideal in a commutative ring R , then the nilpotent elements in R/I arise from 
elements of \fl■ We now begin working toward a theorem of W. Krull that characterizes the 
nilpotent elements in a commutative ring, for this will give us information about radicals 
of ideals. 

Lemma 11.57. Let R be a domain with F — Frac( R). Then F/R is a finitely generated 
ring extension if and only if F/R is a simply generated ring extension ; that is, there is 
u e R with F = (and so the localization {w} — 1 is afield). 

Proof. Sufficiency being obvious, we prove only necessity. If F — R[a\/b\, ..., a„/b„\, 
define u — P|, ■ We claim that F = Clearly, F 2 ) R[u~ ] ], For the reverse 

inclusion, note that a,-/ b ,• = apfi/u e /?[m - 1 ], where m) = b\ ■ ■ ■ b\ ■ ■ ■ b n . • 

Definition. If R is a domain with F — FraciA 1 ). then R is a G-domain if F/R is a finitely 
generated ring extension. An ideal I in a commutative ring R is a G-ideal 4 if R/I is a 
G-domain. 

4 G-ideals are named after O. Goldman. 
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Every field is a G-domain, and so every maximal ideal in a commutative ring is a G- 
ideal. If I is a G-ideal, then R/I is a G-domain, hence a domain; therefore, every G-ideal 
is a prime ideal. Corollary 11.61 says that Z is not a G-domain; it follows that the prime 
ideal (x) in Z[x\ is not a G-ideal. 

Proposition 11.58. Let E/R be a ring extension in which both E and R are domains. If 
E is a finitely generated R-algebra and each a e E is algebraic over R (that is, a is a root 
of a nonzero polynomial in /?[x]), then R is a G-domain if and only if E is a G-domain. 

Proof. Let R be a G-domain, so that F — Frac( R ) = R\u~ ] | for some nonzero u e R. 
Therefore, £[m _ 1 ] C Frac( E), for u e R C E. But E\u~ l ] is a domain algebraic over the 
field F — R[u~ l ], so that E[u~ l ] is a field, by Exercise 11.35 on page 938. Since FracfF) 
is the smallest field containing E , we have = Frac(E), and so E is a G-domain. 

If E is a G-domain, then there is v e E with Frac( E ) = L[i; _l ]. By hypothesis, 
E — R\b \,. b n ], where b, is algebraic over F = Tract A 1 ) for all i. As v e E, we 
have v algebraic over R. and so v ~ 1 is algebraic over R. Thus, there are monic polyno¬ 
mials fo(x), fi(x) e F[x] with fo(v~ l ) — 0 and /, (bj ) = 0 for all i > 1. Clearing 
denominators, we obtain equations fi fU),) — 0, for i > 0, with coefficients in R: 

A)(f“V 0 + • • • = o 

Pibf + • • • = 0 . 

Define R* — R[/3q 1 , fif 1 ,..., /6” 1 ]. Each bi is integral over R*, for each fij is a unit 
in R*. Clearly, = R*[v _1 , bi, ...,b n ]. Thus, the field is integral over 

R*. by Proposition 11.43 (since FIT -1 ] = R*[v~ l , b\,..., b n ] and each of the displayed 
generators is integral over R*), and this forces R* to be a field, by Proposition 11.51. But 
R* — R[J1q 1 , fif 1 , ..., fif 1 ] c F, because fii e R for all i, so that R* = F. Therefore, 
F — R[Pq\ fif 1 ,..., is a finitely generated ring extension of R\ that is, R is a 
G-domain. • 

The next lemma leads to Corollary 11.60, an “internal” characterization of G-domains, 
phrased solely in terms of R, with no mention of Frac( R). 

Lemma 11.59. Let R be a domain with F = Frac( R). The following conditions are 
equivalent for a nonzero element u e R. 

(i) u lies in every nonzero prime ideal of R. 

(ii) for every nonzero ideal I in R, there is an integer n with u n e /. 

(iii) R is a G-domain', that is, F = /?[w -1 ]. 

Proof, (i) =>■ (ii). Suppose there is a nonzero ideal / for which u n I for all n > 0. If 
S — {u n : n > 0}, then I fl S = 0. By Zorn’s lemma, there is an ideal p maximal with 
I C p and p fl S = 0, and p is a prime ideal, by Exercise 6.9. This contradicts u lying in 
every prime ideal. 
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(ii) =>■ (iii). If b e R and b f 0, then n" e (b) for some n > 1, by hypothesis. Hence, 

n" = rb for some r e R, and so b~ l — ru~ n e Therefore ,F — R[m -1 ]. 

(iii) =>• (i). Let p be a nonzero prime ideal. If hep is nonzero, then b~ l = Yl'i=o r i u ~ l ■> 

where r; e R , because F — R[m - 1 ]. Therefore, u" = rju n ~ 1 ') lies in p, because 

be p and JT riu"~ l e R. Since p is a prime ideal, u e p. • 

Corollary 11.60. A domain R is a G-domain if and only if p) p pr i me p f {0}. 

PAO 

Proof. By Lemma 11.59, R is a G-domain if and only if it has a nonzero element u lying 
in every nonzero prime ideal. • 

Corollary 11.61. If R is a PID, then R is a G-domain if and only if R has only finitely 
many prime ideals. 

Proof. If R is a G-domain, then I — f]p f {0}, where p ranges over all nonzero prime 
ideals. If R has infinitely many prime ideals, then there are infinitely many nonassociate 
prime elements pi, p 2 , ...; that is, the (p,) are distinct prime ideals. If a e /, then p, \ a 
for all i. But a — q\ l ■ ■ ■ q'f 1 , where the qj are distinct prime elements, contradicting 
unique factorization in the PID R. 

Conversely, if R has only finitely many nonzero prime ideals, say, (pi),..., ( p m ), 
then the product pi • • • p m is a nonzero element lying in p], * Pi)- Therefore, R is a G- 
domain. • 

It follows, for example, that every DVR is a G-domain. 

Definition. If R is a commutative ring, then its nilradical is 

nil(R) = {r e R : r is nilpotent}. 

We note that nil(R) is an ideal. If r. s e R are nilpotent, then r n — 0 = s m , for positive 
integers m and n. Hence, 

(r + s ) m+n ~ x = ‘ + H ~ 

i=0 ^ 1 ' 

If i > n, then r' — 0 and the ;th term in the sum is 0; if i < n, then m + n — i — 1 > m, 
s m +n n = o, and the ith term in the sum is Ointhis case as well. Thus, (r+s) m+ " _1 = 0 
and r + s is nilpotent. Finally, rs is nilpotent, for ( rs) mn = r mn s ms = 0. 

The next theorem is an improvement on W. Krull’s original version, that characterizes 
the nilradical as the intersection of all the prime ideals. 

Theorem 11.62 (Krull). If R is a commutative ring, then 

mi (R)= n p= n p- 

p=prime ideal p=G-ideal 



934 


Commutative Rings III Ch. 11 


Remark. If R is a domain, then {0} is a prime ideal, and so nil (/?) = {0} (alternatively, 
there are no nonzero nilpotent elements in a domain). The intersection of all the nonzero 
prime ideals in a commutative ring R may be larger than ni 1 (/? ); this happens, for example, 
when R is a DVR. ◄ 

Proof It is obvious that n\\(R) C fVprime ideal P ^ n p =G-ideaiP : nilpotent elements 
lie in every prime ideal; every G-ideal is a prime ideal. Thus, it suffices to prove that 
f~jp_ g- ideal P — n il(^)- Suppose that u </ fj p= G-ideal P- It follows that u" 0 for all 
n > 1, for if u n = 0, then u n e p for every G-ideal p, and so u e p, because G-ideals are 
prime. Therefore, the multiplicatively closed set S — {u" : n > 1} does not contain 0. By 
Zorn’s lemma, there is an ideal q maximal with q fl S = 0 (that there are ideals disjoint 
from S requires 0 ^ S). We claim that q is a G-ideal, which will give u n p =G-ideai P- 
Now q is a prime ideal, by Exercise 6.9, so that R/q is a domain. Suppose there is a nonzero 
prime ideal p* in R/ q not containing u + q. There is an ideal p D q in R with p* = p/q 
(for p* f {0}), contradicting the maximality of q. Therefore, u + q lies in every nonzero 
prime ideal in R/ q. By Corollary 11.60, R/q is a G-domain, and so q is a G-ideal. • 

The next corollary follows easily from KrulTs theorem. 

Corollary 11.63. If I is an ideal in a commutative ring R, then \fl is the intersection of 
all the G-ideals containing I. 

Proof. By definition, \J1 = {r e R : r n e I for some n >1}. Therefore, \fl/I — 
nil (R/I) = Plp'—G-idcai P- For each p*, there is an ideal p containing I with p* = p//,and 
\[J — n p //=G-ideai P- Finally, every p involved in the intersection is a G-ideal, because 
R/ p = (R/I)/(p/I) = (R/I)/ p* and (R/I)/ p* is a G-domain. • 

We now focus on the relation between ideals in R[x] and those in R. 

Proposition 11.64. An ideal I in a commutative ring R is a G-ideal if and only if I is the 
contraction of a maximal ideal in R[x]. 

Proof If I is a G-ideal in R , then R/I is a G-domain. Hence, there is u e Frac( R/I) 
with Frac(R/7) = (R/I)[u~ 1 ]. Let cp: (R//)[x] -> (R/I)[u~ x ] be the R-algebra map 
taking x m _ 1 . Since q> is a surjection onto the field (R/I)[u~ l ] = Frac(R/7), its kernel 
m is a maximal ideal in (R/I)[x]. Since <p\(R/I) is an injection, we have mfl (R/I) = {0}. 
By Exercise 6.2, there is an ideal, necessarily maximal, m' in R[x] with m '/I = m, and 
m'n R = I. 

Conversely, assume that m is a maximal ideal in R[x] with m fl R = I. If v : 7?[v] -> 
R[x]/m be the natural map and u = v(x), then im v = (R/I)[u] is a field. Hence, R/I is 
a G-domain, by Proposition 11.58, and so I is a G-ideal. • 

Notation. If I is an ideal in a commutative ring R and if fix) e R[x], then / (x ) denotes 
the polynomial in (7?//)[v] obtained from fix) by reducing its coefficients mod/; that is, 
if f(x) = a i x ‘ - where a,- e R , then f(x) = J2i( a i + I)x' ■ 
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Corollary 11.65. Let R be a commutative ring, and let m be a maximal ideal in /?[*]. 
If the contraction m / = m fl R is a maximal ideal in R, then m = (tn / , fix)), where 
f(x) e J?[x] and fix) e (/t’/m , )[x] is irreducible. If R/m' is algebraically closed, then 
m = (m', x — a) for some a e R. 

Proof. First, Proposition 11.64 says that m' = m fl R is a G-ideal in R. Consider the 
map (p : J?[x] —> /m')[xr] which reduces coefficients mod m'. Since q> is a surjection, 

the ideal ip(m) is a maximal ideal; that is, ^>(m) = (g(x)), where g(x ) e (.R/m'jtx] is 
irreducible. Therefore, m = (m', fix)), where tpif) = g: that is, fix) — g(x). • 

Maximal ideals are always G-ideals, and G-ideals are always prime ideals. The next 
definition imposes the condition that G-ideals be maximal. In light of Proposition 11.64, 
this will force the contraction of maximal ideals in /?[x] to be maximal ideals in R. 

Definition. A commutative ring R is a Jacobson 5 ring if every G-ideal is a maximal 
ideal. 

Example 11.66. 

(i) Every field is a Jacobson ring. 

(ii) By Corollary 11.61, a PID R is a G-domain if and only if it has only finitely many 
prime ideals. Such a ring cannot be a Jacobson ring, for {0} is a G-ideal which is not 
maximal [J?/{0} = R is a G-domain]. On the other hand, if R has infinitely many prime 
ideals, then R is not a G-domain and {0} is not a G-ideal. The G-ideals, which are now 
nonzero prime ideals, must be maximal. Therefore, a PID is a Jacobson ring if and only if 
it has infinitely many prime ideals. 

(iii) We note that if R is a Jacobson ring, then so is any quotient R* — R/I. If p* is a 
G-ideal in R*, then R*/p* is a G-domain. Now p* = p/7 for some ideal p in R, and 
R/p = iR/I)/ip/I) — R*/p*. Thus, p is a G-ideal in R. Since R is a Jacobson ring, p is 
a maximal ideal, and R/p = /C/p* is a field. Therefore, p* is a maximal ideal, and so R* 
is also a Jacobson ring. 

(iv) By Corollary 11.63, every radical ideal in a commutative ring R is the intersection of 
all the G-ideals containing it. Therefore, if R is a Jacobson ring, then every radical ideal is 
an intersection of maximal ideals. -4 

Example 11.66(iv) suggests the following result. 

Proposition 11.67. A commutative ring R is a Jacobson ring if and only if every prime 
ideal in R is an intersection of maximal ideals. 

Proof. By Corollary 11.63, every radical ideal, hence, every prime ideal, is the intersec¬ 
tion of all the G-ideals containing I. But in a Jacobson ring, every G-ideal is maximal. 

-These rings are called Hilbert rings by some authors. In 1951. W. Krull and O. Goldman, independently, 
published proofs of the Nullstellensatz using the techniques in this subsection. Krull introduced the term Jacobson 
ring in his paper. 



936 


Commutative Rings III Ch. 11 


Conversely, assume that every prime ideal in R is an intersection of maximal ideals. We 
let the reader check that this property is inherited by quotient rings. Let p be a G-ideal in R, 
so that R/p is a G-domain. Thus, there is u ^ Oin R/p with Frac(J?/p) = (R/p)[u~ 1 ]. By 
Lemma 11.59, u lies in every nonzero prime ideal of R/p, and so u lies in every nonzero 
maximal ideal. Now every prime ideal in R/p is an intersection of maximal ideals; in 
particular, since R/p is a domain, there are maximal ideals me with {0} = C\ a m (y . If all 
these trie are nonzero, then u e n a me = {0}, a contradiction. We conclude that {0} is a 
maximal ideal. Therefore, R/p is a field, the G-ideal p is maximal, and R is a Jacobson 
ring. • 


Corollary 11.68. A commutative ring R is a Jacobson ring if and only if J(R/1) = 
nil [R/1) for every ideal I. In particular, J(R) — nil(/?). 

Proof Let R he a Jacobson ring. If I is an ideal in R, then \fl — P| m, where m is a 
maximal ideal containing I. Now J(R/I ) is the intersection of all the maximal ideals in 
R/P, that is, J(R) — p|(m//) = (P| m)// = \fl //. On the other hand, m\(R/I) consists 
of all the nilpotent elements in R/I. But 0 = (/ + I)" — /" + I holds if and only if 
/" e /; that is, / e \fl. To prove the converse, note that condition says that every radical 
ideal in R is an intersection of maximal ideals. In particular, every prime ideal is such an 
intersection, and so R is a Jacobson ring. • 

The next result will give many examples of Jacobson rings. 

Theorem 11.69. A commutative ring R is a Jacobson ring if and only if R[x] is a Jacob¬ 
son ring. 

Proof We have seen that every quotient of a Jacobson ring is a Jacobson ring. Hence, if 
/?[x] is a Jacobson ring, then R = R[x]/(x) is also a Jacobson ring. 

Conversely, suppose that R is a Jacobson ring. If q is a G-ideal in /?[*], then we may 
assume that q IT R — {0}, by Exercise 11.36 on page 938. If v : /?[x] —> /?[x]/q is the 
natural map, then /?[x]/q = /?[m], where u — v(x). Now R\u] is a G-domain, because 
q is a G-ideal; hence, if K — Frac(/?[t/]), then there is v e K with K = /?[m][u - 1 ]. If 
Frac( R) — F, then 

K — .R[m][v -1 ] c F[m][u _1 ] c K, 

so that F[m][u _1 ] = K\ that is, F[u] is a G-domain. But F[u] is not a G-domain if u 
is transcendental over F , by Corollary 11.61, for L[v] = F[u] has infinitely many prime 
ideals. Thus, u is algebraic over F, and hence u is algebraic over R. Since R[u] is a 
G-domain, Proposition 11.58 says that R is a G-domain. Now R is a Jacobson ring, and so 
R is a field, by Exercise 11.34 on page 938. But if R is a field, so is R[u], for u is algebraic 
over R. Therefore, R\u] = R{x]/q is a field, so that q is a maximal ideal, and R[x] is a 
Jacobson ring. • 
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Corollary 11.70. Ifk is afield, then k[x i, ..., x n ] is a Jacobson ring. 

Proof. The proof is by induction on n > 1. For the base step, k[x] is a PID having 
infinitely many prime ideals, by Exercise 11.40, and so it is a Jacobson ring, by Exam¬ 
ple 11.66(h). For the inductive step, the inductive hypothesis gives R = k[x\,..., x n -\] a 
Jacobson ring, and Theorem 11.69 applies. • 

Theorem 11.71. Ifm is a maximal ideal in k[x i,..., x n ], where k is an algebraically 
closed field, then there are a\, ... ,a n e k such that 

m = (xi - a\,... ,x„ - a n ). 

Proof. The proof is by induction on n > 1. If n — 1, then m = ( p(x)), where p(x) e 
k[x~\ is irreducible. Since k is algebraically closed, p (x ) is linear. For the inductive step, 
let R — k[x\,..., x „-\]. Corollary 11.70 says that R is a Jacobson ring, and so m IT R is a 
G-ideal in R, by Proposition 11.64. Since R is a Jacobson ring, m' = m fl R is a maximal 
ideal. Corollary 11.65 now applies to give m = (rtf, f(x n )), where f(x n ) e R[x „] and 
f(x n ) e (R/m')[x n ] is irreducible. As k is algebraically closed and R/m' is a finitely 
generated /-algebra, R/xn' = k, and we may assume that f(x n ) is linear; there is a„ e k 
with f n (x) — x n — a„. By the inductive hypothesis, m' = (x\ — a\,, x n -\ — a„-i) for 
a\,..., a n -1 e k, and this completes the proof. • 

We use Theorem 11.71 to prove the Weak Nullstellensatz, Theorem 6.100. Recall that 
only the special case of the Nullstellensatz for uncountable algebraically closed fields was 
proved in Chapter 6. 

Theorem 11.72 (Weak Nullstellensatz). If f] (X), ..., f,(X) e k[X], where k is an 
algebraically closed field, then I — (/), ..., f t ) is a proper ideal in /[2f] if and only if 
Var(/t ,..., ft) 0. 

Proof. If I is a proper ideal, then there is a maximal ideal m containing it. By Theo¬ 
rem 6.100, there is a — (a\, ..., a „) e k n with m = (x\ — a\, ..., x„ — a n ). Now /Cm 
implies Var(m) c Var(7). But a e Var(m), and so Var(/) 0. • 

We could now repeat the proof of the Nullstellensatz over C, Theorem 6.102, to obtain 
the Nullstellensatz over any algebraically closed field. However, the following proof is 
easier. 

Theorem 11.73 (Nullstellensatz). Let k be an algebraically closed field. If I is an ideal 
in k[x i,..., x n ], then Id(Var(/)) = V/- 

Proof. The inclusion Id(Var(/)) C Jf is easy to see. If f n (a) — 0 for all a e Var(/), 
then f(a) = 0 for all a e Var(/), because the values of / lie in the field k. Hence, 
/ e Id(Var(/)). For the reverse inclusion, note first that k[x\, ..., x„] is a Jacobson ring, 
by Corollary 11.70; hence. Example 11.66(iv) shows that \fl is an intersection of maximal 
ideals. Let g e Id(Var(/)). If m is a maximal ideal containing /, then Var(m) c Var(/), 
and so Id(Var(/)) C Id(Var(m)). But Id(Var(m)) = m: Id(Var(7)) C = m, because 
m is a maximal, hence prime ideal. Therefore, g e P| m = V/, as desired. • 
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Exercises 

11.34 Prove that a commutative ring R is a field if and only if R is a Jacobson ring and a G-domain. 

11.35 Let E/R be a ring extension in which R is a field and £ is a domain. 

(i) Let b e E be algebraic over R , prove that there exists an equation 

b n + r n _ i b' 1 ~ l -1- rib + r 0 = 0. 

where r; e R for all i and i-q ^ 0. 

(ii) If £ = R[b\ ,..., b m \, where each bj is algebraic over R , prove that £ is a field. 

11.36 Let £ be a Jacobson ring, and assume that (£/q , )[.r] is a Jacobson ring for every G-ideal q in 
£[v], where q' = q fl R. Prove that £[.t] is a Jacobson ring. 

11.37 (i) Prove that m = (,r 2 — y, y 2 — 2) is a maximal ideal in Q[x, y], 

(ii) Prove that there do not exist f(x ) e Q[jt] and g(y) e Q[v] with m = (f(x), g(y)). 

11.38 Let k be a field and let m be a maximal ideal in k[x \, ..., x n ]. Prove that 

in = (/i(*i), f 2 (xi,x 2 ), • • •, ■ ■ ■, x n —l)> fn(x l, • • •, x n )). 

Hint. Use Corollary 11.65. 

11.39 Prove that if R is noetherian, then nil(£) is a nilpotent ideal. 

11.40 If k is a field, prove that k[x\ has infinitely many prime ideals. 

Algebraic Integers 

We have mentioned that Kummer investigated the ring Z[£ p ], where p is an odd prime 
and (p is a primitive /;th root of unity. We now study rings of integers in algebraic number 
fields E further. Recall the definition: 

Oe — {a e E : a is integral over Z}. 

We begin with a consequence of Gauss’s lemma. 

Lemma 11.74. Let E be an algebraic number field with [£ : Q] = n, and let a e E be 
an algebraic integer. Then irr(a, Q) e Z[x] and deg(irr(a, Q)) | n. 

Proof. By Corollary 6.29, irr(a, Q) e Z[x], and so the result follows from Proposi¬ 
tion 3.117(v). • 

Definition. A quadratic field is an algebraic number field E with \ E : Q] = 2. 
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Proposition 11.75. Every quadratic field E has the form E = Q(Vrf), where d is a 
squarefree integer. 

Proof. We know that E — Q(a), where a is a root of a quadratic polynomial; say, a 2 + 
ba + c — 0, where b, c e Q. If D — b 2 — 4c, then the quadratic formula gives a — 
— \b± W~D, and so E — Q(a) = Q(\/Z)). Write D in lowest terms: D — U/V , where 
U, V e Z and ( U, V) = 1. Now U — ur 2 and V = vs 2 , where u, v are squarefree; hence, 
uv is squarefree, because (u, v) — 1. Therefore, Q(Vf>) = Q (yfitjv) — Q(^/uv), for 
V u/v — v / uv / v 2 — ~/uv/v. • 

We now describe the integers in quadratic fields. 

Proposition 11.76. Let E — Q (\fd), where d is a squarefree integer (which implies that 
d ^ 0 mod 4). 

(i) If d = 2 mod 4 or d = 3 mod 4, then Oe = 'L\-Jd\. 

(ii) If d = 1 mod 4, then Oe consists of all u + vVd) with u and v rational integers 
having the same parity. 

Proof. If a e E = Q(Vrf), then there are a, b e Q with a = a + b^fd. We first show 
that a e Oe if and only if 

2a eh and a 2 — db 2 e h. (3) 

If a e Oe, then Lemma 11.74 says that p(x) = irr(oi, Q) e Z[x] is quadratic. Now 
Gal(£/Q) = (cr),wherea: £’->£’ carries \[d —*Jd\ that is, 

a (a) — a — bVd. 

Since a permutes the roots of p(x), the other root of p(x) is a(a); that is, 

p(x) — (x — a)(x — cr(a)) = x 2 — lax + ( a 2 — db 2 ). 

Hence, Eqs. (3) hold, because p(x) e Z[x]. 

Conversely, if Eqs. (3) hold, then a e Oe, because a is a root of x 2 — 2 ax + (a 2 —db 2 ), 
a monic polynomial in Z[x]. 

We now show that 2b e Z. Multiplying the second equation in (3) by 4 gives (2a) 2 — 
d(2b) 2 e Z. Since 2 a e Z, we have d(2b) 2 e Z. Write 2b in lowest terms: 2b = m/n, 
where (m, n) — 1. Now dm 2 /n 2 e Z, so that n 2 \ dm 2 . But (n 2 , m 2 ) = 1 forces n 2 \ d\ 
as d is squarefree, n = 1 and 2b = m/n e Z. 

We have shown that a = ^u and b — ^v, where u,v e Z. Substituting these values 
into the second equation in (3) gives 

u 2 = dv 2 mod 4. (4) 

Note that squares are congruent, mod 4, either to 0 or to 1. If d = 2 mod 4, then the 
only way to satisfy Eq. (4) is u 2 = 0 mod 4 and v 2 = 0 mod 4. Thus, both u and v must 
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be even, and so a = \u + ^ vVd e l\\[d]. Therefore, Oe = in this case, for 

Z[Vc/] c Oe is easily seen to be true. A similar argument works when d = 3 mod 4. 
However, if d = 1 mod 4, then m 2 = i> 2 mod 4. Hence, v is even if and only if u is even; 
that is, u and v have the same parity. If u and v are both even, then a, b e Z and a e Oe- 
If u and v are both odd, then u 2 = 1 = v 2 mod 4, and so u 2 = dir mod 4, because 
d = 1 mod 4. Therefore, Eqs. (3) do hold, and so a lies in Oe- • 

If E — Q (s/d), where d e Z, then Z [\fd\ c Oe, but we now see that this inclusion 
may be strict. For example, ^ (1 + V5) is an algebraic integer (it is a root of x 2 — x — 6). 
Therefore, Z[V5] C Oe, where E — Q(\/5). 

The coming brief digression into linear algebra will enable us to prove that rings of 
integers Oe are noetherian. 

Definition. Let E/k be a field extension in which E is finite-dimensional. If n e E, then 
multiplication T„ : E —> E, given by T H : y uy, is a k-map. If e \,..., e n is a basis of 
E , then l'„ is represented by a matrix A = [iiij] with entries in k\ that is, 

/x(e, ) = uej = Y, cijje j. 

Define the trace tr(u ) = tr( F„) and the norm N{u) — det(r„). Define the trace form 
t: E x E -+ R by 

t(u, v) = tr(uv) = tr(T H J. 

The characteristic polynomial of a linear transformation, and hence, any of its coeffi¬ 
cients, is independent of any choice of basis of E/k, and so the definitions of trace and 
norm do not depend on the choice of basis. If u e k, then the matrix of l'„, with respect to 
any basis of E/k, is the scalar matrix ul. Hence, 

tr(n) = [E : k]u and N(u ) = u^ E k ^ if u e k. 

It is also easy to see that tr: E -> k is a linear functional and that N: E x —> A x is a 
(multiplicative) homomorphism. 

It is a routine exercise, left to the reader, to check that the trace form is a symmetric 
bilinear form. 

Example 11.77. 

If E — Q (Vd) is a quadratic field, then a basis for E /Q is 1, Vd. If u — a + bVd, then 
the matrix of F u is 

a bd 
b a ’ 

so that 

tr(w) = 2a and N (u) — a 2 — db 2 . 

Thus, trace and norm arose in the description of the integers in quadratic fields, in Eqs. (3). 
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We now show that u — a + b\[d is a unit in Oe if and only if N(u) — ± 1. If u is a unit, 
then there is v e Oe with 1 = uv. Hence, 1 = N(\) = N(uv ) = N(u)N(v), so that N(u ) 
is a unit in Z; that is, N(u) — ±1. Conversely, if N(u ) = ±1, then N(u) = N(u) — ±1, 
where u = a — b\fd. Therefore, N(uu) — 1. But uu e Q, so that 1 = N (uu) = (uu) 2 . 
Therefore uu — ±1, and so u is a unit. ◄ 


Lemma 11.78. Let E/k be a field extension of finite degree n, and let u e E. If u = 
Ml, . .., u s are the roots, with multiplicity, ofm(u, k) (in some extension field of E), that 
is, iir (u, k) = n- = j(* — Uj), then 

S S 

tr(n) = [£ : k(u)] u ,• and N(u ) = (]~[ Uj)^ E 

i=i i=i 

Remark. Of course, if u is separable over k, then irr (u , k) has no repeated roots and each 
Uj occurs exactly once in the formulas. ◄ 

Sketch of Proof. A basis of k(u) over k is 1, u, u 2 ,..., u s ~ l , and the matrix Ci of 
T„| k(u) with respect to this basis is the companion matrix of irr (u, k). If 1, V 2 ,..., v r 
is a basis of E over k(u), then the list 

1, u, ..., m s_1 , v\,v\u,..., i>i m' s_1 ,..., v r , v r u,..., v r u s ~ l 

is a basis of E over k. Each of the subspaces k(u ) and ( Vj , vju,..., VjU s ~ l ) for j > 2 is 
T„-invariant, and so the matrix of T„ relative to the displayed basis of E over k is a direct 
sum of blocks Ci © • • • © C r . In fact, the reader may check that each Cj is the companion 
matrix of irr (u, k). The trace and norm formulas now follow from tr(Cj © • • • © C r ) = 
Jfj tr (Cj) and det(Ci © • • • © C r ) — ]”[/ det(C 7 ). • 

If E/k is a field extension and u e E, then a more precise notation for the trace and 
norm is 

tr E/k(u) and N E /k(u)- 

Indeed, the formulas in Lemma 11.78 display the dependence on the larger field E. 

Proposition 11.79. Let R be a domain with F = Fraci R), let E/F be afield extension 
of finite degree [E : F] — n, and let u e E be integral over R. If R is integrally closed, 
then 

tr(i<) e R and N(u ) e R. 

Proof. The formulas for tr (u) and N(u) in Lemma 11.78 express each as an elementary 
symmetric function of the roots u — «],..., u s of irr(w, F). Since u is integral over R. 
Exercise 11.33(iii) on page 931 says that irr(w, F) e /?[x]. Therefore, u, and P| m, lie 
in R, and hence tr(u) and N(u) lie in R. • 
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In Example 4.35, we saw that if E/k is a finite separable extension, then its normal 
closure £ is a Galois extension of k. Recall from the fundamental theorem of Galois theory. 
Theorem 4.43, that if G — Gal(£/ k ) and H — Gal(£/ E ), then [G : H] — [E : k]. 


Lemma 11.80. Let E /k be a separable field extension of finite degree n — [E : k] and 
let E be a normal closure of E. Write G — GaI{E/ k) and H — Gal (E/E), and let T be a 
transversal of H in G\ that is, there is a disjoint union G — UctsT a H. 


(i) For all u e E, 


(ii) For all u e E, 


j~~[ (x — o(u)) = iir( m, k^ E[k h>)]_ 

ere T 


tr(w) = o(u) 
oeT 


and 


N(u) = ]”[ o(u). 

oeT 


Proof, (i) Denote Y\oeT^ x ~ CT ( M )) by h(x)\ of course, h(x) e E[x], 

We claim that the set X, defined by X = {o(u) : a e T], satisfies r(X) — X for 
every r e G. If a e T, then to e o'H for some o' e T , because T is a left transversal; 
hence, to = of] for some ?/ e H. But r o{ii) = of]{ii) — o'(u), because rj e H, 
and every element of H fixes E. Therefore, to(u) = o'(u ) e X. Thus, the function (p T , 
defined by o(u) tct(m), is a function X —> X. In fact, rp T is a permutation, because r 
is an isomorphism and so rp T \X is an injection. It follows that every elementary symmetric 
function on X — {o(u) : o e T] is fixed by every r e G. Since E/k is a Galois extension, 
each value of these elementary symmetric functions lies in k. We have shown that all the 
coefficients of h(x) lie in k, and so h{x) e k[x~\. We now compare h(x) and irr(t/, k). If 
o e G, then o permutes the roots of irr(w, k), so that every root o{u) of h(x) is also a root 
of irr (u, k). By Exercise 3.86 on page 197, we have 


h (x) — iiT (m , k) m 


for some m > 1, and so it only remains to compute m. Now 

deg(7z) = m degfirr (u, k)) = m[k(u ) ; k]. 

But deg(/z) — [G : H] — [E : k], and so m — [E : k]/[k(u) : k] = [E : k(u)]. 
(ii) Recall our earlier notation: irr( m, k) = P[ ( v = | (x — m ; ). Since 

Y[ ( x - = trr(u, k) [E:k(u) 1 = (]~[G' - Ui)) [hMu)] , 

oeT i'=l 

their constant terms are the same. 


, * V [E-Mu)] 

± cr(M) = 

oeT i=l 
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and their penultimate coefficients are the same, 

— <j(u) = — [E : k(u )] ui. 

oeT i=i 


By Lemma 11.78, tr(n) = [E : k(u)] ] itj and N(u) — (ri/=i Uj)^ Eki ' u) ^. It follows 

that 

tr(n) = [£ : k(u)] itj = a (it) 

1 = 1 <T€ T 

and 

Af(n)= (fl««) [£ * ( " )] = I1 CT(M) - * 

f=l <re T 


Definition. Let E/ k be a finite field extension, let £ be a normal closure of E, and let T 
be a left transversal of Gal (E/E) in Gal (E/k). If u e E. then the elements a (it), where 
(TeL, are called the conjugates of u. 

If E/k is a separable extension, then the conjugates of u are the roots of irr(n, k)\ in the 
inseparable case, then the conjugates may occur with multiplicities. 

Corollary 11.81. If E/k is a Galois extension with G — Gal(£ /k), then 

tr (u) = cr(u) and N(u) = | a (n). 

<rsG a eG 

Proof. Since E/k is a Galois extension, E is its own normal closure, and so a transversal 
T of G in itself is just G. • 

This last corollary shows that the norm here coincides with the norm occurring in Chap¬ 
ter 4 in the proof of Hilbert’s Theorem 90. 

Let V be a vector space over a field k, and let /: V x V —*■ k be a bilinear form. If 
e \,..., e n is a basis of V. then the discriminant is defined by 

D(e i,..., e„) = det([/(e/, ej)]). 

Recall that / is nondegenerate if there is a basis whose discriminant is nonzero (it then 
follows that the discriminant of / with respect to any other basis of V is also nonzero). 

Lemma 11.82. If E / k is a finite separable 6 field extension, then the trace form is nonde¬ 
generate. 


f 'If E/k is inseparable, then the trace form is identically 0. See Isaacs, Algebra. A Graduate Course, page 
369. 
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Proof. We compute the discriminant using Lemma 11.80 (which uses separability). Let 

T — {(J\ .er„} be a transversal of GaKL/ E ) in Gal (E/k), where E is a normal closure 

of E. 

Die i- ,e n ) = det([t(ei,ej)]) 

= det([tr (etej)]) 

— det[^^ fff (e/ej)] (Lemma 11.80) 

t 

= d et (E^(G'K(^)]) 

e 

— det([ac(e,-)]) det([a£(e,-)]) 

= det(K;(G)]) 2 . 

To see that det([erg(e,)]) 0, we assume otherwise. If [erg (e/)] is singular, there is a 
column matrix C — [ci, ..., c n ]‘ e E" with [<7£(e,)]C — 0. Hence, 

Cicriiej) H-1- c n cr n {ej) = 0 

for j = \,... ,n. It follows that 

ClCTl(u) H-bc„CT„(t>) = 0 

for every linear combination v of the e,. But this contradicts the independence of charac¬ 
ters, Proposition 4.30. • 

Proposition 11.83. Let R be integrally closed, and let F — Frac(/G. If E/F is a finite 
separable field extension of degree n, and if O — Oe/r is the integral closure of R in E, 
then O can be imbedded as a submodule of a free R-module of rank n. 

Proof. Let e \,..., e„ be a basis of E/F. By Proposition 11.46, for each i there is r,- e R 
with r, e; e O; changing notation if necessary, we may assume that each e, e O. Now 
Corollary 9.76, which uses nondegeneracy of bilinear forms, says that there is a basis 
/i,..., f n of E with tiei, fj) = tr(e,//) = Sjj. 

Let a e O. Since fi,..., f n is a basis, there are Cj e F with a — ^ cjfj. For each 
i, where 1 < i < n, we have e,a e O (because cy e O). Therefore, tr(c,«) e R, by 
Proposition 11.79. But 

tr (e/a) = tr(^c ; e,/ ; ) 
j 

j 

= c j&ij 

= Ci. 

Therefore, c, e R for all i, and so a — JT c, /, lies in the free /^-module with basis 

/l. • 
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Definition. If E is an algebraic number field, then an integral basis for Oe is a list 
fix in Oe such that every a e Oe has a unique expression 


a = cifii H-b c n fi n , 


where c\ e Z for all i. 

We now prove that integral bases always exist. 

Proposition 11.84. Let E be an algebraic number field. 

(i) The ring of integers Oe has an integral basis, and hence it is a free abelian group of 
finite rank under addition. 

(ii) Oe is a noetherian domain. 

Proof, (i) Since Q has characteristic 0, the field extension E/Q is separable. Hence, 
Proposition 11.83 applies to show that Oe is a submodule of a free Z-module of finite 
rank; that is, Oe is a subgroup of a finitely generated free abelian group. By Corollary 9.4, 
Oe is itself a free abelian group. But a basis of Oe as a free abelian group is an integral 
basis. 

(ii) Any ideal I in Oe is a subgroup of a finitely generated free abelian group, and hence 
I is itself a finitely generated abelian group, by Proposition 9.7. A fortiori, I is a finitely 
generated O^-module; that is, I is a finitely generated ideal. • 

Example 11.85. 

We show that Oe need not be a UFD, and hence it need not be a PID. Let E = Q(V~5). 
Since —5 = 3 mod 4, Proposition 11.76 gives Oe = Z[V~5j. By Example 11.77, the 
only units in Oe are elements u with N(u) = ±1. If a 2 + 5b 2 = ±1, where a, b e Z, then 
b — 0 and a = ±1, and so the only units in Oe are ±1. Consider the factorization in Oe- 


2 • 3 = (l + V—5)(l — V—5). 


Note that no two of these factors are associates (the only units are ±1), and we now show 
that each of them is irreducible. If v e Oe divides any of these four factors (but is not an 
associate of it), then N( v) is a proper divisor in Z of 4, 9, or 6, for these are the norms of the 
four factors (N (1 + V—5) = 6 = N (1 — V—5)). It is quickly checked, however, that there 
are no such divisors in Z of the form a 2 + 5b 2 other than ±1. Therefore, Oe — Z[V—5 ] 
is not a UFD. ◄ 

Trace and norm can be used to find other rings of integers. 

Definition. If ti ^ 2, then ci cyclotomic Jiald is E — where ^ is cl primitive /ith 

root of unity. 
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Recall that if p is prime, then the cyclotomic polynomial 

OpO) = x p ~ l + x p ~ 2 H-+ x + 1 e Z[x\ 

is irreducible, so that irr(£ p , Q) = O p (x) and [Q(f p )/Q] = p — 1. Moreover, 

Gal(Q(f p )/Q) = {(Ti,... ,o p -\}, 
where cry: i—for i = 1,..., p — 1. 

We do some elementary calculations in E = Q () to enable us to describe Of. 

Lemma 11.86. Let p be an odd prime, and let E — Q(f), where t, = C, p is a primitive 
pth root of unity. 

(i) tr(f') = — 1 for 1 < i < p — 1. 

(ii) tr(l — £') = p for 1 <i < p — 1. 

(iii) P = nfJi 1 (1 - C ) = N{1 - a 

(iv) o £ (i-?)nz = pZ. 

(v) tr(n(l — £)) e pZ for every u e Oe- 

Proof, (i) We have tr(£) = V — ‘I 'p(f) — 1. which is also true for every primitive 

/?th root of unity f. The result follows from tt> p (£) = 0. 

(ii) Since tr(l) = [E : Q] = p — 1 and tr is a linear functional, 

tr(l - C') = tr(l) - tr(C') = (p - l) - (-1) = p. 

(iii) Since <t>(x) = + x p ~ 2 + ■ ■ ■ + x + 1, we have < J > P (1) = p. On the other hand, 

the primitive pth roots of unity are the roots of <t> p (x), so that 

p -1 

*p(x)= 

i'=t 

Evaluating at x — 1 gives the first equation. The second equation holds because the 1 — f' s 
are the conjugates of 1—0 

(iv) The first equation in (iii) shows that p e Oe (1 — £) 0 Z, so that Oe( 1 — f) OZ 3 pZ. 
If this inclusion is strict, then Oe( 1 — f) fl Z = Z, because pZ is a maximal ideal in Z. In 
this case, Oe( 1 -()OZ = Z, hence Z C Oe( 1 — O- and so 1 e Oe( 1 — ?)• Thus, there 
is v e Oe with i>(l — f) = 1; that is, 1 — f is a unit in Oe■ But if 1 — f is a unit, then 
N (1 — £) = ±1, contradicting the second equation in (iii). 

(v) Each conjugate cry (n(l — £)) = cry (n)(l — £') is, obviously, divisible by 1 — tf in Oe- 
But 1 — £' is divisible by 1 — £ in Oe , because 

1 - = (1 - 0(1 + K + C 2 + • • • + C' -1 )- 
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Hence, er,( 1 — £') e Oe( 1 — f) for all i, and so JT(m( 1 — £')) e 0£(1 — f). By 
Corollary 11.81, JT(w(l — £)) = tr(w Cl — £). Therefore, trCw(1 — £) e 0£(1 —f )flZ = pZ, 
by (iv), for tr(M(l — £)) e Z, by Proposition 11.79. • 


Proposition 11.87. If p is an odd prime and E — Q () is a cyclotomic field, then 

o E = m P i 

Proof. Let us abbreviate f p to £. It is always true that Z[f ] C Oe, and we now prove that 
the reverse inclusion also holds. By Lemma 11.74, each element u e Oe has an expression 

U — Co + ci£ + C2f 2 + • • • + Cp—2^ P 2 , 

where c, e Q (remember that [E : Q] — p — 1). We must show that c; e Z for all i. 
Multiplying by 1 — £ gives 

m( 1 — f) = co(l — ?) + c l(? — ?") + • • • + c p-l($ p ~ ~ K p 1 )- 

By (i), tr(£' — £' +1 ) = tr(£') — tr(£' +1 ) = 0 for 1 < i < p — 2, so that tr(w(l — £)) — 
co tr(l — f); hence, tr(u(l — C) — P c o- because tr(l — f ) = p, by (ii). On the other hand, 
tr(M(l — £)) e pZ, by (iv). Hence, pco — nip for some m e Z, and so co e Z. Now 
£ -1 = e Oe, so that 

(M — Co)C 1 = Cl + C2^ + • • • + C p -2^ P 3 € C7£. 

The argument just given shows that ci e Z. Indeed, repetition of this argument shows that 
all ci e Z, and so u e Z[£], • 

Before we leave this interesting topic, we must mention a beautiful theorem of Dirich- 
let. For proofs of the following statements, see Samuel, Algebraic Theory of Numbers, 
Chapter 4. An algebraic number field E of degree n has exactly n imbeddings into C. If r\ 
is the number of such imbeddings with image in R, then n — ri is even; say, n — r\ = 2ri. 


Theorem (Dirichlet Unit Theorem). Let E be an algebraic number field of degree 
n. Then n — r i + 2ri, (where r\ is the number of imbeddings of E into R), and the 
multiplicative group U(Oe) of units in Oe is a finitely generated abelian group. More 
precisely, 

U(O e ) = Z n+r2 “ 1 x T, 


where T is a finite cyclic group consisting of the roots of unity in E. 
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Exercises 


11.41 (i) If E = Q(V— 3), prove that the only units in Oe are 

±1, !(1±V=3), 1(-1±V=3), 


11.42 


(ii) Let d be a negative squarefree integer with cl ^ — 1 and cl ^ —3. If E = Q (s/d), prove 
that the only units in Oe are ± 1. 

(i) Prove that if E = Q(s/2) C R, then there are no units u e Oe with 1 < u < 1 + s/2. 

(ii) If E = Q(\/2), prove that Oe has infinitely many units. 

Hint. Use (i) to prove that all powers of 1 + s/2 are distinct. 


Definition. If Oe is the ring of integers in an algebraic number field E. then a discriminant of 

O e is 

mo E )="/) 2 ’ 

><j 

where oq, ..., a n is an integral basis of Oe- 


11.43 


11.44 


11.45 


Let d be a squarefree integer, and let E = Q (s/d). 

(i) If cl = 2 mod 4 or d = 3 mod 4, prove that 1, s/d is an integral basis of Oe, and prove 
that a discriminant of Oe is 4 cl. 

(ii) If d = 1 mod 4, prove that 1. 4(1 + s/d) is an integral basis of Oe , and prove that a 
discriminant of Oe is d. 

Let p be an odd prime, and let £ = Q(fp) be the cyclotomic field. 

(i) Show that 1, 1 — fp, (1 — ?p) 2 .(1 — is an integral basis for Oe- 

(ii) Prove that a discriminant of Oe is (—1)2 1 ' pP~ 2 . 

Hint. See Pollard, The Theory of Algebraic Numbers, page 67. 

(i) If A is the field of all algebraic numbers, prove that C7& is not noetherian. 

(ii) Prove that every nonzero prime ideal in O& is a maximal ideal. 

Hint. Use the proof of Corollary 11.53. 


Characterizations of Dedekind Rings 

The following definition involves some of the ring-theoretic properties enjoyed by the ring 
of integers Oe in an algebraic number field E. 

Definition. A domain R is a Dedekind ring if it is integrally closed, noetherian, and its 
nonzero prime ideals are maximal ideals. 
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Example 11.88. 

(i) The ring Oe in an algebraic number field £ is a Dedekind ring, by Proposition 11.46, 
Proposition 11.84, and Corollary 11.53. 

(ii) Every principal ideal domain £ is a Dedekind ring. -4 

It is shown, in Example 11.85, that R — Z[V~5 ] is a Dedekind ring that is not a UFD 
and, hence, it is not a PID. We remind the reader that E. Kummer, in his investigations 
into Fermat’s last theorem in the 1840s, recognized such examples, and he forced unique 
factorization by adjoining “ideal” numbers to rings of integers. About 30 years later, R. 
Dedekind introduced the modern definition of ideal, and showed that Kummer’s ideal num¬ 
bers correspond to Dedekind’s ideals. We will prove, in Theorem 11.95, that every nonzero 
ideal in a Dedekind ring has a unique factorization as a product of prime ideals. 

We now characterize DVRs, and then show that localizations of Dedekind rings are 
well-behaved. 

Lemma 11.89. A domain R is a DVR if and only if it is noetherian, integrally closed, 
and has a unique nonzero prime ideal. 

Proof. If R is a DVR, then it does have the required properties (recall that R is a PID, 
hence it is integrally closed). 

The converse, which requires us to show that R is a PID, is not as simple as we would 
expect. Let p be the nonzero prime ideal, and choose a nonzero a e p. Define M = R/Ra, 
and consider the family A of all the annihilators ann(m) as m varies over all the nonzero 
elements of M. Since R is noetherian, it satisfies the maximum condition, and so there is 
a nonzero element b + Ra e M whose annihilator q = ann(Z? + Ra) is maximal in A. 
We claim that q is a prime ideal. Suppose that x, y e R, xy e q, and x, y f q. Then 
y(b + Ra) — yb + Ra is a nonzero element of M, because y f q. But ann {yb + Ra) D 
ann (b + Ra), because x f an ill/? + Ra), contradicting the maximality property of q. 
Therefore, q is a prime ideal. Since R has a unique nonzero prime ideal p, we have 

q = ann(Z? + Ra) — p. 


Note that 

b/a f R. 

Otherwise, b+Ra = 0+ Ra, contradicting b+Ra being a nonzero element of M — R/Ra. 

We now show that p is principal, with generator a/b (we do not yet know whether 
a/b e Frac (R) lies in R). First, we have pb — qb C Ra, so that p (b/a) c R-, that 
is, p (b/a) is an ideal in R. If p (b/a) C p, then b/a is integral over R, for p is a finitely 
generated R-submodule of Frac ( R), as required in Lemma 11.41. As R is integrally closed, 
this puts b/a e R, contradicting what we noted at the end of the previous paragraph. 
Therefore, p (b/a) is not a proper ideal, so that p (b/a) — R and p = R(a/b). It follows 
that a/b e R and p is a principal ideal. 

Denote a/b by t. The proof is completed by showing that the only nonzero ideals in 
R are the principal ideals generated by t n , for n > 0. Let / be a nonzero ideal in R, and 
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consider the chain of submodules of Frac(R): 

I c It~ l c It ~ 2 c • • • . 

We claim that this chain is strictly increasing. If It~" — It~ n ~ l , then the finitely generated 
R-module 7f _1 satisfies t~ l (It~") C It~ n , so that f -1 = b/a is integral over R. As 
above, R integrally closed forces b/a e R, a contradiction. Since R is noetherian, this 
chain can contain only finitely many ideals in R. Thus, there is n with It~ n C R and 
It~ n ~ l <£ R. If It~ n C p — Rt, then It~"~ l C R, a contradiction. Therefore, It~ n — R 
and I — Rt", as desired. • 

Proposition 11.90. If R is a noetherian domain, then R is a Dedekind ring if and only if 
for every nonzero prime ideal p, the localization Rp is a DVR. 

Remark. Exercise 11.45 on page 948 shows that it is necessary to assume that R is 
noetherian. 4 

Proof If R is a Dedekind ring and p is a maximal ideal. Corollary 11.18(iv) shows that 
Rp has a unique nonzero prime ideal. Moreover, Rp is noetherian (Corollary 11.18(v)), a 
domain (Corollary 11.16), and integrally closed (Exercise 11.26). By Lemma 11.89, Rp is 
a DVR. 

For the converse, we must show that R is integrally closed and that its nonzero prime 
ideals are maximal. Let u/v e Frac( R) be integral over R. For every nonzero prime ideal 
p, the element u/v is integral over R v (note that Frac(R p ) = Frac( /?)). But Rp is a PID, 
hence is integrally closed, and so u/v e Rp. We conclude that u/v e f) p Rp — R, by 
Proposition 11.20. Therefore, R is integrally closed. 

Suppose there were nonzero prime ideals p C q in R. By Corollary 11.18(iv), p q C q q 
in Rq. This contradicts the fact that a DVR has a unique nonzero prime ideal. Therefore, 
nonzero prime ideals are maximal, and so R is a Dedekind ring. • 

Let R be a domain with F = Frac(R), and let I — Ra be a nonzero principal ideal in 
R. If we define J — Ra~ l C F, the cyclic R-submodule generated by a~ l , then it easy to 
see that 

IJ — {uv : u e I and v e J) = R. 

Definition. If R is a domain with F — Frac(R), then a fractional ideal is a finitely 
generated nonzero R-submodule of F. If I is a nonzero ideal in R, then 

L 1 = {v e F : vl c R}. 

It is always true that I~ l I C R\ a fractional ideal I is invertible if I~ l I — R. 

Every finitely generated ideal in R is also a fractional ideal. In this context, we often 
call such ideals (which are the usual ideals!) integral ideals when we want to contrast them 
with more general fractional ideals. 
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We claim that if I = Ra is a nonzero principal ideal in R. then 7 1 = Ra~ l . Clearly, 
(ra~ l )(r'a) — rr' e R for all r’ e R , so that Ra~ l C 7 _1 . For the reverse inclusion, 
suppose that (u/v)a e R, where u, v e R. Then v | ua in R, so there is r e R with 
rv — ua. Hence, in F, we have u — rva ~ l , so that u/v = (rva~ 1 )/v — ra~ l . Therefore, 
every nonzero principal ideal in R is invertible. 

Lemma 11.91. If R is a domain with F — Frac(7?), then a fractional ideal I is invertible 
if and only if there exist a\ ,..., a n e I and q\,... ,q n ef with 

(i) q,I C R for i — 1 

(ii) 1 = YJi =l mi- 

Proof. If I is invertible, then I~ l I — R. Since 1 e 7 _1 7, there are a\, ..., a n € R and 
qi, ..., q n € I~ l with 1 = JT q,aj. Since q / e 7 _1 , we have q/I C R. 

To prove the converse, the /^-submodule J of F generated by q \...., q„ is a fractional 
ideal. Since 1 = Y^"=\ ( li a i e JI is an R -submodule of R containing 1; that is, 
JI — R. To see that I is invertible, it remains to prove that J = I~ l . Clearly, each 
qi e 7 _1 , so that J C 7 _1 . For the reverse inclusion, assume that u e F and ul C R. 
Since 1 = ffj qia,. we have u — 'f2 j (uaj)qj e J because uaj e R for all i. • 

Corollary 11.92. Every invertible ideal I in a domain R is finitely generated. 

Proof. Since 7 is invertible, there exist a\, ..., a n e I and q\,... ,q n e F as in the 
lemma. If b e 7, then b = h 1 = ff ; hqqij <= 7, because bq, e R. Therefore, 7 is 
generated by a \,..., a n e 7. • 

Proposition 11.93. The following conditions are equivalent for a domain R. 

(i) R is a Dedekind ring. 

(ii) Every fractional ideal is invertible. 

(iii) The set of all the fractional ideals T (R) forms an abelian group under multiplication 
of ideals. 

Proof, (i) =>• (ii). 

Let / be a fractional ideal in R. Since R is a Dedekind ring, its localization R v is 
a PID, and so / p , as every nonzero principal ideal, is invertible (in Theorem 9.3, in the 
course of proving that finitely generated torsion-free abelian groups are free abelian, we 
really proved that fractional ideals of PIDs are cyclic modules). Now Exercise 11.50 on 
page 958 gives 

(/ _1 T)p = (/ _1 )p/ p = (/ p ) _1 Tp = R P . 

Proposition 11.30 gives T -1 / = R. and so J is invertible. 

(ii) (iii). 
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If I. J e T( R). then they are finitely generated, by Corollary 11.92, and 
" = {£ a^bi : Uf G I and (>( e i| 

is a finitely generated R -submodule of Frac(7?). If I = (ai, , a„) and J = {b\,..., h m ). 
then IJ is generated by all a,b j. Hence, IJ is finitely generated and IJ e T(R). Asso¬ 
ciativity does hold, the identity is R. and the inverse of a fractional ideal J is / , because 

J is invertible. It follows that T(R ) is an abelian group. 

Conversely, if T(R) is an abelian group and I e T( R), then there is J e TiR) with 
JI — R. We must show that J — 7 _1 . But 

R = JI C I~ l I C R, 

so that JI — 7 _1 7. Canceling I in the group T(R) gives J = 7 , as desired. 

(iii) =*■ (i). 

First, R is noetherian, for (iii) => (ii) shows that every nonzero ideal I is invertible, and 
Corollary 11.92 shows that I is finitely generated. 

Second, we show that every nonzero prime ideal p is a maximal ideal. Let I be an ideal 
with p C I (we allow I = R). Then p7 _1 c 77 _1 = R , so that p7 _1 is an (integral) ideal 
in R. Now (p7 _1 )7 = p, because multiplication is associative in T(R). Since p is a prime 
ideal. Proposition 6.13 says that either p7 _1 C p or 7 C p. The second option does not 
hold, so that p7 _1 C p. Multiplying by p -1 7 gives R C 7. Therefore, I = R, and so p is 
a maximal ideal. 

Third, if a e Frac( R) is integral over R, then Lemma 11.41 gives a finitely generated R- 
submodule J of Frac(7?), i.e., a fractional ideal, with aJC.J. Since J is invertible, there 
are q\, ..., q n e Frac(7?) and a\,...,a n e J with q, J C R for all i and 1 = ^ qta-,. 
Hence, a = q,a,a. But a, a e J and q\J C R gives a = <Ii («;«) e R. Therefore, 
R is integrally closed, and hence it is a Dedekind ring. • 


Proposition 11.94. 

(i) If R is a UFD, then a nonzero ideal I in R is invertible if and only if it is principal. 

(ii) A Dedekind ring R is a UFD if and only if it is a PID. 

Proof, (i) We have already seen that every nonzero principal ideal is invertible. Con¬ 
versely, if 7 is invertible, there are elements a \,..., a n el and q \..... q n e Frac(7?) with 
1 = qiUj and g;7 C 71 for all i. Let qi = h, /a, where b et e R. Since R is a UFD, we 
may assume that q ,• is in lowest terms; that is, {bj, ct) — 1. But (7>,/c,)a/ e R says that 
Cj | biuj, so that c; | aj for all i. j , by Exercise 6.18(i) on page 339. We claim that 7 = Rc, 
where c = lcmfci, ..., c„}. First, cel. for cbi/c, e R and c = cl = '}f i (cbi/ci)ai. 
Hence, Rc C 7. For the reverse inclusion. Exercise 6.18(ii) on page 339 shows that c \ a j 
for all j, so that aj e Rc. for all j. and so 7 c Rc. 

(ii) Since every nonzero ideal in a Dedekind ring is invertible, it follows from (i) that if R 
is a UFD, then every ideal in R is principal. • 
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Definition. If R is a Dedekind ring, then its class group C(R) is defined by 

C{R) = F{R)/V(R), 

where V(R) is the subgroup of all nonzero principal ideals. 

Dirichlet proved, for every algebraic number field E, that the class group of C(Oe ) is 
finite; the order |C(/?)| is called the class number of Oe- The usual proof of finiteness of 
the class number uses a geometric theorem of H. Minkowski which says that sufficiently 
large parallelopipeds in euclidean space must contain lattice points (see Samuel, Algebraic 
Theory of Numbers, pages 57-58). 

L. Claborn proved, for every (not necessarily finite) abelian group G, that there is a 
Dedekind ring R with C(R) = G. 

We can now prove the result linking Kummer and Dedekind. 

Theorem 11.95. If R is a Dedekind ring, then every proper nonzero ideal has a unique 
factorization as a product of prime ideals. 

Proof. Let S be the family of all proper nonzero ideals in R that are not products of prime 
ideals. If S = 0, then every nonzero ideal in R is a product of prime ideals. If <S f 0, 
then S has a maximal element I, because noetherian rings satisfy the maximum condition 
(Proposition 6.38). Now I cannot be a maximal ideal in R, for a “product of prime ideals” 
is allowed to have only one factor. Let m be a maximal ideal containing I. Since I C m, we 
have m -1 / C m -1 m = IP. that is, m -1 / is a proper ideal properly containing I. Neither 
m nor m -1 / lies in S, for each is strictly larger than a maximal element, namely, /, and 
so each of them is a product of prime ideals. Therefore, I = m(m -1 /) (equality holding 
because R is a Dedekind ring) is a product of prime ideals, contradicting I being in S. 
Therefore, S — 0, and every proper nonzero ideal in R is a product of prime ideals. 

Suppose that pi • • • p r = qi • • • q v , where the p; and q j are prime ideals. We prove 
unique factorization by induction on max{r, 5 }. The base step r = 1 = s is obviously 
true. For the inductive step, note that pi 3 qj • • • q s , so that Proposition 6.13 gives q ; with 
Pi =! q/■ Hence, pi = qbecause prime ideals are maximal. Now multiply the original 
equation by pj -1 and use the inductive hypothesis. • 

Corollary 11.96. If R is a Dedekind ring, then T(R ) is a free abelian group with basis 
all the nonzero prime ideals. 

Proof. Of course, TiR) is written multiplicatively. That every fractional ideal is a prod¬ 
uct of primes shows that the set of primes generates T(R)\ uniqueness of the factorization 
says the set of primes is a basis. • 

In light of Theorem 11.95, many of the usual formulas of arithmetic extend to ideals in 
Dedekind rings. Observe that in Z, the ideal (3) contains (9). In fact, Z m 0 Z n if and only 
if m | n. We will now see that the relation “contains" for ideals is the same as “divides,” 
and that the usual formulas for gcd’s and lcm’s (Proposition 1.17) generalize to Dedekind 
rings. 
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Proposition 11.97. Let I and J be nonzero ideals in a Dedekind ring R, and let their 
prime factorizations be 

7 = Pl‘ ' ’ ’ Pn" and J = pf 1 -"pf’> 

where e,- > 0 and /,• > 0 for all i. 

(i) / C I if and only if I = J L for some ideal L. 

(ii) / C I if and only if /; < ej for all i. 

(iii) If mi = min{e;, f) and Mi — max{e,, /;}, then 

/ D / = pf 1 •.. pf" and I + J = pr-'-p™"- 

In particular, I + J = R if and only ifmm{ei, /;} = 0 for all i. 

(iv) Let R be a Dedekind ring, and let I = p^ 1 • • • pf be a nonzero ideal in R. Then 

R/I = R/pf - --pf = (*/p?) X ••• x (R/ pf,"). 


Proof (i) If I C J, then J 1 1 C R, and 

J(J-'l) = /. 

Conversely, if I = JL, then I C J because JL C. JR — J. 

(ii) This follows from (i) and the unique factorization of nonzero ideals as products of 
prime ideals. 

(iii) We prove the formula for I + J. Let I + J = p'j 1 • • • p'" and let A — p'” 1 • • • p” ! ". Since 
/ C I + J and J C I + J, we have r, < e, and r, < f, so that r, < minje,, f) j = m,. 
Hence, A C / + J, For the reverse inclusion, A C I and Ac/, so that A = II' and 
A — JJ' for ideals I' and J', by (i). Therefore, / + / = AI' + AJ' — A(I' + J'), and so 
I + / c A. The proof of the formula for IJ is left to the reader. 

(iv) This is just the Chinese remainder theorem. Exercise 6.11 (iii) on page 325, so that it 
suffices to verify the hypothesis that p'f' and p ' are coprime when i j; that is, p e : l +p ( = 

l J 1 J 

R. But this follows from (iii). • 

Recall Proposition 7.58: an R-module A is projective if and only if it has a projective 
basis: there exist elements {aj : j e /} c A and R-maps {<p ; -: A —> R : j e /} such that 

(i) for each x e A, almost all tpjix ) = 0; 

(ii) for each x e A, we have x — ^j € j(<Pj x ) a j- 
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Proposition 11.98. 

(i) A nonzero ideal I in a domain R is invertible if and only if I is a projective R-module. 

(ii) A domain R is a Dedekind ring if and only if every ideal in R is projective. 

Proof, (i) If I is invertible, there are elements a\,.. ., a n e I and q\ . q n e Tract R ) 

with 1 = q,a, and q, I Cl R for all i. Define tpi : I —> R by (pi: a i— q,a (note that 

mupi C I because q,I C R). If a e /, then 

( pi(a)ai = q,aai = a ^^qia, = a. 

i i i 

Therefore, I has a projective basis, and so I is a projective R-module. 

Conversely, if / is a projective, it has a projective basis \<pj : j e /}, [a j : j e J }. If 
b e I is nonzero, define qj e Frac(/?) by 

qj = cpj(b)/b. 

This element does not depend on the choice of nonzero b: if b' e / is nonzero, then 

b’tpjfb) = (pj(b'b) = btpjib’), so that (pj(b)/b — (pj(b')/b’. To see that qjl C R. note 

that if b e I is nonzero, then qjb = (< pj(b)/b)b — q>j(b) e R. By item (i) in the definition 
of projective basis, almost all <pj(b) — 0, and so there are only finitely many nonzero 
q! = (pj(b)/b (remember that q t does not depend on the choice of nonzero b e I). 
Item (ii) in the definition of projective basis gives, for b e /, 

b = '^2 ( Pj( b > a j = YMl b »l = *>(£**)■ 

j j j 

Canceling b gives 1 = Finally, the set of those aj with indices j for which 

qj f 0 completes the data necessary to show that I is an invertible ideal. 

(ii) This follows at once from (i) and Proposition 11.93. • 

Example 11.99. 

We have seen that R = Z[V~5 ] is a Dedekind ring that is not a PID. Any non-principal 
ideal gives an example of a projective /^-module that is not free. ■* 


Remark. A not necessarily commutative ring R is called left hereditary if every left ideal 
is a projective R -module (there exist rings that are left hereditary but not right hereditary). 
Some examples of left hereditary rings aside from Dedekind rings are semisimple rings, 
noncommutative principal ideal rings, and FIRs (free ideal rings —all left ideals are free R- 
modules). P. M. Cohn proved that polynomial rings over a field in noncommuting variables 
are FIRs, and so there exist left hereditary rings that are not left noetherian. ◄ 

The projective and injective modules over a Dedekind ring are well-behaved. 
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Lemma 11.100. A left R-module P (over any ring R) is projective if and only if every 
diagram below with E injective can be completed to a commutative diagram. The dual 
characterization of injective modules is also true. 

P 

A 

E - E" -0 

Proof. If P is projective, then the diagram can be completed for every not necessarily 
injective module E. Conversely, we must show that the diagram 


P 


A 


f 



can be completed for any module A and any surjection g: A -> A". By Theorem 8.104, 
there is an injective /?-module E and an injection a : A -> E. Define E" — coker oi — 
E / imcr/, and consider the commutative diagram with exact rows 


P 


0- A' 

C' 

V 

0-»- A' 



0 


E E" -*■ o, 


where v: E E" = coker oi is the natural map and h: A" -> E" exists by Propo¬ 
sition 8.93. By hypothesis, there exists a map jr: P E with vn — hf. We claim 
that im7r C imcr. For x e P, suijectivity of g gives a e A with ga = fx. Then 
vixx = hf x = hga — voa, and so rex — oa e ker v = imcr/; hence, txx — oa — oia' 
for some a’ e A and so ttx = a(a + ia') e imcr. Therefore, if x e P, there is a unique 
a e A with era — Ttx (a is unique because a is an injection). Thus, there is a well-defined 
function n ': P -> A, given by tx'x — a , where eta — txx. The reader may check that n' 
is an A 1 -map and that git' = f. • 


Theorem 11.101 (Cartan-Eilenberg). The following conditions are equivalent for a 
domain R. 

(i) R is a Dedekind ring. 

(ii) Every submodule of a projective R-module is projective. 

(iii) Every quotient of an injective R-module is injective. 
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Proof, (i) <£> (ii). 

If R is Dedekind, then we can adapt the proof of Theorem 9.8 (which proves that every 
subgroup of a free abelian group is free abelian) to prove that every submodule of a free 
/?-module is projective (see Exercise 11.47 on page 958); in particular, every submodule of 
a projective /?-module is projective. Conversely, since R itself is a projective /?-module, 
its submodules are also projective, by hypothesis; that is, the ideals of R are projective. 
Proposition 11.98 now shows that R is a Dedekind ring. 

(ii) (iii). 

Assume (iii), and consider the diagram with exact rows 


P <- P' -<-0 



where P is projective and E is injective; note that the hypothesis gives E" injective. To 
prove projectivity of P\ it suffices, by Lemma 11.100, to find a map P' —>• E making 
the diagram commute. Since E" is injective, there exists a map P -> E" giving com¬ 
mutativity. Since P is projective, there is a map P —>■ E also giving commutativity. The 
composite P' -> P -> E is the desired map. The converse is the dual of this, using the 
dual of Lemma 11.100. • 

Corollary 11.102. Let R be a Dedekind ring. 

(i) For all R-modules C and A, then for all n > 2, 

Ext'jj(C, A) = {0} and Torf (C, A) = {0}. 

(ii) Let 0 —> A' -> A -> A" -> 0 be a short exact sequence. For every module C, there 
are exact sequences 

0 -> Hom(C, A') -> Hom(C, A) -> Hom(C, A") 

-* Ext 1 (C, A') Ext 1 (C, A) Ext 1 (C, A") -> 0 

and 

0 -> Torf (C, A') -* Torf (C, A) -* Torf (C, A") 

—> C 0/^ A' —> C A —>• C (£)r A" —>- 0 

Proof, (i) By definition, if 

-> Pi —> P\ —> Po C -» 0 

is a projective resolution of C, then 

Ext"(C, A) = ker£/* +1 /imt/*; 
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moreover. Ext"(C, A) does not depend on the choice of projective resolution, by Corol¬ 
lary 10.74. Now C is a quotient of a free module F, and so there is an exact sequence 

0^K->F-UC-*0, (5) 

where K = kere. Since R is Dedekind, the submodule K of the free R -module F is 
projective, so that (5) defines a projective resolution of C with Pq — F, P\ = K, and 
P n — {0} for all n > 2. Hence, ker d* +1 c Horn( P„. A) — {0} for all n > 2, and so 
Ext" (C, A) = {0} for all A and for all n > 2. A similar argument works for Tor. 

(ii) This follows from Corollary 10.68 and Corollary 10.57, the long exact sequence for 
Ext and for Tor, respectively. • 

We will use this result in the next section to generalize Proposition 10.92. 

Exercises 

11.46 Let R be a commutative ring and let M be a finitely generated IGmodule. Prove that if IM = 
M for some ideal I of R, then there exists a e J with (1 — a)M = {0). 

Hint. If M = {m\ .... m n ), then each m, = ajjinj, where a\j e I. Use the adjoint 
matrix (the matrix of cofactors) as in the proof of Lemma 11.41. 

11.47 Generalize the proof of Theorem 9.8 to prove that if R is a left hereditary ring, then every 
submodule of a free left /^-module F is isomorphic to a direct sum of ideals, and hence is 
projective. 

11.48 Let R be a Dedekind ring, and let p be a nonzero prime ideal in R. 

(i) If a e p, prove that p occurs in the prime factorization of Ra. 

(ii) If a e p e and a p e+l . prove that p e occurs in the prime factorization of Ra, but that 
p < ’ +1 does not occur in the prime factorization of Ra. 

11.49 Let / be a nonzero ideal in a Dedekind ring R. Prove that if p is a prime ideal, then / C p if 
and only if p occurs in the prime factorization of I. 

11.50 If J is a fractional ideal of a Dedekind ring R, prove that (7 _1 )p = (7p) —1 for every maximal 
ideal p. 

11.51 Let be ideals in a Dedekind ring R. If there is no nonzero prime ideal p with 

Ii = p Li for all i for ideals L;, then 

11 T • • • T In = R- 

11.52 Give an example of a projective Z[V—5 ]-module that is not free. 

Hint. See Example 11.99. 

11.53 (i) A commutative ring R is called a principal ideal ring if every ideal in I? is a principal 

ideal (R would be a PID if it were a domain). For example, I n is a principal ideal ring. 
Prove that Z x Z is not a principal ideal ring. 

(ii) Let 1 1 ,...,/„ be pairwise coprime ideals in a commutative ring R. UR/Ij is a principal 
ideal ring for each i, prove that R/(I\ •••/») is a principal ideal ring. 

Hint. Use the Chinese remainder theorem. Exercise 6.11 (iii) on page 325. 
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11.54 Let a be a nonzero element in a Dedekind ring R. Prove that there are only finitely many 
ideals 1 in R containing a. 

Hint. If a e /, then Ra = IL for some ideal L C R. 


Finitely Generated Modules over Dedekind Rings 

We saw, in Chapter 9, that theorems about abelian groups generalize to theorems about 
modules over PIDs. We are now going to see that such theorems can be further generalized 
to modules over Dedekind rings. 

Proposition 11.103. Let R be a Dedekind ring. 

(i) If I Q R is a nonzero ideal, then every ideal in R/I is principal. 

(ii) Every fractional ideal J can be generated by two elements. More precisely, for any 
nonzero a e /, there exists h e J with J — Ra + Rb. 

Proof, (i) Let I = p j 1 • • • p e f be the prime factorization of 1. Since the ideals p £ ' are 
pairwise coprime, it suffices, by Exercise 11.53(h) on page 958, to prove that R/pf is a 
principal ideal ring for each i. Now right exactness of R Pl <S )r shows that {R/p e j ')p i = 
Rp, /(pf )p, • Since R Pi is a PID (it is even a DVR), any quotient ring of it is a principal 
ideal ring. 

(ii) Assume first that J is an integral ideal. Choose a nonzero a e J. By (i), the ideal 
J/Rci in R/Ra is principal; say, J/Ra is generated by b + Ra, where b e J. It follows 
that J = Ra + Rb. 

For the general case, there is a nonzero c e R with cJ c R (if J is generated by 
u\/v\,..., u m /v m , take c — ]”[/ v j)- Since cJ is an integral ideal, given any nonzero 
a e J, there is cb e cJ with cJ — Rea + Rcb. It follows that J — Ra + Rb. • 

The next corollary says that we can force nonzero ideals to be coprime. 

Corollary 11.104. If I and J are fractional ideals over a Dedekind ring R, then there 
are a, b e Frac(R) with 

al + bJ — R. 

Proof. Choose a nonzero a e I~ l . Now al C 7 -1 / = R, so that al C / -1 . By 
Proposition 11.103(h), there is b e / -l with 

J~ l = aIJ~ l + Rb. 

Since b e / , we have bJ C R, and so 

R = JJ~' = J(aIJ~ l + Rb) — al + RbJ = al + bJ. • 


Let us now investigate the structure of R-modules. 
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Lemma 11.105. 

(i) I/O —> M' —> M -> M" —> 0 is a short exact sequence of R-modules, then 

rank(M) = rank(M') + rank(Af"). 

(ii) An R-module is torsion if and only ifrwk(M) — 0. 

(iii) If M is a finitely generated torsion-free R-module with M {0}, then rank(M) = 1 
if and only if M is isomorphic to a nonzero ideal. 

Proof (i) By Corollary 8.103, the fraction field F is a flat /7-module. Therefore, 0 —» 
F M' —*■ F (&r M —*■ F (&r M" —»• 0 is a short exact sequence of vector spaces over 
F, and the result is a standard result of linear algebra (Exercise 3.74 on page 171). 

(ii) If M is torsion, then F ®r M — {0}, by an obvious generalization of Proposition 8.95 
(divisible <g> torsion = {0}). Hence, rank(M) = 0. Conversely, if rank(M) = 0, then 
F (&r M — {0}. By Proposition 11.25, if 5 = /? — {0} and Iim'- M —*■ S~ l M is the 
localization map, then kcr h m = {m e M : cnn = 0 for some a e R — {0}}. Thus, 
M = kerliM here, and so M is torsion. 

(iii) If M = I, where I is an ideal, then rank(M) = rank(7), and there is an exact sequence 
0 —> I —> F. Since F is a flat /7-module, the sequence 0 —> F I -> F F is exact. 
But F <S)r F = F is one-dimensional, so that rank!/) < 1. As I {0} (because fractional 
ideals are nonzero), we have rank(/) = 1. 

Conversely, assume that rank(M) = 1; that is, F <S) r M = F. Choose nonzero elements 
u,veM. If n, v are linearly independent, then (u , v) = (u) © ( v ). But exactness of 
0 —> {u) © (v) —> M gives exactness of 0 -» F <Sir {u) ® F <Sir {v) -> F ®r M (we 
have used the flatness of F once again). This is a contradiction, for a one-dimensional 
space has no two-dimensional subspaces. Choose a nonzero element x e M. It follows 
that if m e M , then there exist nonzero r.s e R with sm = rx. The reader may adapt the 
argument in the proof of Theorem 9.3 to see that the function M —> F, given by m r/s, 
is a well-defined (because M is torsion-free) isomorphism of M and a submodule S of F. 
As M is finitely generated, S is a fractional ideal. 

It remains to show that every fractional ideal J — (a | , a n /b n ) C F is isomor¬ 

phic to an integral ideal. If b — P| ( /?,, then bx e R for all x e J. for multiplication by b 
merely clears denominators. Hence, the map /—>/?, given by x bx, is an /7-map; it is 
injective because fields have no zero divisors. • 

Proposition 11.106. If R is a Dedekind ring and M is a finitely generated torsion-free 
R-module, then 

M = h © • • • © In, 

where /, is an ideal in R. 

Proof The proof is by induction on rank(M) > 0. If rank(M) = 0, then M is torsion, by 
Lemma 11.105(ii). Since M is torsion-free, M = {0}. Assume now that rank(M) — n + 1. 



Sec. 11.2 Dedekind Rings 


961 


Choose a nonzero m e M , so that rank(Rm) = 1. The sequence 

0 Rm M -A- M” -> 0 

is exact, where M" — R/Rm and v is the natural map. Note that rank(M") = n, by 
Lemma 11.105(i). Now M finitely generated implies M" is also finitely generated. If 
T — t{M") is the torsion submodule of M", then M"/T is a finitely generated torsion- 
free R-module with rank(M"/T) = rank(M") = n, because rank(T) = 0. By induction, 
M"/T is a direct sum of ideals, hence is projective. Define 

M' = v _1 (T) — [m e M : rm e Rm for some r ^ 0} C M. 

There is an exact sequence 0 —»■ M' —> M -> M"/T -> 0; this sequence splits because 
M"/T is projective; that is, M = M' © (M"/T). Hence 

rank(M') = rank(M) - rank(M"/r) = 1. 

Since R is noetherian, every submodule of a finitely generated /^-module is itself finitely 
generated; hence, M' is finitely generated. Therefore, M' is isomorphic to an ideal, by 
Lemma 11.105(h), and this completes the proof. • 

Corollary 11.107. If R is a Dedekind ring and M is a finitely generated torsion-free 
R-module, then M is projective. 

Proof. Recall that every ideal in a Dedekind ring is projective, by Proposition 11.98. It 
now follows from Proposition 11.106 that M is a direct sum of ideals, and hence it is 
projective. 

We can also prove this result using localization. For every maximal ideal m, the R m - 
module M m is finitely generated torsion-free. Since R m is a PID (even a DVR), however, 
M m is a free module, and hence it is projective. The result now follows from Corol¬ 
lary 11.39. • 

Corollary 11.108. If M is a finitely generated R-module, where R is a Dedekind ring, 
then the torsion submodule tM is a direct summand of M. 

Proof. The quotient module M/tM is a finitely generated torsion-free R-module, so that 
it is projective, by Corollary 11.107. Therefore, tM is a direct summand of M, by Corol¬ 
lary 7.55. • 

Corollary 11.109. If R is a Dedekind ring, then every torsion-free R-module A is flat. 

Proof. By Lemma 8.97, it suffices to prove that every finitely generated submodule of A 
is flat. But such submodules are torsion-free, hence projective, and projective modules are 
always flat, by Lemma 8.98. • 

It can be proved, over an arbitrary domain R, that every flat R-module is torsion-free 
(see Rotman, An Introduction to Homological Algebra, page 129). 

Using homological algebra, we generalize Corollary 11.108 by removing the hypothesis 
that tM be finitely generated. 
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Corollary 11.110. Let R be a Dedekind ring with F = Frac( R ). 

(i) If C is a torsion-free R-module and T is a torsion module with ann(r) {0}, then 
Extjj(C, T) = {0}. 

(ii) Let M be an R-module. //ann(fM) ^ {0}, where tM is the torsion submodule of 
M, then tM is a direct summand of M. 

Proof. We generalize the proof of Proposition 10.92. Since C is torsion-free, it is a flat 
/^-module, by Corollary 11.109, so that exactness of 0 —> R -» F gives exactness of 
0 —> R <S>« C -> F C. Thus, C = R C can be imbedded in a vector space 
V over F, namely, V — F <S)r C. Applying the contravariant functor Hom/A , T) to 
0-> C -> V V/C -> 0 gives an exact sequence 

ExtJjCV, T) -» Extjj( C, T) -* Ext 2 (V/C, T). 

Now the last term is {0}, by Corollary 11.102, and ExtJj( V, T) is (torsion-free) divisible, 
by (a straightforward generalization of) Example 10.70, so that Extjj(C, T) is divisible. 
Since ann(r) ^ {0}, Exercise 10.41 on page 852 gives ExtJj(C, T) — {0}. 

(i) To prove that the extension 0 —> tM —» M —> M/tM —> 0 splits, it suffices to prove 
that Extjj (M/tM, tM) — {0}. Since M/tM is torsion-free, this follows from part (i) and 
Corollary 10.90. • 

The next result generalizes Proposition 7.73. 

Proposition 11.111. The following statements are equivalent for a domain R. 

(i) R is a Dedekind ring. 

(ii) An R-module E is injective if and only if it is divisible. 

Proof, (i) =>• (ii). 

Let R be a Dedekind ring and let £ be a divisible £-module. By the Baer criterion. 
Theorem 7.68, it suffices to complete the diagram 

E 

A V 

f 

0-> I —^ R 

where I is an ideal and i: I —>• R is the inclusion. Of course, we may assume that I is 
nonzero, and so I is invertible: there are elements a\,... ,a n el and elements q\ .... q n e 
F with qil C R and 1 = qia,. Since E is divisible, there are elements e; e E with 
f (ai) — aje,. Note, for every b e I , that 

fQ>) = f(^2 diUjb) = J ~2(qib)f(ai ) = 'Y^(q i b)a i ei = b '^{q l ai)e i . 
i i i i 
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Hence, if we define e = ff, (q,a,)ei. then e e E and f(b) = be for all h e I. Defining 
g: R —> E by g(r) = re shows that the diagram can be completed, and so E is injec¬ 
tive. That every injective R -module is divisible was proved (for arbitrary domains R ) in 
Lemma 7.72. 

(ii) =>• (i). 

Let E be an injective R -module. If E' is a quotient of E, then E' is divisible and hence, 
by hypothesis, injective. Therefore, every quotient of an injective module is injective, and 
so R is a Dedekind ring, by Theorem 11.101. • 

Having examined torsion-free modules, let us now look at torsion modules. 

Proposition 11.112. Let p be a nonzero prime ideal in a Dedekind ring R. If M is an 
R-module with ann(M) = p e for some e > 0, then the localization map M —> M p is an 
isomorphism (and hence M may be regarded as an Rp-module). 

Proof. It suffices to prove that M = R p <S>r M. If m e M is nonzero and s e R with 
s p, then 

p e + R S = R, 

by Proposition 11.97. Hence, there exist u e p e and r e R with 1 — u + rs, and so 

m — um + rsm = rsm. 

If 1 = ii' + r's, where u' e p and r' e R , then s(r — r')m — 0, so that 

s(r — r') e ann(m) = p e . 

Since s f p'', it follows that r — f e p'' (the prime factorization of Rs does not contain 
p e ; if the prime factorization of Rir — r') does not contain p'', then neither does the prime 
factorization of Rs(r — r')). Hence, rm = r'm. Define s~ l m — rm. Define /: R p x M —>■ 
M by f(r/s , m) = s~ l rm , where s~ l rm has been defined in the preceding paragraph. It is 
straightforward to check that / is R -bilinear, and so there is an /(-map /: R v ® M —> M 
with /: ( r/s <g> m) — s~ l rm. In particular, /(I ® m) — m, so that / is surjective. On 
the other hand, the localization map Iim : M —>• R p <E)r M. defined by liM(m) — 1 ® m, is 
easily seen to be the inverse of/. • 

Definition. Let p be a nonzero prime ideal in a Dedekind ring R. An /(-module M is 
called p-primary if, for each m e M, there is e > 0 with ann(m) = p e . 


Theorem 11.113 (Primary Decomposition). Let R be a Dedekind ring, and let T be a 
finitely generated torsion R-module. If I — ann(T) = p £ j 1 • • • p e f, then 

t = r[pi]©---©r[p„], 


where 


T’ [p,- ] = {m e M : ann(m) is a power of p, }. 


T[p ; ] is called the pi-primary component ofT. 
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Proof. It is easy to see that the p/-primary components 7’|p, | are submodules of T. We 
now check the conditions in Proposition 7.19. If VV, is the submodule of T generated by 
all r[p ; ] with j i, we must show that r[p ; ] fl W, = {0}. Let x e r[p,] fl Wj. If 

j — n e i y.e n 

1 i — P i P/ > 

then p, and /,• are coprime: p, + /, = R. Hence, there are a, e p, and r, e /, with 
1 = a,- + r,-, and so x = cijX + rjX. But = 0, because x e r[p,], and r,x — 0, because 
x e Wj and /, = ann(W,). Therefore, x — 0. 

By Exercise 11.51 on page 958, we have 

II + ■ ■ ■ + I„ — R- 

Thus, there are bj e /, with b\ + •••-)- b n = 1. If t e T, then t = b[t + • • • + b n t. But if 
ci e p^', then c,T»/ e pf f — I = ann(T) and so Cj(bjt) = 0. Hence, pf C ann(/7,f), so 
that ann (bit) = pj" for some e > 0. Therefore, bit e T’[p/], and so 

T = T[ Pl ] +-h T[p n ]. 

The result now follows from Proposition 7.19. • 

Theorem 11.114. Let R be a Dedekind ring. 

(i) Two finitely generated torsion R-modules T and T' are isomorphic if and only if 
r[p/] = T'[pj]for all i. 

(ii) Every finitely generated p-primary R-module T is a direct sum of cyclic R-modules, 
and the number of summands of each type is an invariant ofT. 

Proof, (i) The result follows easily from the observation that if /: T —>■ T' is an isomor¬ 
phism, then ann(f) = ann(/(f)) for all t e T. 

(ii) The primary decomposition shows that T is the direct sum of its primary components 
r[p,-]. By Proposition 11.112, 7’[p,] is an /7 p .-module. But R P: is a PID (even a DVR), 
and so the basis theorem and the fundamental theorem hold: each r[p,] is a direct sum 
of cyclic modules, and the numbers and isomorphism types of the cyclic summands are 
uniquely determined. • 

We now know that every finitely generated R-module M is a direct sum of cyclic mod¬ 
ules and ideals. What uniqueness is there in such a decomposition? Since the torsion 
submodule is a fully invariant direct summand, we may focus on torsion-free modules. 

Recall Proposition 11.3: Two ideals J and J' in a domain R are isomorphic if and only 
if there is a e Frac(R) with J r = a J. 

Lemma 11.115. Let M be a finitely generated torsion-free R-module, where R is a 
Dedekind ring, so that M = Ii © • • • © /„, where the are ideals. Then 

M = R n ~ l © J , 


where J — I\ ■ ■ ■ /„ 
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Remark. We call R" 1 © J a Steinitz normal form for M. We will prove, in Theo¬ 
rem 11.117, that J is unique up to isomorphism. -4 

Proof. It suffices to prove that / © / = R © IJ , for the result then follows easily by 
induction onn > 2. By Corollary 11.104, there are nonzero a. h e Frac( R ) w ith a I+bJ = 
R. Since al = I and bJ = /, we may assume that I and J are coprime integral ideals. 
There is an exact sequence 

where S: x i-> (x,x) and a: (it, v) i-> u — v. Since I and J are coprime, however, 
we have I IT J = IJ and I + J — R. As R is projective, this sequence splits; that is, 
/©/ = /?© IJ. • 

The following cancellation lemma, while true for Dedekind rings, can be false for some 
other rings. In Example 7.78(iii), we described an example of Swan showing that if R = 
R[xi,..., x„]/(l-E; x f) ' s the real coordinate ring of the 3-sphere, then there is a finitely 
generated stably free R -module M that is not free. Hence, there are free /?-modules F and 
F' with M © F = F' © F but M ^ F'. 

Lemma 11.116. Let R be a Dedekind ring. If R © G = R © H, where G and H are 
R-modules, then G = H. 

Proof. We may assume there is a module E = ,4 © G = /!©//, where A = R = B. Let 
p: E -> B be the projection p : (b, h) h. and let p' — p\G. Now 

kerp' = G H H and im /c£ = fi. 

Thus, imp' = L, where L is an ideal in R. 

If imp' = {0}, then G C kerp = H. Since E = A © G, Corollary 7.18 gives 
H = G ® (H l T A). On the one hand, E/G — (A © G)/G = A = R\ on the other hand, 
E/G = (B © H)/G = B © (H/G) = R © (H/G). Thus, R = R@ ( H/G). Since R is 
a domain, this forces H/G = {0}: if R — X © Y. then X and Y are ideals; if x e X and 
y e Y are both nonzero, then xy e X IT Y — {0}, giving zero divisors in R. It follows that 
H/G = {0} and G — H. 

We may now assume that L — im p' is a nonzero ideal. The first isomorphism theorem 
gives G/(G n //) = L. Since R is a Dedekind ring, L is a projective module, and so 

G = / © (G n H), 

where I = L. Similarly, 

H = J © (G n H), 

where J is isomorphic to an ideal. Therefore, 

£ = AfflG = Affi/®(Gn //); 

E = B®H = B®J®(GHH). 
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It follows that 


A © 7 = E/(G D 77) = B@J. 


If we can prove that 7 = /, then 


G = I © (G n 77) = J © (G n 77) = 77. 


Therefore, we have reduced the theorem to the special case when G and 77 are nonzero 
ideals. 

We will prove that if or. R © I -> R ® J is an isomorphism, then 7 = 7. As in our 
discussion of generalized matrices on page 540, a determines a 2 x 2 matrix 


A = 


a 11 
021 


«12 

022 


where on: /?—>/?, 021 : R —> 7, 012 : /—>/?, and 022'- 7 —> 7. Indeed, as maps 
between ideals are just multiplications by elements of F — Frac(7?), we may regard A as 
a matrix in GL(2, F ). Now c /21 e 7 and a 22 1 C /, so that if t/ = det(A), then 


dl — (<311022 — <3l2<3l2)f ^ j■ 

Similarly, p = u ~ 1 determines a 2 x 2 matrix B = A -1 . 

c/ -1 / = det (B)J c 7, 

so that J c. dl. We conclude that / = <//, and so / = 7. • 

Let us sketch a proof using exterior algebra that if 7? is a Dedekind ring and 7 and / are 
fractional ideals, then R © 7 = R © / implies I = J. The fact that 2x2 determinants 
are used in the original proof suggests that second exterior powers may be useful. By 
Theorem 9.143, 

/ \\r e n = (r® /\\n) © (A*(*) A 1 ' 7 )) ® {A 2{R) 7 )- 

Now /\ 2 (R) = {0}, by Corollary 9.138, and /\ l (R) ®r /\ l (I) = R ®r I = I . We now 
show, for every maximal ideal m, that (A 2 (^)) m — {0}. By Exercise 11.24 on page 921, 

But 7? m is a PID, so that 7 m is a principal ideal, and hence /\ 2 (I m ) = {0}, by Corol¬ 
lary 9.138. It now follows from Proposition 11.3l(i) that /\ 2 (I) = {0}. Therefore, 
/\ 2 (R © 7) = 7. Similarly, /\ 2 (R © J) = /, and so 7 = /. 
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Theorem 11.117 (Steinitz). Let R be a Dedekind ring, and let M = I\ @ ■■ ■ /„ and 
M' = /{©••• If be finitely generated torsion-free R-modules. Then M = M' if and only 
if n — i and I\ •••/„ = /J • • • If 

Proof. Lemma 11.105(iii) shows that rank(/,) = 1 for all i, and Lemma 11.105(1) shows 
that rank(M) = n; similarly, rank(M') = l. Since M = M', we have F M = 
F M', so that rank(M) = rank(M') and n = i. By Lemma 11.115, it suffices to prove 
that if R" © 1 = R" © /, then / L 7. But this follows at once from repeated use of 
Lemma 11.116. • 

Let R be a commutative ring, and let C be a subcategory of ^Mod. Recall that two 
R-modules A and B are called stably isomorphic in C if there exists a module C e obj(C) 
with A © C = B © C. 

Corollary 11.118. Let R be a Dedekind ring, and let C be the category of all finitely 
generated torsion-free R-modules. Then M , M' e C are stably isomorphic in C if and only 
if they are isomorphic. 

Proof. Isomorphic modules are always stably isomorphic. To prove the converse, assume 
that there is a finitely generated torsion-free R-module X with 

M @X = M' @X. 

There are ideals I, J , L with M = R"~ l © I, M' = R' 1 ~ i © J , and X = R"'~ l © L, where 
n = rank(M) = rank (AT'). Hence, 

M © X = R n ~ l © / © R'"~ l © L = pn+m-l 0 IL 

Similarly, 

M' ®X = R"+ m ~ l © jl. 

By Theorem 11.117, IL = JL, and so there is a nonzero a e Frac(R) with alL = JL, 
by Lemma 11.3. Multiplying by L~ l gives al — J, and so I = J. Therefore, 

M = R n ~ l © I = R n ~ l @J = M'. • 

Recall that if a category C has finite products, then the Grothendieck group Kq(C) is the 
abelian group with generators (the isomorphism classes of) obj(C) and relations A © B = 
A + B for all A, B e obj(C); that is, Kq(C ) = IF(C)/1Z, where T(C) is the free abelian 
group with basis obj(C) and 1Z is the subgroup generated by all A © B — A — B. If [A] 
denotes the element A + 1Z in Kq(C), where A e obj(C), then [A] = [B] in Kq(C) if and 
only if they are stably isomorphic in C, by Proposition 7.77. 

Notation. If Pr( R ) is the category of all finitely generated projective R-modules over a 
commutative ring R, write 

K q (R) = tf 0 (Pr(R)). 

We end this section by displaying a relation between the class group C (R) of a Dedekind 
ring R and its Grothendieck group Kq(R). If I is a nonzero ideal in a Dedekind ring R, 
denote the corresponding element in C(R) by cls(7). 
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Theorem 11.119. If R is a Dedekind ring, then 

Kq{R) = C(R) © Z, 
where C(R) is the class group of R. 

Proof. If P is a finitely generated projective R -module, then P = R"~ l © I for a nonzero 
ideal I, by Lemma 11.115; moreover, the isomorphism class of I is uniquely determined 
by P, by Theorem 11.117. If P = R n ~ l © J, then there is a e Frac (R) with J — al, so 
that cls(7) = cls(7) in C(R). Therefore, the function ip\ Kq(R) —> C(R) © Z, given by 

<p([P]) = (cls(7), rank(T’)), 

is well-defined. Note that we are writing the first summand C{R) multiplicatively and the 
second summand Z additively. To see that ip is well-defined on Kq(R) = T(Vr{ R))/1Z, it 
suffices to prove that it preserves the relations in 1Z\ that is, 

<P([P © GD - <P([P]) ~ <P([Q]) = 0. 

Let Q — R m ~ 1 © J , where J is a nonzero ideal. Then P © Q = pn+m-\ 0 j j an( j 

<p([P © Q ]) = (els (IJ), n + m) 

= (ch(I)cls(J), n + m) 

— (cls(7), n) + (cls(/), m). 

Since rank(P © Q) — ranl<( P ) + rankf Q), it follows that q> is a well-defined homomor¬ 
phism. 

Now ip is surjective, for (cls(7), n) — ip([R n ~ l © 7]), and C(R) © Z is generated by 
all such elements. To see that ip is injective, recall that Proposition 7.77 says that a typical 
element of Kq(R) has the form [7*] — [Q], If ipi\P] — IQ]) = 0, then ^([T*]) = ip{[Q]). 
Hence, Proposition 7.77 says that P and Q are stably isomorphic. Corollary 11.118 says 
that P = Q, and so [7*] — [ Q] — 0. Therefore, ip is an isomorphism. • 

Remark. There is another Grothendieck group in this context. An /(-module A is called 
invertible if it is finitely generated and A <S)r Hom«(,4, R) = R. By Propositions 8.83 and 
9.97, the category of all invertible /(-modules under tensor product is a *-catcgory, and so 
it has a Grothendieck group, which is called the Picard group , denoted by Pic(7(). It turns 
out that every invertible module is isomorphic to an ideal. Thus, Pic(7() is the abelian group 
(written multiplicately) with generators all invertible ideals in R and relations / © r J — IJ. 
When R is a Dedekind ring, then Pic(7() = C (R). ◄ 


Exercises 


11.55 Let R be a Dedekind ring, and let 7 C R be a nonzero ideal. Prove that there exists an ideal 
J C R with 7 + J = R and IJ principal. 
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Hint. Let 1 = Pj • • • p ", and choose r, e p— p.' . Use the Chinese remainder theorem 

to find an element a e R with a e p e j and a £ pf' +1 , and consider the prime factorization of 
Ra. 

11.56 (i) If R is a commutative ring, prove that R n = R' n implies n = m. 

Hint. If m is a maximal ideal in R, then the (K/m)-vector spaces ( R/m) n and (R/m) m 
are isomorphic. 

(ii) If R is any commutative ring, prove that Z is a direct summand of Kq(R). 

11.57 If R is a PID, prove that Kq(R) = Z. 

11.58 If / is a fractional ideal in a Dedekind ring R, prove that / ® I~ l = R. 

Hint. Use invertibility of I . 

11.59 If R is a local ring, prove that Kq(R) = Z. 

11.60 If R is a commutative ring, prove that rank defines a surjective homomorphism Kq{R) -» Z. 
We usually call the kernel of this map the reduced Grothendieck group, and we denote it by 
Kq(R). Hence, 

K 0 (R) = K 0 (R)(BZ. 

11.61 If C is a subcategory of ^Mod, then we defined a variant of the Grothendieck group on 
page 492: K'(C) is the abelian group with generators obj(C) and relations B = A — C if 
there exists a (not necessarily split) exact sequence 0 —> A —> B —> C — >-0. 

(i) If R is a Dedekind ring, prove that restriction of the homomorphism e: Kq(R) —»■ K'(C) 
of Proposition 7.82 is an isomorphism Kq(R ) —>■ K'(C). 

(ii) If R is a Dedekind ring, prove that K'(C) = C(R). 


11.3 Global Dimension 


There are several types of rings whose finitely generated modules have been classified: 
semisimple rings; PIDs; Dedekind rings. Each of these rings can be characterized in terms 
of its projective modules: a ring R is semisimple if and only if every /^-module is projec¬ 
tive; a domain R is Dedekind if and only if every ideal is projective. The notion of global 
dimension allows us to classify arbitrary rings in this spirit. 

Rings in this section need not be commutative. 

Definition. Let R be a ring and let A be a left /(-module. If there is a finite projective 
resolution 

0 -> P n ->-► Pi -> Pq -> A -► 0, 

then we write pd(A) < n. If n >0 is the smallest integer such that pd(A ) < n, then we 
say that A has projective dimension n; if there is no finite projective resolution of A, then 
pd(A) — oo. 
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Example 11.120. 

(i) A module A is projective if and only if pd(A) = 0. We may thus regard pd(A ) as a 
measure of how far away A is from being projective. 

(ii) If R is a Dedekind ring, then pd(A ) < 1 for every R -module A. By Theorem 11.101, 
every submodule of a free P-module is projective. Hence, if F is a free R -module and 
s: F A is a surjection, then 

0 —> ker s — > F A -> 0 

is a projective resolution of A. This argument extends to left hereditary rings. < 


Definition. Let P. = • • • -> If —> Pi Pq —%• A —> 0 he a projective resolution 
of a module A. If n > 0, then the nth syzygy is 


£2„(A,P.) 


I ker s if n = 0 
ker d„ if n > 1. 


Proposition 11.121. For every n > 1, for all left R-modules A and B, and for every 
projective resolution P. of B, there is an isomorphism 

Ext'jj +1 (A, B ) = Ext)j(^„_i(A, P.), B). 

Proof The proof is by induction on n > 1. Exactness of the projective resolution 


P. 



d i 



0 


di 


gives exactness of 

-► P 2 ^ Pi n 0 (A,P.) -> 0, 

which is a projective resolution P+ of Q.{\(A, P.). In more detail, define 

p„ — Pn+ 1 and d n —d„+ 1. 

Since Ext 1 is independent of the choice of projective resolution of the first variable. 


ExtJ{(£2o(A,P.), B) = T 


kerW^)* kerf<^3 )* 


im(c/j )* imfcP)* 
The inductive step is proved in the same way, noting that 


= Exti(A, B). 


P ~ d n +2 p 
*n +2 ^ ^n +1 


Q„(A, P.) -* 0 




is a projective resolution of Q n (A , P.). 
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Corollary 11.122. For all left R-modules A and B, for all n > 0, and for any projective 
resolutions P. and P'. of A, there is an isomorphism 

ExtJftfVA, P.), B) = Extjj(£2„(A, P'j, B). 

Proof. By Proposition 11.121. both are isomorphic to Ext'^ +1 ( A, B ). • 

Two modules and Q' are called projectively equivalent if there exist projective mod¬ 
ules P and P' with £2 © P = Q' © P'. Exercise 11.62 on page 983 shows that any two nth 
syzygies of a module A are projectively equivalent. We often abuse notation and speak of 
the nth syzygy of a module, writing Q n (A) instead of £2„(A, P.). 

Syzygies help compute projective dimension. 


Lemma 11.123. The following conditions are equivalent for a left R-module A. 

(i) pd(A) < n. 

(ii) Ext^(A, B) = {0 } for all modules B and all k > n + 1. 

(iii) Ext'jj +1 (A, B ) = {0} for all modules B. 

(iv) for every projective resolution P. of A, the (n — l)sf syzygy £2„_i(A, P.) is projec¬ 
tive. 

(v) there exists a projective resolution P. of A with £2„_i(A, P.) projective. 


Proof, (i) =>• (ii). 

By hypothesis, there is a projective resolution P. of A with — {0} for all k > n + 1. 
Necessarily, all the maps dk : If -> Pk-\ are zero for k > n + 1, and so 


Ext^ (A, B) = 


ker(<4 + i)* 
im (d k )* 


= { 0 }. 


(ii) =£> (iii). Obvious. 


(iii) =© (iv). 

If P. = •••—»■ P„ —»■ P n - 1 -» •••—>■ Pi —> Po -> A —>• 0 is a projective resolution 
of A, then Ext'^ +1 (A, B) = Ext^(£2„_i(A, P.), B ), by Proposition 11.121. But the last 
group is {0}, by hypothesis, so that (A) is projective, by Corollary 10.86. 

(iv) =k (v). Obvious. 

(v) =» (i). 

If 

• • • -> P„ -> P„_ i t- • • • > Pi > Pq > A > 0 
is a projective resolution of A, then 

0 S2„_i(A) P„_ i -»- ^Pi-^P 0 ->A^O 

is an exact sequence. Since £2„_i(A) is projective, the last sequence is a projective resolu¬ 
tion of A, and so pd(A ) < n. • 
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Example 11.124. 

Let G be a finite cyclic group with | G \ > 1. If Z is viewed as a trivial ZG-module, then 
pd( Z) = oo, because Corollary 10.108 gives, for all odd n, 

H n (G, Z) = ExtJ G (Z, Z) / {0}. ◄ 

The following definition will soon be simplified. 

Definition. If R is a ring, then its left projective global dimension is defined by 
IpD(R) — sup{pc/(A) : A e obj(^Mod)}. 

Proposition 11.125. For any ring R, 

lpD(R)<n if and only if Ext '^ +1 (A, B) — {0} 
for all left R-modules A and B. 

Proof. This follows at once from the equivalence of fi) and (iii) in Lemma 11.123. • 

Example 11.126. 

(i) A ring R is semisimple if and only if IpD(R) = 0. Thus, global dimension is a measure 
of how far a ring is from being semisimple. 

(ii) A ring R is left hereditary if and only if /p I) ( R) < 1. In particular, a domain R is 

Dedekind if and only if pD{R) < 1. ◄ 

A similar discussion can be given using injective resolutions. 

Definition. Let R be a ring and let R he a left 1?-module. If there is an injective resolution 
0 -» B -» E° -» E l -» E" -> 0, 

then we write id{B ) < n. If n > 0 is the smallest integer such that id(B) < n , then we 
say that B has injective dimension n\ if there is no finite injective resolution of B , then 
id(B ) = oo. 

Example 11.127. 

(i) A module B is injective if and only if id(B ) = 0. We may thus regard id(B ) as a 
measure of how far away B is from being injective. 

(ii) The injective and projective dimensions of a module A can be distinct. For example, 
the abelian group A — Z has pd(A) = 0 and id {A) = 1. 

(iii) If R is a Dedekind ring, then Theorem 11.101 says that every quotient module of an 
injective /^-module is injective. Hence, if i): B -» £ is an imbedding of an /^-module B 
into an injective /^-module E, then 

0 -> B —E —»■ coker 77 —^ 0 
is an injective resolution of B. It follows that id(B) <1. < 
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Definition. Let E* = 0 -> B E° E l E 2 
of a module B. If n > 0 , then the nth cosyzygy is 


be an injective resolution 


13" (B, E*) 


I coker;; if n — 0 
coker d"~ l if n > 1 . 


Proposition 11.128. For every n > 1, for all left R-modules A and B, and for every 
injective resolution E* of A, there is an isomorphism 

Ext'^ + 1 (A, B) = ExtJj (A, 13"~\B, E*)). 

Proof. Dual to the proof of Proposition 11.121. • 


Corollary 11.129. For all left R-modules A and B, for all n > 0, and for any injective 
resolutions E* and E'* of B, there is an isomorphism 

Ext]j(A, 13"(B, E*)) = Ext)j (A, 13" (B, E'*)). 

Proof. Dual to the proof of Corollary 11.122. • 

Two modules 13 and 13' are called injectively equivalent if there exist injective modules 
E and E' with 13 @ E = 13' ® E'. Exercise 11.63 on page 983 shows that any two nth 
cosyzygies of a module B are injectively equivalent. We often abuse notation and speak of 
the nth cosyzygy of a module, writing l)" ( B) instead of 13" (B, E* ). 

Cosyzygies help compute injective dimension. 

Lemma 11.130. The following conditions are equivalent for a left R-module B. 

(i) id{B ) < n. 

(ii) Ext^(A, B) = {0 } for all modules A and all k > n + 1. 

(iii) Ext'jj + 1 (A, B) = {0} for all modules A. 

(iv) for every injective resolution E* ofB, the (n — l)st cosyzygy 13"~ 1 (B, E*) is injective. 

(v) there exists an injective resolution E* of B with 13"~ 1 (B, E*) injective. 

Proof. Dual to that of Lemma 11.123, using Exercise 10.49 on page 869 • 


Definition. If R is a ring, then its left injective global dimension is defined by 
liD(R) — sup{/c/(B) : B e obj(^Mod)}. 

Proposition 11.131. For any ring R, 

liD(R) < n if and only if Ext^ +1 (A, B) = {0} 
for all left R-modules A and B. 

Proof. This follows at once from the equivalence of fi) and (iii) in Lemma 11.130. • 
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Theorem 11.132. For every ring R, 

IpD(R) = HD(R). 

Proof. This follows at once from Propositions 11.125 and 11.131, for each number is 
equal to the smallest n for which Ext'^ +1 (A, B) = {0} for all left /^-modules A and B. • 

Definition. The left global dimension of a ring R is the common value of the left projec¬ 
tive global dimension and the left injective global dimension: 

!D(R) = IpD(R) = liD(R). 

If R is commutative, then we denote its global dimension by D(R) 

There is also a right global dimension rD(R) = ID{R op ) of a ring R. If R is com¬ 
mutative, then ID(R ) = rD(R) and we write D(R ). Since left semisimple rings are also 
right semisimple, by Corollary 8.57, we have / D(R) = 0 if and only if rD(R) — 0. On 
the other hand, there are examples of rings in which these two dimensions differ. 

We are now going to see that ID(R ) can be computed from cyclic left /^-modules. 

Lemma 11.133. A left R-module B is injective if and only ifExt l R (R/I. B) — {0 } for 
every left ideal I. 

Proof. If B is injective, then Ext^(A, B) vanishes for every right R -module A. Con¬ 
versely, suppose that ExtJj (R/I, B) = {0} for every left ideal I. Applying Horn/A , B) to 
the exact sequence 0-»/->/?—»•/?//—>0 gives exactness of 

Homjj(/?, B ) —> Hom^(7, B) —> ExtJj(f?/7, B ) = 0. 

That is, every A 1 -map f: I B can be extended to an A 1 -map R —»• B (see Proposi¬ 
tion 7.63). But this is precisely the Baer criterion. Theorem 7.68, and so B is injective. • 

The next result says that ID{R ) can be computed from pd(M) for finitely generated 
/^-modules M; in fact, 1 D{R) can even be computed from pd(M) for M cyclic. 

Theorem 11.134 (Auslander). For any ring R, 

ID(R) = sup{pd(R/I) : I is a left ideal}. 

Proof. ( Matlis ) If sup{pc/(J?//)} = oo, we are done. Therefore, we may assume there 
is an integer n > 0 with pd(R/I) < n for every left ideal I. By Lemma 11.130, 
Ext'jj +1 (/?//, B) — {0} for every left /^-module B. But IpD(R) — liD(R), by Theo¬ 
rem 11.132, so that it suffices to prove that id(B) < n for every B. Let E* be an in¬ 
jective resolution of B. with (n — l)st cosyzygy 13"~ 1 (B). By Corollary 11.128, {0} = 
Ext'jj +1 (/?//, B) = Ext 1 r (R/I, 15"~ 1 (B)). Now Lemma 11.133 gives U" -1 (B) injective, 
and so Lemma 11.130 gives id(B) < n, as desired. • 

This theorem explains why every ideal in a Dedekind ring being projective is such a 
strong condition. 

Just as Ext defines the global dimension of a ring R , we can use Tor to define the weak 
dimension (or Tor -dimension) of a ring R. 
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Definition. Let R he a ring and let A be a right .R-module. A flat resolution of A is an 
exact sequence 

-> F n -* -> Fi -> F 0 A -> 0 

in which each F„ is a flat right R-module. 

If there is a finite flat resolution 

0 -» F n -»-> Fi -» F 0 -* A -* 0, 

then we write fd(A) < n. If n > 0 is the smallest integer such that fd (A) < n, then we 
say that A has flat dimension n; if there is no finite flat resolution of A, then fd (A) = oo. 


Example 11.135. 

(i) A module A is flat if and only if fd(A) = 0. We may thus regard fd (A) as a measure 
of how far away A is from being flat. 

(ii) Since projective modules are flat, every projective resolution of A is a flat resolution. 
It follows that if R is any ring, then fd(A) < pd(A) for every R -module A. 

(iii) If R is a Dedekind ring and A is an R-module, then fd(A) < pd(A ) < 1, by (ii). 
Corollary 11.109 says that every torsion-free R -module is flat (the converse is true as well). 
Hence, fd(A) — 1 if and only if A is not torsion-free. < 

• • ^2 s 

Definition. Let F. = •••—> Fi —A F\ —> Fo —> A —> 0 be a flat resolution of a 
module A. If n > 0, then the nth yoke is 

Y„(A, F.) — L ere if "=° 

|kerj„ if)i > 1. 

The term yoke is not standard; it is a translation of the Greek ov^vyla (syzygy). 

Proposition 11.136. For every n > 1, for all right R-modules A and left R-modules B, 
and for every flat resolution F. of A, there is an isomorphism 

Tor* +1 (A, B) = Torf (A, Y n _\{B, F.)). 

Proof. Dual to the proof of Proposition 11.121. • 


Corollary 11.137. For every right R-module A and left R-module B, for all n > 0, and 
for any flat resolutions F. and F'. of B, there is an isomorphism 

Torf (A, Y n (B. F.)) = Torf (A, Y n (B, F'.)). 


Proof. Dual to the proof of Corollary 11.122. • 
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Lemma 11.138. The following conditions are equivalent for a right R-module A. 

(i) fd(A ) < n. 

(ii) Tor^(A, B) — {0 }for all k > n + 1 and all left R-modules B. 

(iii) Tor^ +1 (A, B) — {0 }for all left R-modules B. 

(iv) For every flat resolution F* of A, the (n — 1 )st yoke T„_i(A, F*) is flat. 

(v) There exists aflat resolution F* of A with flat (n — 1 )st yoke T„_i(A, F*). 

Proof. As the proof of Lemma 11.123. • 

Definition. The right weak dimension of a ring R is defined by 
rwD(R) = sup{/c/(A) : A e obj(Mod«)}. 

Proposition 11.139. For any ring R, rwD(R) < n if and only if Tor ; ^ +1 (A, B) — {0 }for 
every left R-module B. 

Proof. This follows at once from Lemma 11.138. • 

We define the flat dimension of left R-modules in the obvious way. 

Definition. The left weak dimension of a ring R is defined by 

IwD(R) — sup {fd(B) : B e obj(^Mod)}. 

Theorem 11.140. For any ring R, 

rwD(R) = IwD(R). 

Proof. If either dimension is finite, then the left or right weak dimension is the smallest 
n > 0 with Tor^ +1 (A, B) — {0} for all right R-modules A and all left R-modules B. • 

Definition. The weak dimension of a ring R , denoted by wD(R), is the common value 
of rwD(R) and IwD(R). 

As we have remarked earlier, there are (noncommutative) rings whose left global di¬ 
mension and right global dimension can be distinct. In contrast, weak dimension has no 
left/right distinction, because tensor and Tor involve both left and right modules simulta¬ 
neously. 

Example 11.141. 

A ring R has wD(R) = 0 if and only if every module is flat. These rings turn out to be von 
Neumann regular, for each a e R, there exists a' e R with a a'a = a. Examples of such 
rings are Boolean rings (rings R in which r 2 = r for all r e R ), and End^fV), where V is 
a (possibly infinite-dimensional) vector space over a field k. See Rotman, An Introduction 
to Homological Algebra , pages 119-120. ◄ 

The next proposition explains why weak dimension is so called. 
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Proposition 11.142. For any ring R, 

wD(R) < min {ID(R), rD(R)}. 

Proof. It suffices to prove that fd(A) < pd(A) for any right /^-module A. If pd(A) — 
oo, there is nothing to prove; if pd{A) < n, there is a projective resolution 

0 P n ->-> Pq A -> 0. 

Since every projective module is flat, this is a flat resolution showing that fd(A) < n. 
Hence, u>D(R ) < rD(R). A similar argument shows that wD{R) < ID(R). • 

Corollary 11.143. Suppose thatE\t" R (A, B) — {0} for all left R-modules A and B. Then 
Tor ( f (C, D) — {0 } for all right R-modules C and all left R-modules D. 

Proof. If Ext'jjfA, B) = {0} for all A. B, then ID(R) < n - 1; if Tor,?(C, D) {0} for 
some C, D , then n < wD{DR). This contradicts Proposition 11.142; 

ID(R ) < n — 1 < n < wD(R). • 

Lemma 11.144. A left R-module B is flat if and only if Tor['’( R/1. B ) = {0} for every 
right ideal I. 

Proof. Exactness of 0 -+ I — U- R -+ /?//—»• 0 gives exactness of 

0 = Torf (R, B) -> Torf (R/I, B) -> I ® R B R ® R B. 

Therefore, i ® 1 is an injection if and only if T’or( R/1. B ) = {0}. On the other hand, B is 
flat if and only if i ® 1 is an injection for every right ideal /, by Corollary 8.108. • 

As global dimension, weak dimension can be computed from cyclic modules. 

Corollary 11.145. For any ring R, 

wd(R) = sup{ fd(R/I) : I is a right ideal of R} 

— sup { fd(R/J) : J is a left ideal of R}. 

Proof. This proof is similar to the proof of Theorem 11.134, using Lemma 11.144 instead 
of Lemma 11.133. • 

Theorem 11.146. Let R be a left noetherian ring. 

(i) If A is a finitely generated left R-module, then 


pd(A) = fd{A). 
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(ii) 


wD(R) = 1D(R). 


In particular, if R is a commutative noetherian ring, then 


wD(R) = D(R). 


Proof, (i) It is always true that fd(A ) < pd(A), for every projective resolution is a 
flat resolution. For the reverse inequality, it is enough to prove that if fd(A) < n, then 
pd(A) < n. By Lemma 11.37, there is a projective resolution of A, 

- >P n -+ -> Po -> A -* 0, 

in which each P, is finitely generated. Now this is also a flat resolution, so that, by 
Lemma 11.138, fd(A) < n implies Y n — ker(P„_i -> P n - 2 ) is flat. But every finitely 
generated flat left R -module is projective, by Corollary 8.111 (because R is left noethe¬ 
rian), and so 

0 Y n ~i -+ P „-1 -> - - - P 0 A -> 0 
is a projective resolution. Therefore, pd(A ) < n. 

(ii) By Theorem 11.134, we have ID(R) is the supremum of projective dimensions of 
cyclic left /^-modules, and by Corollary 11.145, we have wd(R) is the supremum of flat 
dimensions of cyclic left R -modules. But part (i) gives fd(A ) — pd(A ) for every finitely 
generated right /^-module A, and this suffices to prove the result. • 

We are now going to compute the global dimension D(k[x 1 ,..., x„]) of a polynomial 
ring over a field (the result is called Hilbert’s theorem on syzygies). 

Lemma 11.147. IfO —> A' —> A —> A" —> 0 is a short exact sequence, then 
pd{A") < 1 + ma x{pd(A), pd(A')}. 

Proof We may assume the right side is finite, or there is nothing to prove; let pd (A) < n 
and pd ( A') < n. Applying Hom( , B), where B is any module, to the short exact sequence 
gives the long exact sequence 

-» Ext" +1 (A', B) Ext" +2 (A", B) -» Ext" +2 (A, B) . 

The two outside terms are {0}, by Lemma 11.123(h), so that exactness forces 
Ext' !+2 (A", B) — {0} for all B. The same lemma gives pd(A") < n + 1. • 

We wish to compare global dimension of R and R[x], and so we consider the R[x]~ 
module R[x] <S)r M = M{x] arising from an /^-module M. In Chapter 9, on page 684, we 
called this module M[;x\. 
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Definition. If M is an R -module over a commutative ring R. define 


M[x] = 

i> 0 

where M-, = M for all i. The /?-module M[x] is an /?[x]-module if we define 

xfcx'mi) — Tx l+1 mj. 

i i 

In Lemma 9.55, we proved that if V is a free /?-module over a commutative ring R , then 
V[x ] is a free /?[x]-module. The next result generalizes this from pd(V ) = 0 to higher 
dimensions. 

Lemma 11.148. For every R-module M, where R is a commutative ring, 

pd R (M) = pd R \ x \(M[x]). 

Proof. It suffices to prove that if one of the dimensions is finite and at most n, then so is 
the other. 

If pd(M) < n, then there is an R -projective resolution 

0 -* P„ -> Pq -* M 0. 

Since R[x] is a free R -module, it is a flat /^-module, and so there is an exact sequence of 
/?[x]-modules 

0 -» /?[x] P„ -» • • • -> /?[x] Po -> /?[x] M -> 0. 

But R[x] <S)r M = M\x\ and /?[x] <S) R P n is a projective /?[x]-module (for a projective is 
a direct summand of a free module). Therefore, pd R \ x \(M[x]) < n. 

If pd(M[x]) < n, then there is an /?[x]-projective resolution 

0 Q n ->-> Qo -> M[x\ -> 0. 

As an /?-module, M[x] = 5~ (J> i where M n = M for all n. By Exercise 11.69 on 
page 984, pd r(M\ x]) = pd r(M). Each projective /?[x]-module Q, is an /?[x]-summand 
of a free /?[x]-module f); a fortiori, Q, is an R -direct summand of F, . But /?[x] is a free 
/?-module, so that F, is also a free /^-module. Therefore, as an R -module, Q, is projective, 
and so pd R (M) < pd R [ x ](M[x]). • 

Corollary 11.149. If R is a commutative ring and D(R) — 00 , then Z)(/?[x]) = 00 . 

Proof. If D{R) — oo, then for every integer n, there exists an R -module M„ with n < 
pd(M n ). By the lemma, n < pd R \ x \(M n [x]), and so D(/?[x]) = oo. • 
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Proposition 11.150. For every commutative ring R, 

£>(//[*]) < D(R ) + 1. 

Proof. Recall the characteristic sequence. Theorem 9.56: If M is an //-module and 
T : M —> M is an //-map, then there is an exact sequence of //[x]-modules 

0 M[x] M[x] -> M t -» 0, 

where M 1 is the //[xj-module M with scalar multiplication given by ax'm = aT l (jn). If 
M is already an //[x]-module and T: M M is the //-map m i->- xm, then M r — M. 
By Lemma 11.147, 


pd R[x ](M) < 1 + pd R [ x ](M[x]) 

= 1 + pd R (M) 

< 1 + D(R). • 

We proceed to prove the reverse inequality. 

Definition. If M is an R-module, where R is a commutative ring, then an element c e R 
is regular on M (or is M-regular) if the //-map M —> M, given by m cm, is injective. 
Otherwise, c is a zero divisor on M; that is, there is some nonzero m e M with cm — 0. 

Before stating the next theorem, let us explain the notation. Suppose that R is a com¬ 
mutative ring, c e R, and R* = R/Rc. If M is an //-module, then M/cM is an ( R/Rc )- 
module; that is, M/cM is a /Z*-module. On the other hand, every /Z*-module A* can also 
be viewed as an //-module. If o : (R/Rc) x A* —> A* is the given scalar multiplication 
and if v: R -* R/Rc is the natural map, then a(v x 1 a*): R x A* -» A* is a scalar 
multiplication. In more down-to-earth language, if r* — r + Rc, then 

r*a = ra. 

We denote A* viewed in this way as an //-module by A'\ Exercise 11.72 on page 984 
asks you to prove that M* = ( R/cR ) M and A b = Horn r (R/cR, A*), so that these 
constructions involve an adjoint pair of functors. 

Proposition 11.151 (Rees Lemma). Let R be a commutative ring, let c e R be neither 
a unit nor a zero divisor, and let R* — R/Rc. If c is regular on an R-module M, then there 
are natural isomorphisms, for every R*-module A* and all n > 0, 

Ext'^(A*, M/cM) = Ext'^ +1 (A t> , M), 

where A’ is the R*-module A* viewed as an R-module. 
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Proof. Recall Theorem 10.45, the axioms characterizating Ext functors. Given a se¬ 
quence of contravariant functors G" : r* Mod -> Ab, for n > 0, such that: 

(i) for every short exact sequence 0 —> A* —> B* -> C* -> 0 of R*-modules, there is a 
long exact sequence with natural connecting homomorphisms 

-> G"(C*) -> G"(B*) -> G"(A*) -* G" +1 (C*) ->•••; 

(ii) G° and Hom«*( , L*) are naturally equivalent, for some /G-modulc L*\ 

(iii) G n (P *) = 0 for all projective /C-modules P* and all n > 1; 
then G" is naturally equivalent to Ext^*( , L*) for all n > 0. 

Define contravariant functors G" : s*Mod —>■ Ab by G n — Ext'^ +1 ( b , M). Thus, for 
all R*-modules A*, 

G n (A*) = Ext'jj +1 (A b , M). 

Since Axiom (i) holds for the functors Ext", it also holds for the functors G". Let us prove 
Axiom (ii). The map /x c : M —> M , defined by m m* cm , is an injection, because c is 
M- regular, and so the sequence 0 -> M -^4- M -> M/cM -> 0 is exact. Consider the 
portion of the long exact sequence, where A* is an /C-module: 

Horn*(A b , M) -> Hom*(A b , M/cM) Ext)/A b , M) ( -H* Extjj(A b , M). 

We claim that 3 is an isomorphism. If a e A b , then ca = 0, because A* is an R*-module 
(remember that R* — R/cR). Hence, if / e Homs(A b , M), then cf(a) = f (ca) — 
/(0) = 0. Since /j. c : M -> M is an injection, f {a ) = 0 for all a e A b . Thus, f — 0, 
ker3 = Hom^(A b ,M) = {0}, and 3 is an injection. The map (/x f )*: ExtJj(A b ,M) -» 
ExtJj(A b , M) is multiplication by c, by Example 10.60. On the other hand. Example 10.70 
shows that if \jl c : A b —>■ A b is multiplication by c, then the induced map on Ext is 

also multiplication by c. But n! c = 0, because A* is a (R/cR)-module, and so (p.' c )* = 0. 
Hence, (/r c )* = (p-' c )* — 0. Therefore, im3 = ker(/r c )* = Extjj(A b , M), and so 3 is a 
suijection. It follows that 

3: Hom«(A b , M/cM ) -> Ext)/A b , M) 

is an isomorphism, natural because it is the connecting homomorphism. By Exercise 11.70 
on page 984, there is a natural isomorphism 

Hom**(A*, M/cM) -> Homs(A b , M/cM). 


The composite 

Homs*(A*, M/cM) -* Hom s (A b , M/cM ) ExtJ/A b , M) = G°(A*) 
is a natural isomorphism; hence, its inverse defines a natural equivalence 

G° Horn r *(,M/cM). 
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Setting L* = M/cM completes the verification of Axiom (ii). 

It remains to verify Axiom (iii): G n (P*) = {0} whenever P* is a projective /?*-module 
and n > 1. In fact, since G n is an additive functor and since every projective is a summand 
of a free module, we may assume that P* is a free R*-module with basis, say, E. If 
Q — YleeE Re ' s ^e free R-module with basis E, then there is an exact sequence of 
/^-modules 


0 -> Q Q -> P* -> 0. (6) 

The first arrow is an injection because c is not a zero divisor in R\ the last arrow is a 
suijection because 

Q/cQ = (J2Re)/(J2 Rce ) 

eeE eeE 

= J2( R / R c)e = J2 R * e = P *- 

eeE 

The long exact sequence arising from (6) is 

-» Ext''(6, M) Ext'j+ 1 (P b , M) -> Ext'' +1 (e, M) 

Since Q is K-t'rcc and n > 1, the outside terms are {0}, and exactness gives G"(P*) = 
Ext" R +l (P b , M) = {0}. Therefore, 

Ext'^ +1 (A b , M) = G n (A*) = Ext^*(A*, M/cM). • 


Theorem 11.152. For every commutative ring k, 

D(k[x ]) = D(k) + 1. 

Proof. We have proved Z)(A:[x]) < D(k) + 1 in Proposition 11.150, and so it suffices to 
prove the reverse inequality. 

In the notation of the Rees lemma. Proposition 11.151, let us write R = k[x~\, c = x, 
and R* — k. Let A be a /c-module with pd(A ) = n. By Exercise 11.65 on page 984, there 
is a free fc-module F with Ext£(A, F) f {0}; of course, multiplication by x is an injection 
F -> F. As in the proof of the Rees lemma, there is a free fc[x]-module Q — A’[v] <S)k F 
with Q/xQ = F. The Rees lemma gives 

Ext"+|(A, Q) = Ext^(A, Q/xQ) = Ext n k (A, F) f {0} 

(A is viewed as a ^[x]-module via k[x~\ —> k). Therefore, pdk\ x \(A) > n + 1, and so 
D(k[xf) > n + 1 = D(k) +1. • 
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Corollary 11.153 (Hilbert’s Theorem on Syzygies). Ifk is afield, then 

D{k[x 1 , ..., Xn\) = n. 

Proof. Since D(k) = 0 and D(k[x]) = 1 for every field k, the result follows from 
Theorem 11.152 by induction on n > 0. • 

Hilbert’s theorem on syzygies implies that if R = k[x\,..., x n ], where A: is a field, then 
every finitely generated /^-module M has has a resolution 

0 -* P n -> Pn-1 ->-► Po -> M -» 0, 

where /( is free for all i < n and P„ is projective. We say that a (necessarily finitely gener¬ 
ated) /^-module M, over an arbitrary commutative ring R, has FFR (finite free resolution) 
if it has a resolution in which every Pj, including the last P n , is a finitely generated free 
module. Hilbert’s theorem on syzygies can be improved to the theorem that if A: is a field, 
then every finitely generated k[x i,..., x„]-module has FFR (see Kaplansky, Commutative 
Rings , page 134). (Of course, this result also follows from the more difficult Quillen- 
Suslin theorem, which says that every projective k[x\,..., x,,]-module, where A: is a field, 
is free.) 

Exercises 


11.62 (i) If A —> B —> C —^ D is an exact sequence, and if X is any module, prove that there 

is an exact sequence 

A ^ B®X f ^X X C®X^ D. 

(ii) Let 



be projective resolutions of a left ^-module A. For all n > 0, prove that there are 
projective modules Q„ and Q' n with 

0,.(A, P.) © Q n = Q n (A. P'.) © Q' n . 


Hint. Proceed by induction on n > 0, using Schanuel's lemma, Proposition 7.60. 


11.63 Let 


0 —>■ B 


and 


0 —>• B 


yO 


E' 


y2 


be injective resolutions of a left B-module B. For all n > 0, prove that there are injective 
modules l n and l' n with 

l3 n (B, E*) 0 I n = U n (B, E") © If 


Hint. The proof is dual to that of Exercise 11.62. 
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11.64 Show that there are flat resolutions 

0 Z Q Q/Z 0 

and 

0 K -* F -> Q/Z 0, 
where F is free abelian, but that Z © F Q © K. 

11.65 If A is an /{-module with pd(A) = n, prove that there exists a free R -module F with 
Ext" (A, F) / (()}. 

Hint. Every module is a quotient of a free module. 

11.66 If G is a finite cyclic group of order not 1. prove that 

ID(ZG) = oo = rD(ZG). 

Hint. Use Theorem 10.107. 

11.67 ( Auslcmder ) If R is both left noetherian and right noetherian. prove that 

ID(R) = rD(R). 

Hint. Use weak dimension. 

11.68 Prove that a noetherian von Neumann regular ring is semisimple. 

Hint. See Example 11.141. 

11.69 If { M a : a € A) is a family of left /{-modules, prove that 

pd( y. Mg) = sup{prf(M„)}. 

V «=A “ sA 

11.70 If <p : R —»• R* is a ring homomorphism and A* and B* are f?*-modules, prove that there is a 
natural isomorphism 

Hom^CA*, B*) -» Hom^(A b , B b ), 
where A b is A* viewed as an R -module. 

11.71 (i) If 0 —>- A' —> A —> A" —> 0 is an exact sequence, prove that 

pd(A ) < max{pd(A'), pd(A")j. 

(ii) If the sequence in part (i) is not split and if pd(A') = pd(A") + 1, prove that 

pd(A) = ma x[pd(A l ), pd(A") j. 

11.72 Let k be a commutative ring, let c e k, and let k* = k/ck. 

(i) If M is a ^-module, define M* = M/cM. Prove that M* = (k/ck) M. 

(ii) If A* is a k*-module, define A , to be A* viewed as a K -module, as on page 980. Prove 
that A b = Horn ^(k/ck. A*). Conclude that M M* and A* i-* A b form an adjoint 
pair of functors. 

11.73 Let (p : k —y k* be a ring homomorphism. 

(i) Prove that k* is a (k* , k)-bimodule. 

(ii) Prove that every fc*-module A* can be viewed as a fc-module, denoted by A b , and that 
A* i->- A 1 ’ gives an exact functor U : £*Mod -» ^Mod. 
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(iii) Prove that if F — Hom*(fc*, ): ^Mod —>• £*Mod, then (U, F) and (F, U) are adjoint 
pairs of functors. (These functors are called change of rings functors.) Conclude that 
U and F preserve all direct limits and all inverse limits. 

11.74 Let R be a commutative ring with FFR. Prove that every finitely generated projective R- 
module P has a free complement; that is, there is a finitely generated free F-module F such 
that P © F is a free F-module. 


11.4 Regular Local Rings 


We are now going to focus on (commutative) noetherian local rings, the main results being 
that such a ring has finite global dimension if and only if it is a regular local ring (regular 
local rings arise quite naturally in algebraic geometry), and that they are UFDs. Let us 
begin with a localization result. 


Proposition 11.154. Let R be a commutative noetherian ring. 
(i) If A is a finitely generated R-module, then 

pd(A ) = sup pd(A m ), 

m 

where m ranges over all the maximal ideals of R. 


(ii) 


D(R) = sup D(R m ), 

m 

where m ranges over all the maximal ideals of R. 


Proof, (i) We first prove that pd{A) > pd(A m ) for every maximal ideal m. If pd(A) = 
oo, there is nothing to prove, and so we may assume that pd(A) — n < oo. Thus, there is 
a projective resolution 

0 -> P„ ^ P n -1 ->-» P 0 -> A -> 0. 

Since R m is a flat F-module, by Theorem 11.28, 

0 —> R m <S> Pn ~ y Rm ® Pn—\ ^ ^ R m ® Po A m —> 0 

is a projective resolution of A m , and so pd(A m ) < n. (This implication does not need the 
hypotheses that R be noetherian or that A be finitely generated.) 

For the reverse inequality, it suffices to assume that sup m pd(A m ) — n < oo. Since 
R is noetherian. Theorem 11.146(i) says that pd(A) = fd(A). Now pd(A m ) < n if and 
only if Tor^’i M m , B m ) = {0} for all /( m -modules B m , by Lemma 11.138. However, 
Proposition 11.35 gives an isomorphism To/'™! L4 m , B m ) = (Tor ; ^ +I (A, B )) m . There¬ 
fore, - I’or^ + | (A, B) — {0}, by Proposition 11.3l(i). We conclude that n > pd(A). 

(ii) This follows at once from part (i), for D(R ) = sup A {pc/(A)}, where A ranges over all 
finitely generated (even cyclic) R -modules, by Theorem 11.134. • 
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We now set up notation that will be used in the rest of this section. 

Notation. We denote a commutative noetherian local ring by R , by (R. m), or by (R, m. k), 
where m is its unique maximal ideal and k is its residue field k — R /m. 

Theorem 11.134 allows us to compute global dimension as the supremum of projective 
dimensions of cyclic modules. When R is a local ring, there is a dramatic improvement; 
global dimension is determined by the projective dimension of one cyclic module: the 
residue field k. 


Lemma 11.155. Let (R. m) be a local ring with residue field k. If A is a finitely generated 
R-module, then 

pd(A) < n if and only if Tor^ +1 (A, k) = {0} 

Proof Assume pd(A ) < n. By Example 11.135(ii), we have fd(A) < pd(A), so that 
T°r^ +1 (A, B) — {0} for every /(-module B. In particular, Tor* + f A. k ) = {0}. 

We prove the converse by induction on n > 0. For the base step n — 0, we must prove 
that Tor^(A, k) = {0} implies pd{A) — 0; that is, A is projective (hence free, since R 
is local). Let {ai,... , a r ] be a minimal set of generators of A (that is, no proper subset 
generates A), let F be the free /(-module with basis {e\, ..., e r }, and let q >: F —> A be the 
/(-map with (pie,) = a,-. There is an exact sequence 

o->/v-Ae-^a-»o 

where N = ker <p and i is the inclusion; as in the proof of Proposition 11.23, 

N c m F. 

Since Tor^(A, k) = {0}, the sequence 

0->N® R k^lF® R k^lA® R k->0 

is exact. Tensor 0—► m —»• /? —> k Oby N; right exactness gives a natural isomorphism 

rjv: N ® R k —>■ N/mN; 

if n e N and b e k, then xn : n ® b i-^ n + m N. There is a commutative diagram 


0- N k " 8ll a - F ® R k 


XN 

v 


N/mN 


t f 

y 


F/xnF, 


where i: n + m N (->■ n + m F. Since i ® 1 is an injection, so is i. But N C m F says that 
the map i is the zero map. Therefore, N/mN = {0}, so that N = m/V. By Nakayama’s 
lemma. Corollary 8.32, N — {0}, and so q >: F -> A is an isomorphism; that is, A is free. 
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For the inductive step, we must prove that if Tor^ +2 (A, k ) = {0}, then pd(A) < n + 1. 
Take a projective resolution P. of A , and let Q„ (A . P.) be its nth syzygy. Since P. must 
also be a flat resolution of A, we have Y n (A , P.) = L!„(A, P.). By Proposition 11.136, 
Tor ; ^ +2 (A, k ) = Tor^(T„(A, P.), k). The base step shows that Y„(A , P.) = f2„(A, P.) is 
free, and this gives pd(A) < n + 1, by Lemma 11.123. • 


Corollary 11.156. Let (R. m) be a local ring with residue field k. If A is a finitely 
generated R-module, then 


pd{A) = sup{/' : Torf (A, k) ^ {0}}. 

Proof. Let n — sup{/ : Tor ( ft (A, k) {0}}. Then pd(A) < n — 1, but pd(A) f n\ that 
is, pd (A) =n. • 


Theorem 11.157. 

(i) 


(ii) 


Let R be a local ring with residue field k. 

D{R) < n if and only if Tor^ +1 (k, k) = {0}. 
D(R) = pd(k). 


Proof, (i) If D(R) < n, then Lemma 11.155 applies at once to give Tor „ +l (k, k ) = {0}. 
Conversely, if Tor^ +1 (k, k) — {0}, the same lemma gives pd(k ) < n. By Lemma 11.138, 
we have Tor^ +1 (A, k) — {0} for every /^-module A. In particular, if A is finitely gener¬ 
ated, then Lemma 11.155 gives pd(A) < n. Finally, Theorem 11.134 shows that I) ( R ) = 
sup^ipt/M)}, where A ranges over all finitely generated (even cyclic) R -modules. There¬ 
fore, D(R) < n. 

(ii) Immediate from part (i). • 


Definition. A prime chain of length n in a commutative ring R is a strictly decreasing 
chain of prime ideals 

Po 2 pl 2 ••• 2 pn- 

The height ht(p) of a prime ideal p is the length of the longest prime chain with p = po. 
Thus, ht(p) < oo. 


Example 11.158. 

(i) If p is a prime ideal, then ht(p) = 0 if and only if p is a minimal prime ideal. If R is a 
domain, then ht(p) = 0 if and only if p = {0}. 

(ii) If R is a Dedekind ring and p is a nonzero prime ideal in R, then ht(p) = 1. 
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(iii) Let k be a field and let R — k[X] be the polynomial ring in infinitely many variables 
X — {xi,X2, ■ ■ ■}■ If Pi = (xi,Xi+ 1 , ...). then p, is a prime ideal (R/pi = k[x i, ... , x,_i] 
is a domain) and, for every n > 1, 


P! D p 2 D ••• D p„ +1 

is a prime chain of length n. It follows that ht(pi ) = oo. < 


Definition. If R is a commutative ring, then its Kriill dimension is 
dim(/?) = sup{ht(p) : p e Spec(W)}; 

that is, dim(/?) is the length of a longest prime chain in R. 

If R is a Dedekind ring, then dim (A 1 ) = 1, for every nonzero prime is a maximal ideal; 
if R is a domain, then dim( R) — 0 if and only if R is a field. The next proposition 
characterizes the noetherian rings of Krull dimension 0. 

Proposition 11.159. If R Is a noetherian ring, then dim (A) = 0 if and only if every 
finitely generated R-module M has a composition series. 

Proof. Assume that R is noetherian with Krull dimension 0. Since R is noetherian. 
Corollary 6.120(iii) says that there are only finitely many minimal prime ideals. Since 
dim(A) = 0, every prime ideal is a minimal prime ideal (and a maximal ideal). We con¬ 
clude that R has only finitely many prime ideals, say, pi,....p„. Now nil(/?) = p|" =1 Pi 
is nilpotent, by Exercise 11.39 on page 938; say, (nil(/?)) m = {0}. Define 

N = Pi • • • p„ C Pi n • • • n p„ = nil(tf), 

so that 

IV m = (Pl---Pn) m = {0}. 

Let M be a finitely generated A 1 -module, and consider the chain 


M3p[M3 P 1 P 2 M 2 • •• 2 NM. 


The factor module pi • ■ • p,_iM/pi • • • p,M is an ( R /p,(-module; that is, it is a vector 
space over the field R/p, (for p,- is a maximal ideal). Since M is finitely generated, the 
factor module is finite-dimensional, and so the chain can be refined so that all the factor 
modules are simple. Finally, repeat this argument for the chains 

N j M 3 pi N J M 3 pip 2 ACM 3 ■ ■ ■ D N J+l M. 


Since N m = {0}, we have constructed a composition series for M. 

Conversely, if every finitely generated A 1 -module has a composition series, then the 
cyclic A’-module R has a composition series; say, of length l. It follows that any ascending 
chain of ideals has length at most l, and so R is noetherian. To prove that dim( R) — 0, 
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we must show that R does not contain any prime ideals p D q. Passing to the quotient ring 
R/ q, we may restate the hypotheses: R is a domain having a nonzero prime ideal as well 
as a composition series R O f 2 • • • 2 Id ¥= {0}- The last ideal I,/ is a minimal ideal; 
choose a nonzero element x e Id- Of course, x I d C I d \ since R is a domain, xlj ^ {0}, 
so that minimality of Id gives x/,/ = /,/. Hence, there is y e Id with xy — x\ that is, 
1 — y & Id, and so Id — R. We conclude that R is a field, contradicting its having a 
nonzero prime ideal. • 

We are going to prove a theorem of W. Krull, the principal ideal theorem, which implies 
that every prime ideal in a noetherian ring has finite height. Our proof is Kaplansky’s 
adaptation of a proof by D. Rees. We begin with a technical lemma. 

Lemma 11.160. Let a and b be nonzero elements in a domain R. If there exists c e R 
such that ca 2 e (b) implies ca e ( b ), then the series (a, b) 3 (a) 3 (a 1 ) and (a 2 , b ) 3 
(a 2 , ab) 3 (a 2 ) have isomorphic factor modules. 1 

Proof. Now (a, b)/(a) = (a 2 , ab)/(a 2 ), for multiplication by a sends (a, b) onto (a b , ab) 
and (a) onto (a 2 ). 

The module (a)/(a 2 ) is cyclic with annihilator (a); that is, (a)/{a 2 ) = R/(a). The 
module (a 2 , b)/(a 2 , ab) is also cyclic, for the generator a 2 lies in (a 2 , ab). Now A = 
ann ((a 2 , b)/(a 2 , ab)) contains (a), and so it suffices to prove that A = (a); that is, if 
cb — ua 2 + vab, then c e (a). This equation gives ua 2 e (b), and so the hypothesis 
give ua = rb for some r e R. Substituting, cb = rab + vab, and canceling b gives 
c — ra + va e (a). Therefore, (a)/(a 2 ) = (a 2 , b)/{a 2 , ab). • 

Recall that a prime ideal p is minimal over an ideal / if / c p and there is no prime 
ideal q with / c q C p. 

Theorem 11.161 (Principal Ideal Theorem). Let (a) be a proper ideal in a noetherian 
ring R, and let p be a prime ideal minimal over (a). Then ht(p) < 1. 

Proof. If, on the contrary, ht(p) > 2, then there is a prime chain 

P 2 Pi 2 P2- 

We normalize the problem in two ways. First, replace R by R /po; second, localize at p/p 2 - 
We now modify the hypotheses accordingly: R is a local domain whose maximal ideal m 
is minimal over a proper principal ideal (x), and there is a prime ideal q with 

m 2 q 2 (0). 

Choose a nonzero element be q, and define 

/, = ((b) : x 1 ') = {ceR : cx { e (b)}. 

7 Our notation for ideals is not consistent. The principal ideal generated by an element a € R is sometimes 
denoted by (a) and sometimes denoted by Ra. 
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There is an ascending chain I\ C / 2 C • • •, that must stop, because R is noetherian: say, 
/„ = I„ + [ = • • •. It follows that if c e h n , then c e that is, if ex 2 " e (b), then 
cx n e ( b ). If we set a = x ", then ca 2 e (b) implies ca e ( b ). 

If R* — R/(a 2 ), then dim(/?*) = 0, for it has exactly one prime ideal. By Propo¬ 
sition 11.159, the /^-module (a, b)/{a 2 ) (as every finitely generated /?*-module) has fi¬ 
nite length I (the length of its composition series). But Lemma 11.160 implies that both 
(i a,b ) and its submodule (a 2 , h) have length l. The Jordan-Holder theorem says this 
can happen only if (a 2 , b) — (a, b ), which forces a e (a , b ): there are s,teR with 
a — sa 2 + tb. Since sa e m, the element 1 — sa is a unit (for (R, m) is a local ring). 
Hence, — a(l — sa) — tb e (b) gives a e (b) C q. But a = x n gives x e q, contradicting 
m being a prime ideal minimal over (x ). • 

We now generalize the principal ideal theorem to finitely generated ideals. 

Theorem 11.162 (Generalized Principal Ideal Theorem). Let I — (at,..., a„) be a 

proper ideal in a noetherian ring R, and let p be a prime ideal minimal over I. Then 
ht(p) < n. 

Proof. The hypotheses still hold after localizing at p, so we may assume that R is a local 
ring with p as its maximal ideal. 

The proof is by induction on n > 1, the base step being the principal ideal theorem. Let 
I — (at,..., fl/i+t), and assume, by way of contradiction, that ht(p) > n + 1: there is a 
prime chain 

P = Po 2 pl 2 • •• 2 Pn+l■ 

We may assume there are no prime ideals strictly between p and p i, for the module p/p i has 
ACC. Now I £ pi, because p is a prime ideal minimal over I. Reindexing the generators 
of I if necessary, a\ </ pi. Hence, (a\. pi) D pi. We claim that p is the only prime ideal 
containing (ai, pi); there can be no prime ideal p' with (ai, pi) c p' C p (the second 
inclusion holds because p is the only maximal ideal in R) because there are no prime 
ideals strictly between p and pi. Therefore, in the ring R/(a \, pi), the image of p is the 
unique nonzero prime ideal. As such, it must be the nilradical, and hence it is nilpotent, 
by Exercise 11.39 on page 938. There is an integer m with p"' C (c/|, pi), and so there are 
equations 

a™ = r,ai + bj , r, e R, bj e pi, and i > 2. (7) 

Define J = (b 2 , ■. ■, b n+ \). Now J C pi, while ht(pi) > n. By induction, pi cannot be a 
prime ideal minimal over /, and so there exists a prime ideal q minimal over J: 

J c q C pi. 

Now af e (a\. q) for all i, by Eq. (7). Thus, any prime ideal p' containing (ci]. q) must 
contain all a hence all cij, and hence I. As p is the unique maximal ideal, / C p' C p, 
But p is a prime ideal minimal over /, and so p' = p. Therefore, p is the unique prime ideal 
containing (ai , q). If R* = R/q, then p* = p/q is a prime ideal minimal over the principal 
ideal (c/| + q). On the other hand, ht(p*) > 2, for p* D p* D {0} is a prime chain, where 
p* = pi/q. This contradiction to the principal ideal theorem completes the proof. • 
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Corollary 11.163. If R is a noetherian ring, then every prime ideal has finite height, and 
so Spec {R) has DCC. 

Proof. Every prime ideal p is finitely generated, because R is noetherian; say, p = 
(ai,... ,a n ). But p is a minimal prime ideal over itself, so that Theorem 11.162 gives 
ht(p) < n. • 

A noetherian ring may have infinite Krull dimension, for there may be no uniform bound 
on the length of prime chains. We will see that this cannot happen for local rings. 

The generalized principal ideal theorem bounds the height of a prime ideal that is min¬ 
imal over an ideal; the next result bounds the height of a prime ideal that merely contains 
an ideal. 

Corollary 11.164. Let R he a noetherian ring, let I — (a i ,... ,a n ) be an ideal in R, and 
let p be a prime ideal in R containing I. //ht(p//) denotes the height of p/I in R/I, then 

ht(p) <n+ ht(p/7). 

Proof. The proof is by induction on h = ht(p//) > 0. If h = 0, then Exercise 11.76 
on page 1011 says that p is minimal over I, and so the base step is the generalized prin¬ 
cipal ideal theorem. For the inductive step h > 0, p is not minimal over I. By Corol¬ 
lary 6.120(iii), there are only finitely many minimal primes in R/I, and so Exercise 11.76 
says that there are only finitely many prime ideals minimal over I; say, q i, ..., q 9 . Since p 
is not minimal over I, p £ q? for any i ; hence. Proposition 6.14 says that p £ qi U • • • U q.,, 
and so there is y € p with y q, for any i. Define J = (I, y). 

We now show, in R/ J, that ht(p/7) < h — 1. Let 

p/J D pi /J 2 ■■■ 2 Pr/J 

be a prime chain in R/J. Since I C. J, there is a surjective ring map R/I -» R/ J. The 
prime chain lifts to a prime chain in R/I: 

p/I 2 Pl/I 2 ••• 2Pr/I- 

Now p r 3 / D /, and J — (I, y) does not contain any q,. But the ideals q,// are the 
minimal prime ideals in R/I, by Exercise 11.76, so that p,- is not a minimal prime ideal 
in R. Therefore, there is a prime chain starting at p of length r + 1. We conclude that 
r + 1 < h, and so ht(p/7) < h — 1. 

Since J = (I,y ) = (a\, ..., a n . y) is generated by n + 1 elements, the inductive 
hypothesis gives 

ht(p) < n + 1 + ht(p/./) 

= (n + 1) + (h — 1 ) — n + h — n + ht(p//) - • 

When we say, in the next proposition, that a generating set X of an ideal I is minimal, 
we mean that no proper subset of X generates I. 

If (R, m, k) is a local ring, then m/m 2 is an (7?/m)-module; that is, it is a vector space 
over k. 



992 


Commutative Rings III Ch. 11 


Proposition 11.165. Let (R , m, k ) be a noetherian local ring. 

(i) Elements x\,... ,Xd form a minimal generating set for m if and only if the cosets 
x* — Xi + tn 2 form a basis of m/m 2 . 

(ii) Any two minimal generating sets of m have the same number of elements. 


Proof, (i) If xi,..., x ( i is a minimal generating set for m, then X* = x*,... ,x*j spans 
the vector space m/m 2 . If X* is linearly dependent, then there is some = Ew.'S'J. 
where r', e k. Lifting this equation to m, we have x; e Jf^/rjXj + m 2 . Thus, if 
B — (xj : j i), then B + m 2 = m. Hence, 

m (m/B) — (B + m 2 )/B = m/5. 

By Nakayama’s lemma, m/B = {0}, and so m = B. This contradicts x\,..., xj being a 
minimal generating set. Therefore, X* is linearly independent, and hence it is a basis of 
m/m 2 . 

Conversely, assume that x *,..., x*j is a basis of m/m 2 , where xf = x; + m 2 . If we 
define A = (xi,..., xf), then A c m. If y e m, then y* = r'x*, where r'. e k, so that 

y e A + rrr. Hence, m = A + m 2 , and, as in the previous paragraph, Nakayama’s lemma 
gives m = A; that is, x \,..., Xd generate m. If a proper subset of xi,..., Xd generates m, 
then the vector space m/m 2 could be generated by fewer than d elements, contradicting 
dirm-lm/m 2 ) = d. 

(ii) The number of elements in any minimal generating set is dim^(m/m 2 ). • 

Definition. If ( R , m, k ) is a noetherian local ring, then m/m 2 is a finite-dimensional 
vector space over k. Write 

/x(m) = dimfc(m/m 2 ). 

Proposition 11.165 shows that all minimal generating sets of m have the same number 
of elements, namely, //(m). 

Corollary 11.166. If (R, m) is a noetherian local ring, then ht(m) < /i(m), and 

dim(5) < |U,(m). 

Proof. If ijl( m) = d, then m = (xi,..., Xd). Since m is obviously a minimal prime over 
itself. Theorem 11.162, the generalized principal ideal theorem, gives ht(m) < d = // (m). 

If p ^ m is a prime ideal in R , then any prime chain, p = po I) Pi D ■ ■ ■ D p/,, can 
be lengthened by to a prime chain m 2 Po 2 Pi 2 ’ ’ ’ 2 P h of length h + 1. Therefore, 
h < /r(m), and so dim(f?) = ht(m) < /x(m). • 
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Definition. A regular local ring is a noetherian local ring ( R , m) such that 

dim(R) = /r(m). 

It is clear that every field is a regular local ring of dimension 0, and every DVR is a 
regular local ring of dimension 1. It is not clear from the definition whether there are any 
other examples. The coming notion of regular sequence will enable us to better understand 
regular local rings. Recall that if M is an R-module, then an element c e R is called regular 
on M if the map M —> M, given by m cm, is an injection; that is, cm = 0 implies 
m — 0 . 

Definition. A sequence xi,..., x n in a commutative ring R is an M-regular sequence 
if x\ is regular on M, X 2 is regular on M/{x\)M, X 3 is regular on M/(x 1 , X 2 )M ,..., x n is 
regular on M/(x 1 , • • • , x n -\)M. If M — R, then x \,..., x n is also called an R-sequence. 

For example, if R = k[x \,..., x n ] is a polynomial ring over a field k, then it is easy to 
see that xi,..., x n is an R-sequence. 

Exercise 11.75 on page 1011 gives an example of a permutation of an R-sequence that 
is not an R-sequence. However, if R is local, then every permutation of an R-sequence is 
also an R-sequence (see Bruns-Herzog, Cohen-Macaulay Rings, page 5). 

The generalized principal ideal theorem gives an upper bound on the height of a prime 
ideal; the next lemma gives a lower bound. 

Lemma 11.167. Let R be a commutative ring. 

(i) If x e R is not a zero divisor, then x lies in no minimal prime ideal. 

(ii) Ifp is a prime ideal in R and rep is not a zero divisor, then 

ht(p/(x))+l <ht(p). 

(iii) If a prime ideal p in R contains an R-sequence xi,..., Xd, then 

d < ht(p). 

Proof, (i) Suppose, on the contrary, that p is a nonzero minimal prime ideal containing 
x. Now R p is a ring with only one nonzero prime ideal, namely, p p , which must be the 
nilradical. Thus, x/1, as every element in p p , is nilpotent. If x m /l = 0 in R p , then there is 
0 <j p (so that o f 0) with ax = 0, contradicting x not being a zero divisor. 

(ii) If h — ht(p/(x)), then there is a prime chain in R/(x ): 

P/CO 2 Pi/CO 2 • • • 2 Ph/(x). 

Lifting back to R, there is a prime chain p D pi D • • • D p;, with p/ ; 3 (x). Since x is not 
a zero divisor, part (i) says that p/ ; is not a minimal prime. Therefore, there exists a prime 
ideal p/)+i properly contained in pwhich shows that ht(p) > h + 1. 
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(iii) The proof is by induction on d > 1. For the base step d — 1, suppose, on the contrary, 
that ht(p) = 0; then p is a minimal prime ideal, and this contradicts part (i). For the 
inductive step, part (ii) gives ht(p/(xi)) + 1 < ht(p). Now p/(xi) contains an (R/(pc re¬ 
sequence X 2 + (xi),..., xj + (xi), by Exercise 11.79(h) on page 1012, so that the inductive 
hypothesis gives d — 1 < ht(p/(xi)). Therefore, part (ii) gives d < ht(p). • 


Proposition 11.168. Let (R. m) be a noetherian local ring. If tu can be generated by an 
R-sequence xi,..., Xd, then R is a regular local ring and 

d — dim(R) = m). 

Remark. We will soon prove the converse: In a regular local ring, the maximal ideal can 
be generated by an /^-sequence. 4 

Proof. Consider the inequalities 

d < ht(m) < m) < d. 

The first inequality holds by Lemma 11.167; the second by Corollary 11.166, and the third 
by Proposition 11.165. It follows that all the inequalities are, in fact, equalities, and the 
proposition follows because dim(R) = ht(m). • 

Example 11.169. 

Let k be a field, and let R — k [ [x i, ..., x,- ] ] he the ring of formal power series in r variables 
xi, .... x r . Recall that an element f & R is a sequence 


/ = (fo,f\,fl. 


where /,, is a homogeneous polynomial of total degree n in k[x \,..., x r ], and that multi¬ 
plication is defined by 


(ft), ft, f 2 , ■ ■ -)(go, gi,g 2 , ■■■)= (ho, h\,hi, ■ ..), 

where h„ — 12i+j— n figj- We claim that R is a local ring with maximal ideal m = 
(xi,..., Xr) and residue field k. First, R/xn = k, so that m is a maximal ideal. To see 
that m is the unique maximal ideal, it suffices to prove that if / e R and / f m, then 
/ is a unit. Now / ^ m if and only if /o 0, and we now show that / is a unit if and 
only if /o 0. If fg — 1, then fogo — 1, and /o ^ 0; conversely, if /o ^ 0, we can 
solve (/ 0 , /i, h,.. .)(g 0 , gi,g2, ■■■) = 1 recursively for g n , and fg = 1 if we define 
g — (go. gi, g2, ■ ■ •)• 

Exercise 11.83 on page 1012 shows that the ring R is noetherian. But R/(x\, ..., x,_i) 
is a domain, because it is isomorphic to £[[x,- ,.... x r ]], and so x,- is a regular element on 
it. Hence, Proposition 11.168 shows that R = k[[x i,..., x r ]] is a regular local ring, for 
Xi,..., x r is an R-sequence. ◄ 

The next lemmas prepare us for induction. 
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Lemma 11.170. Let (R, m, k) be a noetherian local ring, and let x & m — m 2 . 

(i) If x i + (x),..., x, + (x) is a minimal generating set of m/xm, then x, xj , ...,x s is 
a minimal generating set of m. 

(ii) 

/i(m/xm) = /Am) — 1. 

Proof, (i) Write R — R/(x), m = m/xm, and r — r + (x) for all r e R. To see that 
x, x\,... ,x s generate m, let y e m. Then y = JA r-Xj, where r' e k. Lifting to R gives 
y — r/Xi e (x), where r'. = r\ + m. Therefore, there is r e R with y = rx + lyxi. 

To prove minimality. Proposition 11.165 says that it suffices to show that the cosets 
x* = x + m 2 , x* = Xj + m 2 form a basis of m/m 2 . These elements span m/m 2 because 
x, xi, .... x s generate m. To prove linear independence, assume that a'x* + fx* — 0, 
where a' , al e k. Lifting to R, we have 

ax + ^2 aiXj e m 2 , (8) 

and we must show that a , a; e m (for then a', a' = 0 in k = R/m). In R = R/(x), this 
relation becomes 

^2 djXj e m 2 . 

i 

As xi,..., x s is a basis of m/m 2 , we have al — 0 for all i ; that is, at e m for all i. It 
follows from Eq. (8) that ax e m 2 . Since x </ m 2 , it follows that a e m, as desired. 

(ii) This follows at once from part (i). • 


Lemma 11.171. Let (R , m) be a regular local ring. If x e m — m 2 , then R / (x) is regular 
and dim(f?/(x)) = dim(/?) — 1. 

Proof. Since R is regular, we have dim (A 1 ) = /Am). Let us note at the outset that 
dim(/?) = ht(m). We must show that ht(m*) = /x(m*), where m* = m/(x). By Corol¬ 
lary 11.164, ht(m) < ht(m*) + 1. Hence, 

ht(m) - 1 < ht(m*) 

< dim*) 

— /i(m) — 1 
= ht(m) - 1. 

The next to last equation is Lemma 11.170; the last equation holds because R is reg¬ 
ular. Therefore, dim(/C) = ht(m*) = /Am*), and so R* — R /(x) is regular with 
dim(f?/(x)) = dim(/?) — 1. • 

We are now going to prove that regular local rings are domains, and we will then use 
this to prove the converse of Proposition 11.168. 
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Proposition 11.172. Every regular local ring (R. m) is a domain. 

Proof. The proof is by induction on d — dim( R ). If d = 0, then R is a field, by Exer¬ 
cise 11.78 on page 1012. If d ^ 0, let p i ,..., be the minimal prime ideals in R (there 
are only finitely many such, by Corollary 6.120). If m — m 2 C pj U • • • U p s , then Propo¬ 
sition 6.14 would give m c p ; , which cannot occur because d — ht(m) > 0. Therefore, 
there is x e m — m 2 with x f. p, for all i. By Lemma 11.171, R/(x) is regular of dimen¬ 
sion d — 1. The inductive hypothesis gives R/(x) a domain, and so (x) is a prime ideal. It 
follows that (x) contains a minimal prime ideal; say, p, C (x ). 

If p,- = {0}, then {0} is a prime ideal and R is a domain. Hence, we may assume that 
p,- yL {0}. For each nonzero y e p/, there exists r e R with y = rx. Since x f p,. we 
have r e p/, so that y e xp;. Thus, p; C xp; C mp;. As the reverse inclusion mp, C p,- 
is always true, we have p; = mp/. Nakayama’s lemma now applies, giving p/ = {0}, a 
contradiction. • 

Proposition 11.173. A noetherian local ring ( R. tu, k) is regular if and only if m is 
generated by an R-sequence x\, ..., x</. Moreover, in this case, 

d — ii( m). 

Proof. We have already proven sufficiency, in Proposition 11.168. If R is regular, we 
prove the result by induction on d > 1, where d — dim(l?). The base step holds, for R is a 
domain and so x is a regular element; that is, x is not a zero divisor. For the inductive step, 
the ring R/(x) is regular of dimension d — 1, by Lemma 11.171. Therefore, its maximal 
ideal is generated by an (/?/(x))-sequence x* , ..., x5 j. By Lemma 11.170, a minimal 
generating set for m is x, xi, ..., x</_i. Finally, this is an R -sequence, by Exercise 11.79(i) 
on page 1012, because x is not a zero divisor. • 

We are now going to characterize regular local rings by their global dimension. 

Lemma 11.174. Let (R. m, k) be a local ring. If A is an R-module with pd(A) — n and 
if x € m is A-regular, then pd(A/xA) = n + 1. 

Proof. Since x is ,4-regular, there is an exact sequence 

0-> A A -> A/xA -> 0, 

where ji x : a i->- xa. By Lemma 11.147, we have pd(A/xA) < n + 1. 

Consider the portion of the long exact sequence arising from tensoring by k: 

0 = Tor,? +1 (A, k) -> Tor,f +1 (A/xA, k) -U Tor,f (A, k) Tor,f (A, k). 

Now pd{A) < n if and only if Tor^ +1 (A, k ) = {0}, by Lemma 11.155, and so the first 
term is {0}. The induced map (n_ x )* is multiplication by x. However, if p! x \ k —> k is 
multiplication by x, thenx e m implies /f x = 0; therefore, (/x v )* = (li[ x )* = 0. Exactness 
now gives 3: Tor^ +1 (A/xA, k) —>• Tor ; f(A,k) an isomorphism. Since pd(A) — n, we 
have Tor ; f (A, k) ^ {0}, so that Tor^ +1 (A/xA, k) {0}. Therefore, pd(A/xA) > n + 1, 
as desired. • 
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Proposition 11.175. If {R, m, k) is a regular local ring, then 

D(R ) = /i,(m) = dim(/?). 

Proof. Since R is regular. Proposition 11.173 says that m can be generated by an R- 
sequence x\,.. x ( j. Applying Lemma 11.174 to the modules R, R/(x i), R / (x\ , xf), ■ .., 
R/(xi,, Xd) = R/ m = k, we see that pd(k) = d. By Proposition 11.168, d — /r(m) = 
dim(/?). On the other hand. Theorem 11.157(h) gives d — pdik ) = D(R). • 

The converse of Proposition 11.175, A noetherian local ring of finite global dimension 
is regular, is more difficult to prove. 

Lemma 11.176. Let (R , m, k) be a noetherian local ring of finite global dimension. If 
/r(m) < D(R) and D(R) < d, where d is the length of a longest R-sequence in m, then R 
is a regular local ring. 

Proof. By Corollary 11.166, dim(/?) < /r(m). By hypothesis, /i(m) < D(R) < d, while 
Lemma 11.167 gives d < ht(m) = dim (71). Therefore, dim( R) = /x(m), and so R is a 
regular local ring. • 

Let R be a noetherian ring, let M be a finitely generated R -module, and let I be an ideal 
such that IM M. By Exercise 11.82 on page 1012, I contains a longest M-sequence 
(such sequences are usually called maximal M-sequences in I). We are going to prove, 
given an ideal I and a finitely generated /^-module M, that all maximal M-sequences in I 
have the same length. 

Definition. If R is a commutative ring, then an associated prime ideal of a nonzero 
/^-module II is a prime ideal of the form ann(/?) for some nonzero b e B. 

Lemma 11.177. Let B be a nonzero finitely generated module over a noetherian ring R. 

(i) The maximal elements in T(B) = {ann(7>) : b e B and b 0} are associated prime 
ideals of B. 

(ii) There are finitely many associated prime ideals of B, say, pi, ..., p s , such that 


Z(B) = pi U - •• Up s , 

where Z(B) = {r e R : rh = 0 for some nonzero b e 11 }. 

Proof, (i) The set of ideals T(B) has maximal elements, because R is noetherian. Let 
ann(Z?) be such a maximal element. Suppose that rs e ann(fo), where r, s e R and r f 
ann(b). Now ann (b) C ann(r7>), for if ub — 0, then u(rb) = 0; by maximality, ann (b) = 
ann (rb). Hence, s e ann (rb) implies s € ann(Z?), and so ann(7?) is a prime ideal. 

(ii) For each r e Z(B), there is a nonzero b e B with rb — 0; that is, Z(B) = 
Uann (b)eT(B) ann (^)- If we denote the set of maximal elements in !F(B) by 971, then 
Z(B) = Upewt P’ f° r every ann(7?) e TIB ) is contained in a maximal element. 
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It suffices to prove that 971 is finite. Define B' — (b : ann(Z?) e 971). Now B’ is finitely 
generated, for R noetherian implies that every submodule of a finitely generated R -module 
is itself finitely generated; let B' = (b\,..., b n ) , and denote ann(/?,•) by p,-. Suppose there 
is q = ann(foo) e 9ft with bo bj for i = 1As bo € B r , there are r,- e R with 
bo — J2i r ibi■ It follows that if r e P|, p,, then rbo = 0; that is, P|, p/ c ann(7?o) = q. 
Since q is a prime ideal. Proposition 6.13 gives p, C q for some i. As p, is a maximal 
element in !F(B), we have q = p,, as desired. • 


Remark. The set Ass( B) of all associated primes of an /^-module B is important in 
deeper studies [971 may be a proper subset of Ass(B)]. For example, it is related to primary 
decompositions (see Matsumura, Commutative Ring Theory, pages 39 - 42). ■* 

The next lemma is a generalization of the observation that Hom^(I m , I„) = {0} if 
(m, n ) = 1. 

Lemma 11.178. Let R be a commutative ring, and let A and B be R-modules. 

(i) //ann(A) contains a B-regular element, then Hom^fA, B ) = {0}. 

(ii) Conversely, let R be noetherian, and let A and B be finitely generated R-modules. 
//Hom^fA, B) = {0}, then ann(A) contains a B-regular element. 

Proof, (i) If r e ann(A), then ra — 0 for all a e A. Hence, for all / e Homs (A, B), 
we have 0 = f (ra) = rf(a). On the other hand, if r is B-regular, then rf (a) — 0 implies 
f (a) = 0, and so / = 0. 

(ii) Assume, on the contrary, that ann(A) contains no /i-regular elements; that is, ann(A) C 
Z(B). By Lemma 11.177, there are finitely many associated prime ideals of B, say, 
pi, ..., pi, such that ann(A) C Z(B) = pi U • • • U p^, and so Proposition 6.14 says 
that there is some p = p,- with ann(A) C p. 

Suppose that A p = {0}. If A = (a i, ..., a „), then there are a/ ^ p with a,a; = 0, by 
Proposition 11.25. Since p is prime, cr = cti(T 2 • • • <y n f P- But a e ann A = / c p, and 
this is a contradiction. Therefore, A p {0}. 

We wish to prove that Horn/A A. B) {0}. By Lemma 11.32, it suffices to prove 
that Hom^fA, B) p = Hom^pfAp, B p ) ^4 {0}. Thus, we may assume that (R, p, k) is a 
local ring with maximal ideal p and residue field k. Now there is an element b e B with 
ann(Z?) = p, so that (b) = R/p = k. Hence, there is a nonzero map <p: k —> B (taking 
1 b). Since A p ^4 {0}, Nakayama’s lemma gives A/pA ^4 {0}. But A/pA is a nonzero 

vector space over k, so there exists a nonzero map A/p A —> k. The composite of this map 
followed by rp is a nonzero map A -* B, and so Horn/A A. B) f {0}. • 


Lemma 11.179. Let R be a commutative ring, let A and B be R-modules, and let 
x \,..., x n be a B-sequence in ann(A). If I = (jci ,..., x n ), then 


Hom s (A, B/IB) = Ext'/(A, B). 
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Proof. The proof is by induction on n > 0. We define / = {0} in case n = 0, and so 
the base step holds. Assume now that x\,..., x n +\ is a B-sequence in ann(A), that I = 
(xi,..., x„+i), and that J = (xi,..., x„). Observe first that there is an exact sequence 
0 —y B —y B —y B/x\B -> 0, for xi is a regular element on B. Consider the portion of 
the long exact sequence, where x\ is multiplication by xi: 

Ext^(A, B) -^4 Ext* (A, B) Ext'^A, B/xiB) -> Ext'j+'lA, B) -^4 Ext* +1 (A, B). 
Since xj e ann(A), the induced map xi * is the zero map, and there is a short exact sequence 

0 -* Ext^(A, B) -> Ext^(A, B/xiB) Ext'^ +1 (A, B) -> 0. 

By induction, Horn* (A, B / J B) = Ext* (A, B). Multiplication by x„+i: B/ JB —> B/ J B 
is an injection, because x„+\ is (B/ J B (-regular, and left exactness of Horn* (A. ) shows 
that (x„+i)* is an injection Hom*(A, B/JB) —> Hom*(A, B/JB). On the other hand, 
(x„+i)* is the zero map, for x„+i e ann(A). Hence, Horn R (A, B/JB) — {0}, and 
Ext* (A, B) = {0}. Therefore, 3: Ext* (A, B/xjB) —>■ Ext* +1 (A, B) is an isomorphism. 
By induction, if B' = B/x\B, then Hom*(A, B'/{x 2 ,..., x„+i)B') = Ext*(A, B/x\B). 
But 

B'/ (x 2 ,..., x„+i) B' = (B / x\B) / (I B / x\B) = B/IB , 
so that Horn* (A, B/IB ) = Ext* (A, B/x\ B). We conclude that Hom*(A, B/IB) = 
Ext* +1 (A, B), as desired. • 

The following result is due to D. Rees. 

Proposition 11.180. Let R be a commutative noetherian ring, B a finitely generated R- 
module, and I an ideal with IB B. Then all maximal B-sequences in I have the same 
length, say, g, where 

g = min{/ : Ext*(R//, B) £ {0}}. 

Proof. Let xi,.... ,,x g be a maximal B-sequence in I. For all i — 1, 2, ..., g, define 
f = (xi,..., x,_i) (with I[ = {0}). Now Xj is a (B///B)-regular element, and so 

Ext'* -1 (/?//, B) = Hom*(R/7, B/fB) = {0}, 

by Lemma 11.179, which applies because ann(7?/7) = / D On the other hand, since 
xi,..., x g is a maximal B-sequence in 7, the ideal I contains no (B/IB )-regular elements. 
Thus, Lemma 11.178 gives 

E xt g R (R/I, B) = Horn r (R/I, B/IB ) / {0}. • 

Definition. If R is a noetherian ring, B a finitely generated R -module, and I an ideal 
such that IB f B, then the grade of B in I is 

G(I, B) — length of a maximal B-sequence in I. 

If (R, m) is a noetherian local ring, then G(m, B) is called depth of B: 

depth(B) = G(m, B). 

The number d in Lemma 11.176 is depth(R). 
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Proposition 11.181 (Auslander-Buchsbaum). Let ( R , m) be a noetherian local ring, 
and let B be a finitely generated R-module with pd(B) = n < oo. Then 

pd(B ) + depth(B) = depth!/?). 

Proof. The proof is by induction on n — pd{B) > 0. If n — 0, then B is a finitely 
generated projective /?-module, and so B is free, by Proposition 11.23. Hence, B = 
Yl"j =i Rj • where R/ = R, and so Extjfik. B) = Y^j=\ Ext q R {k, R) for all q. It follows 
that depth(B) = depth(/?), as desired. 

For the inductive step, there is an exact sequence 

0-±Q-^F->B->0, 

where F is a finitely generated free /?-module. The long exact sequence is 

Ext ‘ R (k, F ) -* E xt‘ R (k, B) -> Ext'+'(k, f 2 ) -* Ext‘+ l (k, F). 

By Lemma 11.178, Ext Q R (k, F) — Hom^(L, F ) = {0}; since F is free. Ext ' R (k, F) = {0} 
for all i > 0. Therefore, Ext ' R (k, B ) = Ext'^ 1 (k. Q) for all i > 0. It follows that 

depth( Ei) = depth! B) + 1. 

Since n — pd(B ) > 0, we have B not projective, and so pd(Q) = n — 1. By induction, 
pd(E2) + depth (O) = depth!/?). Therefore, 

depth(/?) = pd(E2) + depth(£2) 

= pd(E2) + 1 + depth(f2) — 1 
= pd{B) + depth(B). • 


Corollary 11.182. If (/?, m) is a noetherian local ring of finite global dimension, then 

D(R ) < depth!/?). 

Proof By Proposition 11.181, pd(M) < depth!/?) for every finitely generated/?-module 
M. But D(R) = swp{pd(M) : M is finitely generated}, by Theorem 11.134, and so 
D(R ) < depth!/?). • 

To complete the homological characterization of regular local rings, it remains to estab¬ 
lish the second inequality in Lemma 11.176: /x!m) < D(R ). Recall that Theorem 11.157 
shows that D(R ) = pd(k). If we write s — /i(m), we will prove that s < pd(k) by com¬ 
paring a Koszul complex for k with a minimal resolution of k. Our exposition is merely a 
more detailed version of the account given in Serre, Algebre Locale: Multiplicitys, pages 
112-116. 

If /: A —>■ B is a map of /?-modules, let / = / <S> ljt: A®r k -» B ®r k. Recall that 
A k = A/mA, as can easily be seen by tensoring the short exact sequence 0 —> m -> 
/?—>£-» 0 by A. With this identification, f: a + mA i-> f{a) + m B. 
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Lemma 11.183. Let ( R , m, k) be a noetherian local ring, let f: A —> B be a map of 
finitely generated R-modules, and let f — f ® 1^. 

(i) / is surjective if and only if f is surjective. 

(ii) If in addition, both A and B are free R-modules, then f injective implies that f is 
a (split) injection. 

Proof, (i) If / is suijective, tensor the exact sequence A —B —»■ coker / —»■ 0 by k to 
obtain the exact sequence 

A®r k —B k —»■ (coker /) <g >r k —> 0. 

Since / is surjective, (coker /) k — {0}. But (coker /) k = coker//m coker/, 
so that coker/ = mcoker/. Now coker / is finitely generated, because B is finitely 
generated, and so Nakayama’s lemma gives coker / = {0}; that is, / is surjective. 
Conversely, since ®^k is right exact, / suijective implies / surjective. 

(ii) Assume that / is injective. Let x\,..., x t be a basis of A, and let b , = f(x,) for 
i = 1 ,... ,t. Since / is injective, the elements bj = bj + mil are linearly independent in 
B/mB, and so they extend to a basis: There are c\,..., c s e B with b\,... ,b t ,c\,... ,c s 
a basis of B/mB. An application of Nakayama’s lemma, as in the proof of Proposi¬ 
tion 11.23, shows that b\, — , b u c\,..., c s is a basis of B. If we define h: B -> A 
by h(bj) — Xj and h(cj) — 0, then we see that hf — 1^, and so / is injective. • 

Definition. Let (R, m, k ) be a noetherian local ring. A map /: A —> B of R -modules is 
minimal if ker / C mA. 

Thus, the lemma says that if /: A —»• 11 is minimal, where A and II are free R -modules 
of finite rank, then /: A —> 11 injective implies / injective. 

Definition. Let (R, m, k) be a noetherian local ring, and let A be a finitely generated 
/^-module. A free resolution 



is a minimal resolution if all L n are finitely generated and ker d n C mL„ for all n > 0; 
that is, all d n are minimal. 

Proposition 11.184. Let (R, m, k) be a noetherian local ring. Every finitely generated 
R-module A has a minimal resolution. 

Proof. Since A is finitely generated, it has a minimal generating set, say, {ai,... ,a n }. Let 
Lq be the free /^-module with basis {ei,..., e n }, and define do: Lq ► A by do(ei) = a; 
for all i. We saw, in the proof of Proposition 11.23 that ker do L mLo, and so do is minimal. 
Since R is noetherian, ker do is finitely generated, and so this construction can be iterated. 
Thus, induction shows that a minimal resolution of A exists. • 
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Proposition 11.185. Let ( R , m, k) be a noetherian local ring, let A be a finitely generated 
R-module, and let 

t d 2 d\ d 0 

• • • —> L 2 — ► L i — > L o — > A — > 0 
be a minimal resolution. Then for all i > 0, 

Torf (A, k) = Li/mLj. 


Therefore, 

rank (Torf (A , k)) — rank(L,). 


Proof Deleting A from the minimal resolution gives a complex La; tensoring La by k 
gives a complex 


L A = 


— d 2 — 

l 2 -4 Lj 



0. 


Now im d[ + \ — kerd, C mL, implies dj — 0 for every i, so that //, (L ; i) = L, for all 
i > 0. On the other hand, L,i = ® ^ k, and so the definition of Tor gives //, (L a ® ^ k) = 

Torf (A, /c). Therefore, Torf (A, k) = L,- = Li/mLj. • 

Let us make an elementary observation. If (R, m, k) is a noetherian local ring and M is 
an /^-module, then we have already seen that M/mM = M <S)r k\ let us denote M/mM by 
M. A map tp\ M —* M' induces a map 7p\ M —> M by 


<p : m + m M <p(u ) + m M'\ 


if ip satisfies the additional condition im^» c m;V/', then there is a second induced map 
<p\ M/mM -> mM/m 2 M, given by 

<p(u + m M) = <p(u) + m 2 M'. 


Lemma 11.186. 

and let 


Let (R, m, k) be a local ring, let A be a finitely generated R-module, 


L2 


d 2 


d 1 


L 0 


do 


D 2 

be a minimal resolution of A. If ■ ■ ■ -> M 2 — M\ 
satisfying 


M q 


0 is a complex 


(i) each M p is a finitely generated free R-module', 

(ii) e: Mq —> A is injective', 

(iii) For all p > 0, we have D p {M p ) C mM p -\, and D p : M p —> mM p -\/m 2 M p -\, 
given by u p + m M p D p (u p ) + m 2 M p -\, is an injection ; 

then, for all p > 0, rank(M p ) < rank(L p ) = rank(Torf (A, k)). 
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Proof. We will show that each M p is isomorphic to a direct summand of L p . By The¬ 
orem 10.46, the comparison theorem, there are maps f p making the following diagram 
commute: 


D 1 £ 

Mi U- Mo —^ A 


/l 

/o 


Y 


Lx 


d 1 


Lq 


do 


0 

0. 


It suffices to find surjections g p : L p —> M p : since M p is free, hence projective, M p 
would then be isomorphic to a direct summand of L p . We claim that such maps g p exist if 
f p : M p -> L p is injective. Now M p and L p are vector spaces over k, so that the subspace 
f p (Mp) = M p is a direct summand of L p \ thus, there is a (necessarily) surjective map 
Y : L p —> M p with yf p — 1^ . Let n: M p -> M p and v: L p -» L p be the natural 
maps (regard M p = M p /mM p and L p = L p /mL p ), and consider the diagram 


Sp 

A 


Mp 



0. 


Since L p is free, there exists g p with vg p — yn\ that is, g p — yjx. Hence, g p is surjective, 
and so g p is suijective, by Lemma 11.183. 

It remains to show, by induction on p > 0, that the conditions listed in the statement 
imply each f p is injective. For the base step, dofo = s implies do / 0 = s. By hypothesis, 
both e and do are injections. However, Lemma 11.183(i) shows that both are isomorphisms, 
because both s and do are surjections. It follows that / 0 is an injection (in fact, it is even 
an isomorphism). 

For the inductive step, consider the following commutative diagram. 


Mp/mMp 


Lp/mLp 


mMp-i/m-Mp-i — — *-xnL p -i/m z L p -i. 

f p -1 


Since L. is a complex, we have im d p C kcrd p - \; since L. is a minimal resolution, we 
have kerdp-i c hence, the map d p is defined. By induction, f p _\ is injective; 

hence, / p -\D p is injective, because D p is injective, by hypothesis. Therefore, d p f p is 
injective, and this implies that f p is injective. • 

The following complex M. will be seen to satisfy the conditions in Lemma 11.186. 
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Definition. Let x \,..., x s be a sequence of elements in a commutative ring R. The 
Koszul complex M(xi,..., x s ), is defined as follows. 

M(xi,..., x s ) p = /\ P (F), 

where F is the free /^-module with basis {e\,..., e s }. The differentiations D p : f\ p (F) -> 
/ \ p ~ l {F ) are defined by 

S S 

D\ ^Tcei) — YciXi, 

i=i i=i 

where c, e R for all i (so that D\ (ej) = x,-), and, for p > 1, 

p 

D p (e il A • • • A e ip ) = TVir-Vq A • • • A 7 ir A • • • A e ip . 

r =0 

If A is an R -module, the Koszul complex M(xi,..., x s . A), is defined by 

M(xi,..., x s , A). — A®r M(xi, ..., x s ),. 

We leave the straightforward calculation that Dp -1 D p — 0 to the reader; it is similar to 
that in the proof of Lemma 10.114. Thus, the Koszul complex really is a complex. 

Note that f\°(F) — R and that imc/i = /, where I — (xi,..., x v ). In general, the 
Koszul complex is not acyclic; that is, it is not an exact sequence. However, if xi,.... x. v 
is an R -sequence, then augmenting it with the natural map e: /\°(F) —> R/I gives a free 
resolution of R/I (see Bruns-Herzog, Cohen-Macaulay Rings , page 49). 

Observe that the pth term of M(xi, ..., x s . k), is, by definition, k <g>^ /\ P (F). Since 
F is free of rank ,y, we know, from Theorem 9.140, the binomial theorem, that /\ p (F ) is 
free of rank (*), and so k /\ P (F) is a vector space over k of dimension (*). Thus, if 

we denote /\ P (F) by M p , as in Lemma 11.186, then k <S>r f\ P (F) is M p . 

If xi,...,x s is a minimal generating set for m, then Proposition 11.165 says that 
x*, ..., x* is a basis for m/m 2 , where x* = x,- + m 2 . If M is an /^-module, then there is 
an isomorphism mM/m 2 M —> (m/m 2 ) M, given by 

y, Xi Vi + m 2 M i-> y x* ® Vi , 
i i 


where r>, e M. If <p: M — > M' has the property that mup C m M\ then (p{u) = JL x/v', 
where vj e M'. Composing \p: M/mM -» m M'/m 2 M' with the isomorphism above 
allows us to regard cp: M/mM -» (m/m 2 ) ®r M': 

<p: u + m M i-A ip(u) + m 2 M' = y Xj v\ + m 2 M' i-> y x* <E> v[. 

i i 
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Lemma 11.187. Let (R. m, k) be a noetherian local ring, and let x \,..., x s he a minimal 
generating set for m. Then the Koszul complex M(xi,..., x,). satisfies the conditions in 
Lemma 11.186. 

Proof. First, each term M p — f\ p (F) of the Koszul complex is a finitely generated 
free R-module. Second, define e: R —> A: to be the natural map. Since kere = m, the 
map e is an injection, by Lemma 11.183. For the third condition, recall the formula for 
D p : f\ p {F) —> f\ p ~ l (F) (where F is the free R-module with basis e \, ..., e s ): 

p 

D p (e il A • • • A e ip ) = Yf-iy-\e h A • • • A 7 ir A • • • A e ip . 

r=l 

The presence of the factor x r forces each term, and hence the sum, into m M p - \. 

Finally, if M p = /\ P (F) and M p _\ = f\ p ~ l (F), let us show that D p : M p /mM p -» 
mM p -i/m 2 M p -i, given by 

D p {u + m M p ) — D p (u) + xxcMp-i, 

is an injection. Recall that if e \,..., e s is a basis of the free module F, then a basis for 
M p = /\ P (F) is the set of all ej , where I = i\ < ■ ■ ■ < i p is an increasing p < s list and 
ej = e,-j A-■ ■ Ae,- . If u = aiej, we may assume that aj is defined for every increasing 
list I (some aj may be 0). Let us now define a/ for every, not necessarily increasing, 
p-tuple i\,... ,i p of indices, possibly with a repeated index: set a/ = 0 if some index is 
repeated, and set a/ = —a// if I’ is obtained from I by transposing two indices. With this 
notation, we may now rewrite the formula for D p : 

D p (u ) = DplYaid) 

I 

S 

= J2 x jJ2 oi i LeL ’ 

1=1 L 

where L — l\ < • • • < t p _ \ is an increasing p — 1 < s list, and jL — j, l\, .. l p -\. 

Note that oiji is either 0 or ±cy/, where I is the rearrangement of jL into an increasing 
list. Suppose now that u — otjei f m M p \ that is, u + m M p y 0 in M p /mM p . 
Since the ejs are a basis of M p , we must have some «/ j m; that is, a/ is a unit in R. If 
I — i\ < ■ ■ ■ < i p , define j — i\ and L — ij < ■ ■ ■ < i p . The coefficient aji — aj of 
does not lie in m, and so a j^ei y 0 in (because the e^s form a basis of M p _j). 

Under the isomorphism mM p -\/m 2 M p -\ —> (m/m 2 ) M p -\, 

S 

D p (u ) = Yh x *j ® Y. a jLeL , 

1=1 L 

where x* = xj + m. Since x *,..., x* is a basis of m/m 2 , an element j x* ® Vj = 0 if 
and only if each vj — 0. Therefore, if u </ m M p , then D p (u + m M p ) 0, and so D p is 
an injection. • 
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Proposition 11.188. If ( R , m, k) is a noetherian local ring of finite global dimension 
D(R), then 

/r(m) < D(R). 

Proof Let s = /x(m), and let {x\, ..., x s | be a minimal generating set of m. Then 
rank(M(xi...., x s ) p ) < rankfTor^ (k, k )), 

by Lemma 11.186. Now M(xi, ..., x s ) p — /\ P (F), where F is the free /?-module with 
basis e\, ..., e s , so that rank(M(xi,..., x s ) p ) = rank(/\ p (F)) = (*) and 

^ < rank(Tor^(k, k)). 

Therefore, 1 < rank(Torf (k, k)), so that Torf(k, k) f {0}. But Lemma 11.155 gives 
pd{k) — maxjp : Tor^(k, k) f {0}}, 
so that s < pd(k) — D(R), by Corollary 11.156. • 

Theorem 11.189 (Serre). A noetherian local ring R is regular if and only if D{R) is 
finite. 

Proof In Lemma 11.176, the theorem was reduced to checking two inequalities. These 
inequalities are proved in Corollary 11.182 and Proposition 11.188. • 

Corollary 11.190. If R is a regular local ring, and it p is a prime ideal in R, then Rp is 
also a regular local ring. 

Proof. Since p is a prime ideal, the localization R p is a local ring; it is noetherian because 
R is noetherian. In Proposition 11.154, we saw that I)(R) > D(R p ). Therefore, R p is a 
regular local ring, by Serre’s theorem. • 

We are now going to prove that every regular local ring is a UFD, and we begin with 
several elementary lemmas. 

Lemma 11.191. If R is a noetherian domain, then R is a UFD if and only if every prime 
ideal of height 1 is principal. 

Proof. Let R be a UFD, and let p be a prime ideal of height 1. If a e p is nonzero, then 
a — p j 1 • • • p e f , where the p, are irreducible and d > 1. Since p is prime, one of the 
factors, say, pj e p. Of course, Rp ; C p. But Rpj is a prime ideal, by Proposition 6.17, 
so that Rp j — p, because ht(p) = 1. 

Conversely, since R is noetherian. Lemma 6.18 shows that every nonzero nonunit in 
R is a product of irreducibles, and so Proposition 6.17 says that it suffices to prove, for 
every irreducible n e R, that Rjt is a prime ideal. Choose a prime ideal p that is minimal 
over Rjt. By the principal ideal theorem. Theorem 11.161, we have ht(p) = 1, and so the 
hypothesis gives p — Ra for some a e R. Therefore, it — ua for some u e R. Since tt is 
irreducible, we must have u a unit, and so Rjt = Ra — p, as desired. • 
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Lemma 11.192. Let R be a noetherian domain, let x € R be a nonzero element with Rx 
a prime ideal, and denote S~ l R by R x , where S = [x n : n > 0}. Then R is a UFD if and 
only if R x is a UFD. 

Proof. We leave necessity as an exercise for the reader. For sufficiency, assume that R x 
is a UFD. Let p be a prime ideal in R of height 1. If x e p, then Rx C p and, since 
ht(p) = 1, we have Rx — p (for Rx is prime), and so p is principal in this case. We 
may now assume that x f. p; that is, S Hi p = 0. It follows that p R x is a prime ideal 
in R x of height 1, and so it is principal, by hypothesis. There is some a e p and n > 0 
with p R x = R x (a/x n ) = R x a, for x is a unit in R x . We may assume that a ^ Rx. If 
a — a\x and a i f Rx, then replace a by a\, for R x a = R x a\. If fli = a 2 X and 02 f Rx, 
then replace a\ by a 2 . for R x a 1 = Rx®2- If this process does not stop, there are equations 
a m = a m+ \x for all in > 1, which give rise to an ascending sequence Ra\ C Raj c • • •. 
Since R is noetherian, Ra m = Ra m+ \ for some m. Hence, a m+ \ — ra m for some r e R, 
and a m = a m+ \x — ra m x. Since R is a domain, 1 = rx; thus, x is a unit, contradicting 
Rx being a prime (hence, proper) ideal. Clearly, Ra C p; we claim that Ra = p. If b e p, 
then b = (r/x m )a in R x , where r e R and m > 0. Hence, x m b = ra in R. Choose m 
minimal. If m > 0, then ra = x m b e Rx; since Rx is prime, either r e Rx or a e Rx. 
But a f. Rx since S (T p = 0, so that r = xr'. As R is a domain, this gives r'a = x m ~ l b, 
contradicting the minimality of m. We conclude that m — 0, and so p = Ra is principal. 
Lemma 11.191 now shows that R is a UFD. • 

The following elementary lemma is true when the localizing ideal is prime; however, 
we will use it only in the case the ideal is maximal. 

Lemma 11.193. Let R be a domain, and let I be a nonzero projective ideal in R. Ifm is 
a maximal ideal in R, then 

lm — R m . 

Proof. Since I is a projective R -module, I m is a projective /? m -module. As R m is a local 
ring, however, I m is a free Ajn-modulc. But 7 m is an ideal in a domain R m , and so it must 
be principal; that is, 7 m = R m . • 

Theorem 11.194 (Auslander-Buchsbaum). Every regular local ring R is a UFD. 

Proof. ( Kaplansky) The proof is by induction on the Krull dimension dim(/(), the cases 
n = 0 (R is a field) and n = 1 (R is a DVR) being obvious (see Exercise 11.78 on 
page 1012). For the inductive step, choose x e m — m 2 . By Lemma 11.171, R/Rx is a 
regular local ring with dim(7?/7?x) < dim(R); by Proposition 11.172, R/Rx is a domain, 
and so Rx is a prime ideal. It suffices, by Lemma 11.192, to prove that R x is a UFD (where 
R x — S~ l R for S = {x n : n > 0}). Let ')3 he a prime ideal of height 1 in R x ; we must 
show that is principal. Define p = n R (since R is a domain, R x C Frac(R), so 
that the intersection makes sense). Since R is a regular local ring, D(R) < 00 , and so the 
/Lmodule p has a free resolution of finite length: 

0 -» F n —> F n -1 ->-► To -> p 0. 
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Tensoring by R x , which is a flat //-module (Theorem 11.28), gives a free /?*-resolution of 
(for = R x p): 


K K -1 


K 


o, 


(9) 


where Fj = R x <2 >r F,. 

We claim that (p is projective. By Proposition 11.154, it suffices to show that every 
localization (Pa^ is projective, where 911 is a maximal ideal in R x . Now (R x )<m is a local¬ 
ization of R, and so it is a regular local ring, by Corollary 11.190; its dimension is smaller 
than D(R ), and so it is a UFD, by induction. Now being a height 1 prime ideal in the 
UFD ( R x )<m , is principal. But principal ideals in a domain are free, hence projective, and 
so iPait is projective. Therefore, is projective. 

The exact sequence (9) “factors” into split short exact sequences. Since (p is projective, 
we have FL = *P © Co, where Co = ker(F , Q —> *P). Thus, Co is projective, being a 
summand of a free module, and so /-'( = 12 1 © Co, where C i = kerf /-)' -> /-)'). More 
generally, F{ = C; © C,_i for all i > 1. Hence, 

Fq © F[ © • • • © F' n = (<p © C 0 ) © (Ci © C 0 ) © • • • . 

Since projective modules over a local ring are free, we see that there are finitely generated 
free R x -modules Q and Q' with 

Q = <P© Q'. 

Recall that rank(2) = dim/f (K <S)r, Q), where K = Frac( R x )\ now rank (ip) = 1 and 
rank(Q') = r, say, so that rank(g) = r + 1. 

We must still show that 'jj is principal. Now 

A cg) = A w®g>- 

Since Q is free of rank r+ 1, Theorem 9.140, the binomial theorem, gives /\' +l (Q) = R x . 
On the other hand. Theorem 9.143 gives 

A r+1 ^®e') = A ' +1 ”'(e'))- ao) 

!= 0 

We claim that /\'((P) = {0} for all i > 1. By Lemma 11.193, we have = (R.x)tm for 
every maximal ideal 9 71 in R x . Now Exercise 11.24 on page 921 gives 

(^ m )m = A'^an) = A '((«x)tm) 

for all maximal ideals 971 and all i. But /\' ((R x )m) — {0} for all i > 1 (by the binomial 
theorem or by the simpler Corollary 9.138), so that Proposition 11.31 gives /\' (<P) = {0} 
for all i > 1. 
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We have just seen that most of the terms in (10) are {0}; what survives is: 

A' +1 (<p® e') = (A° ( ^ A' +1 (2')) ® (A 1 w ®** A' ( e'))- 

But /\ r+l (Q') — {0} and /\' (Q') = R x , because Q' is free of rank r. Therefore, 

A r+1 0P© GO = V- Si nce = A^'^ © Q') = K + \Q) = Rx, we have qj = R x is 
principal. Thus, R x , and hence R, is a UFD. • 

Having studied localization, we turn, briefly, to globalization, merely describing its set¬ 
ting. To a given commutative noetherian ring R, we have associated a family of local rings 
Rp, one for each prime ideal p. Local rings are simpler than general rings; for example, if 
R is a Dedekind ring, its localizations are all principal ideal domains. Globalization asks 
how we can make use of all the localizations to gather information. Consider the disjoint 
union 

E(R)= IJ R P . 

peSpec (R) 

We call Rp the stalk of R over p, and we define the projection tc : E(R) -> Spec (A 1 ) by 
jt(e) = p if e e R p \ that is, it sends each point in the stalk over p into p. 8 Each element 
a e R defines a function s a : Specff?) —> E(R) by 

s a : p i—^ a/1 €= Rp. 

Note that ns a = 1 spec: R)- We claim that distinct elements a,b e R give different func¬ 
tions; that is, if s a — Sb , then a = b. Let / = R(a — b). If (a — b)/\ = 0 in Rp for every 
prime ideal p, then Ip = {0} for every p. Proposition 11.31 applies to show that a — b (in 
fact, this proposition only needs 7 m = {0} for all maximal ideals m). Thus, we can regard 
the elements of any commutative ring R as E( A)-valued functions on Spec(77). 

Consider the question, given f e Rp and g e R q . whether there exists a e R with 
f — a/I e Rp and g — a/1 e Rql That is, is there a e R with s a (p) = / and s fl (q) = gl 
A “good” answer might be if p and q are “close” to each other, then such an element a e R 
exists. This suggests that a topology on X — Spcc( R) may be of interest, and the Zariski 
topology is a good candidate (a subset F C Spec(7?) is closed if q e F implies p e F 
whenever p is a prime ideal with q C p). Of course, once we decide on a topology for 
Speci A), we expect that E(R) should also be topologized, and that interesting functions 
should be continuous. 

The Zariski topology is much different from the topology on euclidean space. Not 
only is it not a metric space (that is, no distance between two points is defined), one-point 
subsets need not be closed sets; for example, {p} is closed if and only if p is a maximal 
ideal. In spite of this, continuity of a function /: X —»• Y can still be defined: / is 
continuous if f~ l (V ) is an open subset of X for every open subset V / C Y. Equivalently, 
/ is continuous if and only if, for every closed subset C in Y. the subset is a 

^The're is a possible source of confusion, for p can be viewed in two ways: as a prime ideal—a subset of 
A; as a point in Spec! R). To distinguish these viewpoints, we often write p r to denote p viewed as a point of 
X = Sped R). Thus, the projection n is defined by 7i(e) = p T for all e e Rp. 
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closed subset of X. Similarly, we can define the notions of compactness (if [Fj : i e / is 
a family of closed subsets, then there are finitely many of them, say, Fj n such that 

p| /s/ Fj = n"=i Pij) an d connectedness (not the union of two nonempty disjoint closed 
subsets) for arbitrary topological spaces. 

Let us mention an elementary gluing result. Suppose that X and Y are topological 
spaces, that {Uj : i e 1} is a family of open subsets of X, and that {/,: f/; -> Y} is a 
family of continuous functions. If the functions agree on overlaps; that is, if f |(t/,- HU j) = 
fj\(Uj fl Uj) for all i, j e /, then there is a unique continuous function /; [_J ( f/; -> Y 
with f\Ut — fi for all i. Does this remind you of a direct limit? 

Here is the formal definition of a sheaf from this viewpoint; there is also a second, 
equivalent, version using presheaves (introduced in Chapter 7) that we will describe after¬ 
ward. 

Definition. If E and X be topological spaces, and let it: E —> X be a continuous surjec¬ 
tion. Then T — (E, X. n) is a sheaf of abelian groups if 

(i) 7r is a local homeomorphism : for each e e E, there is an open set U containing e 
such that n(U) is an open subset of X and n\U is a homeomorphism 9 from U to 
n(U). 

(ii) The subsets T x — n~ l (x) of E, for x e X , are called the stalks of T , and each T x 
is an abelian group. 

(iii) If E + E is the subset of E x E consisting of all (a, b) with n(a) — nib), then the 
maps £’ + £’-»£’, given by (a, b) a + b and (a, b) i->- a — b , are continuous. 

Ignoring the algebraic structure on the stalks, the reader may recognize the basic ingre¬ 
dients present in covering spaces, for example. 

Recalling the functions s a : Spec (R) —> E(R) mentioned above, we see that the fol¬ 
lowing notion is of interest. 

Definition. If T — (E, X, n) is a sheaf of abelian groups, and if U is an open subset of 
X, then a section over U is a continuous function s: U -* E with ns = 1;/. We write 

r(U, T) — {all sections s: U — > £}. 

A global section is a section in T (X, T). 

It is easy to check that T(f/, T) is an abelian group. If V C U are open subsets of X, 
then there is a restriction map 


Py : T(f/, T) -> T(V, T), 

given by s i-> s|V. Moreover, the functions py are homomorphisms. 

9 A homeomorphism is a bijection f:X—>Y, where X and Y are topological spaces, such that both / and 
f ~ 1 are continuous. 
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For fixed x e X, let 


I(x) = {open sets U C X containing x}. 

It is easy to see that I (x) is a partially ordered set under reverse inclusion: 

U < V means U 3 V. 

In fact, I (x) is a directed index set, for given U,VeI (x ), then UDV e I(x),UnV C U, 
and 1/ny C V; that is, U < U H V and V < U fl V. We can recapture the stalks from 
the sections. Let T — ( E , X , n) be a sheaf of abelian groups. For each x e X, 

T x = lim r (u,r>. 

Uel(x) 

If X is a topological space, then the family of its open sets, with inclusion maps of 
subsets as morphisms, is a category; denote it by Open(X). We may now define a presheaf 
of abelian groups. In fact, there are presheaves with values in any category, say, modules 
or commutative rings, not just Ab. 

Definition. If A is a topological space and C is a category, then a presheaf over X with 
values in C is a contravariant functor T ; Open! X) -> C. 

A sheaf can be reconstructed from its presheaf of sections if we further assume a version 
of the gluing result mentioned above. 

Given a commutative ring R, sheaves of /^-modules are defined whose stalks are R p - 
modules for p e Spcc( R ). These form an abelian category with enough injectives; that is, 
every sheaf can be imbedded as a subsheaf of an injective sheaf. Moreover, global sections 
r(A, ) is a left exact functor, and cohomology of sheaves is defined as derived functors of 
r(A, ). These cohomology groups provide the most important method of globalizing. We 
recommend the article by J.-P. Serre, Faisceaux Algebriques Coherents, Annals of Math. 
(61) 1955, pages 197-278, for a lucid discussion. 

Exercises 

11.75 Let R = k[x, y, z], where k is a field. 

(i) Prove that x. y(l — x), z(l — x) is an ff-sequence. 

(ii) Prove that y(l — x), z(l — x), x is not an f?-sequence. 

11.76 Let R be a commutative ring. Prove that a prime ideal p in R is minimal over an ideal / if and 
only if ht(p//) = 0 in R/I. 

11.77 If (R, m, k ) is a noetherian local ring, and if B is a finitely generated f?-module, prove that 


depth(B) = min{; : Ext'^(k, B) {0}}. 
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11.78 Let R be a regular local ring. 

(i) Prove that R is a field if and only if dim(R) = 0. 

(ii) Prove that R is a DVR if and only if dirn(R) = 1. 

11.79 (i) Let (R, m) be a noetherian local ring, and let a e R be a regular element; i.e., x is not a 

zero divisor. If x\ + (a),..., x s + (a) is an (R/(A))-sequence, prove that x, x \, ..., x s 
is an R-sequence. 

(ii) Let R be a commutative ring. If jcj, ..., aj is an R-sequence, prove that the cosets 
x 2 + (jci), ... ,xj + (x \) form an (R j(x\ ((-sequence. 

11.80 Let R be a noetherian (commutative) ring with Jacobson radical J = J(R). If 6 is a finitely 
generated R-module, prove that 

n 

n> 1 

Conclude that if (R, m) is a noetherian local ring, then fln>l tn n B = {0}. 

Hint. Let D = fl/ol J n B, observe that JD = D, and use Nakayama’s lemma. 

11.81 Use the Rees lemma to prove a weaker version of Proposition 11.175: If (R, m) is a regular 
local ring, then D(R ) > /i(m) = dim(R). 

Hint. If Ext^(k, R) ^ {0), then D(R) > d — 1; that is, D(R ) > cl. 

Let m = (x\, ..., xj), where x\, ..., Xd is an R-sequence. Then 

Extort, R) = Ext d R -} X]) (k. R/(x i)) 

= Ext R/(" B/( x l , x 2)) = ■■■ 

= Ext^ik. k) = Homilk, k) = k £ (0). 

11.82 Let R be a commutative ring, let M be a finitely generated R-module, and let I be an ideal 
such that IM ^ M. 

(i) If A ], x 2 , ..., x n is an M-sequence contained in /, prove that 

(x i) C (. xi , X 2 ) £ • • • C (x\ -- x n ). 

(ii) If R is noetherian, prove that there is a longest M-sequence contained in I. 

11.83 If k is a field, prove that k[[x\ .a,,]] is noetherian. 

Hint. Define the order o(f) of a nonzero formal power series / = (/q, f\, fx ....) to be 
the smallest n with f n ^ 0. Find a proof similar to that of the Hilbert basis theorem. (See 
Zariski-Samuel, Commutative Algebra II, page 138.) 

11.84 If k is a field, prove that the ring of formal power series L[[ai.a,,]] is a UFD. 



Appendix 

The Axiom of Choice and Zorn’s 
Lemma 


Nowadays, most mathematicians accept the axiomatization ZFC of set theory, due to E. 
Zermelo and A. Fraenkel; the letter C abbreviates choice. Using consequences of this 
axiomatization, we will prove the equivalence of the axiom of choice, the well-ordering 
principle, and Zorn’s lemma. Let us begin by recalling some definitions from Chapter 6. 

Definition. If A is a set, let V(A)* denote the family of all its nonempty subsets. The 
axiom of choice states that if A is a nonempty set, then there exists a function ft : V(A ) # —> 
A with p(S) e S for every nonempty subset S of A. Such a function ft is called a choice 
function. 

Informally, the axiom of choice is a harmless looking statement; it says that we can 
simultaneously choose one element from each nonempty subset of a set. We now show 
that the axiom of choice is equivalent to a statement we would hate to be false. 

Proposition A.l. The axiom of choice holds if and only if the cartesian product fl (S / A; 
of nonempty sets is itself nonempty ft 

Proof. Let us assume the axiom of choice. Recall that an element of n,- 6 / A, is an /-tuple 
A' = (xi ) with Xi e Xj for all i e I. Now an /-tuple is really a function 

/: /-> A = {JXi 
iel 

with f(i) = xi e Xi for all i e /. Define <p : / -* V(A) # by <p(i) = Xj. If ft \ V(A) # -> 
A is a choice function, then the composite / = ft o <p\ I -> A satisfies /(;) = ft(<p(i)) — 
ft(Xj) e Xj , and so it is an element of n,e/ A/. Therefore, the cartesian product is 
nonempty. 

Conversely, let A be a nonempty set. Define I = V(A) # , and consider rise/ By 
hypothesis, this product is nonempty, and so it contains an element ft. where ft(S) e S 

1 By definition, a set X is nonempty if there is an element x € X. If X\ / 0 and Xi A t2, then there is 
x\ € Xi and xi € Xi. and hence (,V|, * 2 ) G V] x Xi\ that is, Xj x AS A 0. More generally, we can prove, by 
induction, that if the index set I = {1, 2,.... «} is finite, then Jlte/ -C — X ] x • • ■ x X,, / 0. Thus, the axiom 
of choice is significant only when the index set I is infinite. 
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for all S e V(A) # : that is, p is a choice function for A. Therefore, the axiom of choice 
holds. • 

There are various equivalent forms of the axiom of choice that are more convenient to 
use, the most popular of which are the well-ordering principle and Zorn’s lemma, which we 
state after some preliminary definitions. Most mathematicians accept the axiom of choice 
(as do we), and so they also accept these equivalent forms as well. 

Recall that a set A is a partially ordered set if there is a relation x < y defined on X 
that is reflexive, antisymmetric, and transitive. 

If it is necessary to display the ordering relation, we may also say that ( X, <) is a 
partially ordered set. 

Definition. A partially ordered set X is well-ordered if every nonempty subset S of X 
contains a smallest element, that is, there is ,vo € S with 

sq < s for all s e S. 

A partially ordered set A is a chain if any two elements are comparable; that is, for all 
x, y e X, either x < y or y ■< x. 

Example A.2. 

(i) The least integer axiom in Chapter 1 says that N, the natural numbers, is well-ordered. 
More generally, N" equipped with a monomial order, defined in Chapter 6, is a well- 
ordered set. 

(ii) The empty set 0 is well-ordered; otherwise, 0 would contain a nonempty subset (with¬ 
out a smallest element), and this is a contradiction. 

(iii) The integers Z is not well-ordered, for there is no smallest integer. 

(iv) The subset X of Q, defined by 

X = {l-£:n>l} U {2—i;n>l} 

is well-ordered. Note that 1 = 2 — y has infinitely many predecessors. 

(v) Let A be a well-ordered set. An element r e A is a top element if there is no a e X 
with r -< a. If a e X is not the top element of A (should one exist), then A“ = \fi e A : 
a -< P) 0, and so it has a smallest element a', called the successor of a. The successor 
a' is the “next” element after or. formally, a < ol' and there is no p e X with a < p < a' 
(if there were such a p, then P € A“ and so a' < P). An element p e A is a limit if it 
is not a successor; that is, there is no a e X with p — a'. The smallest element is A is a 
limit; in part (iv), we saw that A = {1 — ^:n>l} U {2 — ^ : n > 1} is well-ordered, 
and it is clear that 1 = 2 — j is a limit in A. Thus, every element in A is either a successor 
or a limit. ◄ 


Here are some basic properties of well-ordered sets. 
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Proposition A.3. 

(i) Every subset Y of a well-ordered set X is itself well-ordered. 

(ii) Let X be a well-ordered set. If x, y € X, then either x < y or y < x. 

(iii) If X is a well-ordered set, then every strictly decreasing sequence X\ > X 2 > • • • in 
X is finite. 

(iv) Assuming the axiom of choice, the converse of part (iii) is true. If X is a chain in 
which every strictly decreasing sequence xi > X 2 > ■ • • in X is finite, then X is 
well-ordered. 


Proof, (i) If S is a nonempty subset of Y, then it is also a subset of X and, as any nonempty 
subset of X, it contains a smallest element. Therefore, Y is well-ordered. 

(ii) The subset S = {x, y} has a smallest element, which is either x or y. In the first case, 
x < y, and in the second case, y < x. 

(iii) If X is well-ordered, then S — { x \, xi , ...} has a smallest element, say, x;; that is, 
x„ > Xi for all n > 1. In particular, if n — i + 1, then x,+t >; x,-, which contradicts 

Xj > Xj+1. 

(iv) Assume that there exists a nonempty subset S of X that has no smallest element. 
Choose sq e S\ since .so is not smallest, it is not true that ,vo < s for all s e S. Thus, 
either there exists si e S with ,vo >- ,v ] or there is s e S with s 0 and s not comparable; 
the latter cannot occur because A is a chain. Similarly, there is *2 £ S with ,V| >- .sx. By 
induction, for all n > 0, there are elements s, e S with so >- si • ■ • >- s n > s„+i. We want 
to assemble these infinitely many choices, one for each n, into one descending sequence 2 ; 
that is, we want a function /; N —> S with fin) = s„. Here is the formal way to do this. 
Let T be the family of all functions g from all initial segments {0, 1, ..., n} —> S, and let 
fi be a choice function on IF: that is, f (T ) e T for every nonempty subset T C. IF. We use 
f to construct the desired sequence. Choose an element so e S , which is possible because 
S 0. Define Fq — {g e F : domain(g) = {0} and g(0) — so}) (there is only one g in 
To), and define go = P(Fq) . For n > 0, we know, by induction, that F n +\ fi: 0, where 

F „+1 = {g: {0, 1, ...,n + 1} ->■ X : g|{0,= g„ andg(n) >- gin + 1)}. 

Therefore, we may define g„+\ = piF n+ \). Finally, define g* to be the union of the g„; 
that is, g*in) = g„(n) for all n. The function g* is a strictly descending sequence in S, and 
this contradicts the hypothesis that every strictly decreasing sequence in S is finite. • 


Well-ordering Principle. Every set X has some well-ordering of its elements. 

- We have already done this, without comment, in Proposition 6.38, when we showed that ACC implies the 
maximum condition. Actually, the proof only uses a weaker form of the axiom of choice in which the index set 
is countable. 
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If X happens to be a partially ordered set, then a well-ordering, whose existence is 
asserted by the well-ordering principle, may have nothing to do with the original partial 
ordering. For example, Z can be well-ordered: 

0 ^ 1 ^ -l <2< -2 ;<•••. 

That N is well-ordered is just another way of stating mathematical induction. Thus, 
the well-ordering principle suggests the possibility of a generalized induction that applies 
to a collection of statements indexed by a well-ordered set of any, possibly uncountable, 
cardinality. Such a generalization does, in fact, exist, and it is called transfinite induction. 
Let {5(a) : a e 1} be a family of statements indexed by a well-ordered set I. If ao is the 
smallest index in /, then the base step is the statement that 5(ao) is true. The inductive 
step is the statement that if f is an index and 5(a) is true for all a -< f, then 5(/S) is 
true. Transfinite induction says that if the base step and the inductive step hold, then all 
the statements 5(a) are true. (Often, the proof of the inductive step splits into two cases, 
depending on whether f is a successor or a limit). Here is a surprising use of transfinite 
induction: There exists a subset Q of the plane that intersects every straight line in exactly 
two points. The idea of the proof is to construct Q by well-ordering the set of all the lines 
in the plane in such a way that every line has only countably many predecessors. Now, 
from each line in turn, judiciously select at most two of its points to put into Q. 

We will be able to state Zorn’s lemma after the following definitions. 

Definition. Let A be a partially ordered set. An upper bound of a subset 5 of A is an 
element x e X, not necessarily in 5, such that 

s < x for all s e S. 

An element m e X is a maximal element if there is no x e X for which in < x\ that is, if 
x e X and if m <x, then in = x. 

A partially ordered set may have no maximal elements: for example, R, with its usual 
ordering, is a chain having no maximal elements. A partially ordered set may have many 
maximal elements: for example, if X is the partially ordered set of all the proper subsets of 
a set U, then a subset 5 is a maximal element if and only if 5 = U — {u} for some u e U\ 
that is, 5 is the complement of a point. 

Zorn’s lemma is a criterion that guarantees the existence of maximal elements. 

Zorn’s lemma. If X is a nonempty partially ordered set in which every chain has an 
upper bound, then X has a maximal element. 

Theorem A.4. The following statements are equivalent. 

(i) Zorn’s lemma. 

(ii) The well-ordering principle. 

(iii) The axiom of choice. 
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We split Theorem A.4 into three separate theorems. Let us begin with a definition and 
a lemma. 

Definition. If A is a well-ordered set and c e X, then the open segment Seg(c) is the 
subset 

Seg(c) = {x e X : x -< c}. 

The next result supplements Proposition A.3. 

Lemma A.5. A chain X is well-ordered if and only if every open segment of X is well- 
ordered. 

Proof Necessity is obvious, for every subset of a well-ordered set is well-ordered. Con¬ 
versely, let S be a nonempty subset of A. Of course, if S is a singleton, then it contains 
a smallest element, and so we may assume that S contains at least two elements, say, c' 
and c. Since A is a chain, we may assume that c' < c. Hence, Seg(c) IT S ^ 0; as every 
nonempty subset of a well-ordered set is well-ordered, there is a smallest element, say, z, 
in Seg(c) 0 S. Now z is the smallest element in S , for if there is s' e S with s' < z. then 
s' e Seg(c) 0 .S', contradicting z being the smallest element in Seg(c) 0 S. Therefore, A is 
well-ordered. • 

In general, an ascending union of well-ordered subsets of a partially ordered set need 
not be well-ordered. For example, it is easy to see that for every positive integer n, the 
subset 

S„ — {m el : m > — n} 

is a well-ordered subset of Z, but S n — Z is not well-ordered. 

With an extra assumption, we can force a union of well-ordered subsets to be well- 
ordered. 

Notation. If B and C are subsets of a partially ordered set A, then we write 

B < C 

if either B — C or B is an open segment of C; that is, there exists c e C with B — Seg(c). 

Lemma A.6. Let (A, <) be a partially ordered set, and let {5/ : i e 1} be a family 
of well-ordered subsets of X indexed by some set I. If, for each i , j, either Si < Sj or 
S j < Si, then U/e/ $ is a well-ordered subset of X. 

Proof Let U = U; A• By Lemma A.5, it suffices to show that any open segment Seg(c), 
where c e U, is well-ordered. Now c e Si for some i ; since Si is well-ordered, so is any 
of its subsets; thus, it suffices to show that Seg(c) = [x e U : u -< c} c 5,; that is, if 
u -< c, we must show that u e Sj. Now u e Sj for some j. If Sj < Sj, then u e Si and we 
are done. If Sj < Sj, then .S', C Sj, so that c e Sj ; moreover, since .S', is an open segment 
of Sj, u < c implies u e Sj, as desired. • 
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Definition. A subset A of a well-ordered set (A, ;<) is closed in A if A / 0 and if x < a, 
where x e X and a e A, implies x e A. (Thus, if A is closed and a e ,4, then A contains 
everything smaller than a as well.) 

Given a well-ordered set A and c e A, it is obvious that the “closed segment’’ 

A = {x e A : x < c] 

is a closed subset. If c is the top element of A (should such exist), then A = A; if c is not a 
top element, then it has a successor c' , and A — Seg(c'). Thus, closed segments are closed 
subsets, but they are nothing new. 

Lemma A.7. If ( A, <) is a well-ordered set, then A is closed in X if and only if A < A; 
that is, either A = X or A — Seg (c) for some c e A. 

Proof It is clear that open segments are closed, and so only necessity needs proof. 

Assume that A is closed; we must show that if A ^ A, then A is an open segment. Since 
A is well-ordered, there is a smallest element c e X — A, and we claim that A — Seg(c). 
If there is some a e A with c < a, then c e A, because A is closed, and this contradicts 
c ^ A. Therefore, a -< c for all a e A (we are using the fact that well-ordered sets are 
chains); that is, c is an upper bound of A, and so A C Seg(c). For the reverse inclusion, 
suppose that x e Seg(c); that is, x -< c. If x ^ A, then x e A — A, and so c < x, a 
contradiction. • 

Here is the first step of Theorem A.4. . 

Theorem A.8. If Zorn’s lemma holds, then the well-ordering principle holds : every set 
X can be well-ordered. 

Proof Since 0 is well-ordered, we may assume that A / 0. Let C be the family of all 
the well-ordered subsets of A; more precisely, an element of C is an ordered pair ( S , 0) 
consisting of a subset S of A together with some well-ordering g of it. Thus, a subset S of 
A may appear several times in £, equipped with different well-orderings. We now make £ 
into a partially ordered set. Define 


(, s , g) ^ ( s g') 

to mean either (i) S — S' and g = g' or (ii) S C S', the orderings coincide on S (that is, if 
a, b e S, then a C b holds if and only if« c' h holds), and S is an open segment of S'. 

We now show that £ satisfies the hypothesis of Zorn’s lemma. Note that £ 0, for 

any 1-point subset is a well-ordered subset, and so it gives an element of £. Let C = 
{(Si, £ii) ■ i e /} be a chain in £; that is, for each i, j, either (5,-, g,) < (Sj, g j) or 
( Sj , g ; ) < (Si, g,). Define U = (J, 5/, and define a partial order g on U as follows. 
If u, v e U , then there are indices i and j with u e S, and v e Sj; we may assume 
that (Si, g,■) < (Sj, g j), so that both u, v e Sj, and we define u g u if u g/ v. This 
definition does not depend on the choice of indices, for if there are indices k and l with 
u e Sk and v e Si, then (Sk, g k) £ (Si, g £), say, and u g^ v is a competing definition. 
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But (Sj, c ; ) ^ (Si, and so u tZj v if and only if u v. It is now routine to prove 
that (U , C) is a partially ordered set. In fact, (U, C) is well-ordered, by Lemma A.6, and 
so ( U , C) g C. Furthermore, we claim that each (Si, C ( ) is closed in (U, C). Suppose 
that u c si, where s,- e S, and u e U. Now u e S / for some j ; if (Sj, c ; ) < (Sj, c,-), 
then u e S) as desired; if (Sj , c,-) < (Sj, Cy), then S, is closed in Sj, and so u e Sj. 
Lemma A.7 now gives (Si, C,-) ■< (U, C) for all i; that is, (U, C) is an upper bound of C. 

By Zorn’s lemma, C has a maximal element, say (M, <). If M contains every element 
of X, then X can be well-ordered. If there is some x e X with x^M, then define a well¬ 
ordering <' of MU{x} extending the given well-ordering of M by defining m < x for every 
m e M. Since M — Seg(x) in M U {x}, we have ( M , <) -< (M U {x}, <'), contradicting 
the maximality of (M, <). Therefore, M — X and X can be well-ordered. • 

Almost all proofs involving Zorn’s lemma have the same format: define an appropriate 
nonempty partially ordered set; show that its chains have upper bounds; show that a max¬ 
imal element, whose existence is guaranteed by Zorn’s lemma, can be used to prove the 
theorem (this last step is usually an indirect proof). 

Here is a small comment about the axiom of choice before we present the second step in 
the proof of Theorem A.4. Given sets A and X, one way to define a function /; A -* X is 
to specify its values. For example, there exists a function /; N —> N with f(n) = n +1 for 
each n e N. Not every function is given by a formula, however, and the axiom of choice 
deals with the problem of when a function is actually defined. If {G a : a e A} is a family 
of groups, then we may define a choice function /: A —»■ |J fl G a by f(a) = l a , where l a 
is the identity element of G„; we do not need the axiom of choice to define /. In contrast, 
if we merely “choose” some element x a e G a , then the “function” /z: A -> |J G a with 
h (a) = x a is not well-defined. Such a “function” h ought not be a bona fide function. How 
could one possibly detect any properties of /z; for example, is h an injection? 

Theorem A.9. The well-ordering principle implies the axiom of choice. 

Proof. Let A be a nonempty set. We may assume that A has some well-ordering of its 
elements, and it follows that every nonempty subset of A is also well-ordered. Define a 
choice function ft : V(A) # —> A by defining, for each nonempty subset S of A, f(S) to be 
the smallest element of 5. • 

Daniel Grayson has shown me an elegant proof that the axiom of choice implies Zorn’s 
lemma; it is a variant of a proof of E. Zermelo in 1904, as adapted by H. Kneser in 1950. 

Theorem A.10. The axiom of choice implies Zorn’s lemma. 

Proof. Assume that X has no maximal elements. If A is a well-ordered subset of X, then 
A is a chain, and hence A has an upper bound, say x. Since x is not a maximal element, 
there exists y e X with x < y, it follows that every well-ordered subset A has an upper 
bound that is not in A. Let W denote the family of all the well-ordered subsets of X. For 
each A e W, define 


Ua — {all upper bounds u of A with u <£ A}; 



A-8 


The Axiom of Choice and Zorn’s Lemma Appendix 


each Ua ^ 0, by hypothesis, and so Proposition A.l says there is some g in riAeW • 
Thus, for all A e VV, we have g(A) an upper bound of A and g (A) A. 

We use g to construct some special well-ordered subsets. Define an element co e X by 
cq = g(0). Call a well-ordered subset C of X a g-set if c is the upper bound of C fl Seg(c) 
chosen by g; 3 that is, co e C and c = g(C fl Seg(c)) for every c e C. 

We are going to show that the union of all the g-sets is itself a g-set, and this will then 
be shown to give a contradiction. 

If C and D are g-sets, we claim that either C < D or D < C. Define W to be the union 
of all those subsets B with B < C and B < I). We claim that W < C and W < D\ that is, 
W is closed in C and in D. Take w e VV; this element got into W because it lies in some 
B , where B < C and B < D. If c e C and c < w, then c e B (because B is closed in C). 
Hence, c e B C W (for W is, by definition, the union of all such subsets B ). Therefore, W 
is closed in C. Similarly, W is closed in D. If either W — C or W = D, then the claim is 
true. Hence, we may assume that W < C [so that W — C C 1 Seg(c') for some c' e C — VV'|, 
and W < D [so that W — D fl Seg (d') for some d' e D — W]. Since C and D are g-sets, 
d = g(C fl Seg(c')) = g(W) and d' = g(D fl Seg(c/')) = g(W). Therefore, d = d'. 
But now W U \c'} — W U \c'} is closed in C and in D, for it is a closed interval. Thus, 
W U \c'} C VV, contradicting c' ^ W. Therefore, either W — C or W — D\ that is, either 
C < D or D < C, as claimed. 

Finally, let Q be the union of all the g-sets. The just-established claim shows that the 
hypothesis of Lemma A.6 is satisfied, and so £2 is a well-ordered subset. Let us show that 
Q is itself a g-set. Ifc e Q, then there is some g-set C containing c, and c = g(CflSeg(c)). 
But C fl, by Lemma A.7 and the fact just proved above that either C fj C or Q. fj C; 
hence, C fl Seg(c) = f2 (T Seg(c). Therefore, c = g(fl IT Seg(c)), and f2 is a g-set. On the 
other hand, fT = f2 U {g(Q)) is a g-set not contained in Q, and this is a contradiction. We 
conclude that no such function g can exist, and hence that X has a maximal element. • 

It appears that we have used a weaker hypothesis than that of Zorn’s lemma: only well- 
ordered subsets need upper bounds. However, it is shown in Exercise 6.45 on page 374 
that every chain C in a partially ordered set contains a well-ordered subset W such that C 
and W have the same upper bounds. Hence, if all well-ordered subsets have upper bounds, 
then all chains have upper bounds as well. 

If B = C, then W — C; otherwise, B is an open segment of C, and so there is b e C 
with B — Seg(/?). If B' is also a proper subset of C, then B' — Seg(7/). We may assume 
that b < //, and so B < B'. It now follows from Lemma A.7 that W < C. 


T:ach of the following sets are g-sets. If c\ = g({co)). define C 2 = g({r<)■ cj}), and, by induction, = 
g({cQ, ... ,c n )). Note that cq -< ci -< C 2 -< • • •. Each subset {cq, ci, ..., c n ) is a g-set. There are infinite g-sets 
as well. For example, if C' = { c n : n e N}, let c' = g(C' ), and define C" = C' U jc'}. 
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homogeneous ideal, 715 
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homology groups of G, 870 
homomorphism 

R-homomorphism, 424 
algebra, 541 
commutative ring, 143 
graded algebra, 715 
group, 73 
conjugation, 77 
natural map, 85 
Lie algebra, 776 
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ring, 525 
semigroup, 300 
homotopic, 820 
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Houston, E., 235 
Hurewicz, W„ 435, 870 
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finitely generated, 341 
generated by X, 341 
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invertible, 950 
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Lie algebra, 776 
maximal, 322 
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monomial, 410 
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primary, 391 
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right, 524 
two-sided, 524 
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image 
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429 

ring homomorphism, 525 
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1-8 


INDEX 


induced module, 624, 887 
induced representation, 624 
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infinite-dimensional, 163 
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irreducible module, 534 
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Lie’s theorem, 778 
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linear representation, 607 
linear transformation, 171 
nonsingular, 171 
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linearly independent, 164 
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Luigi Ferrari, 209 
Liiroth’s theorem, 359 
Liiroth, J., 359 


lying over, 927 
Lyndon, R. C„ 897 
Lyndon-Hochschild-Serre, 
897 

M-regular sequence, 993 
Mac Lane, S„ 373, 461,717, 
871 

Mann, A., 105 
mapping problem 
universal, 449 
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generator, 601 
injective, 480 
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left, 424, 525 
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simple, 431, 534 
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torsion, 647 
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homomorphism, 300 
monomial ideal, 410 
monomial order, 402 
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Morita theory, 513, 603 
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multiplicity, 140 

Nagata, M., 781 
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Noether, E„ 85, 200, 342, 
393, 734, 739 
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left, 542 

nondegenerate, 698 
nonsingular 
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matrix, 54 
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algebraic integer, 335 
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normal extension, 211 
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decomposition, 395 
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normal subgroup, 76 
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Nullstellensatz, 386, 937 
weak, 385, 937 
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order ideal, 646 
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orthogonal group, 708 
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p-adic integers, 503 
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p-group, 104, 106, 112, 276 
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pairing, 575 
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parity, 48 
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discrete, 499 
monomial, 402 
partially ordered set, 226 
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disjoint, 42 
even, 48 
odd, 48, 49 
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transposition, 41 
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Picard, E., 968 
PID. 147 

Poincare, H., 782, 783 
pointwise operations, 120 
polar coordinates, 15 
polar decomposition, 15 
Polya, G., 112 
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polynomial, 126, 128 
associated reduced 
polynomial, 239 
cyclotomic, 20 
function, 377 
general, 192 
leading coefficient, 21 
monic, 21, 128 
separable, 201 
zero, 126 

polynomial function, 129, 377 
polynomial identity, 725 
polynomial ring 
noncommuting variables, 
724 

polynomials, 127 
n variables, 129 
noncommuting variables, 
724 
skew, 521 

Ponomarev, V. A., 572 
Pontrjagin duality, 488 
Pontrjagin, L. S., 488 
power series, 130, 518, 994 
powers, 55 

pre-additive category, 445 
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group, 306 
module, 473 

preserves multiplications, 835 
pre sheaf, 519 

primary component, 256, 652 
primary decomposition, 393, 
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irredundant, 394 
normal, 395 
primary ideal, 391 
prime field, 184 
prime ideal, 321 
associated, 394, 997 
minimal, 374 
minimal over ideal, 396 
prime integer, 1 
primitive element, 134 
theorem, 230 
primitive polynomial, 331 
associated, 332 


primitive ring, 571 
primitive root of unity, 20 
principal fcG-module, 552 
principal character 
see trivial character, 612 
principal derivation, 807 
principal ideal, 146 
principal ideal domain, 147 
principal ideal theorem, 989 
product 
categorical 

family of objects, 453 
two objects, 449 
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projections 
direct sum, 250 
direct sum of modules, 432 
product, 453 

projective dimension, 969 
projective limit (see inverse 
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projective module, 474 
projective plane, 779 
projective resolution, 813 
projective unimodular group, 
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projectively equivalent, 971 
proper 
class, 442 
divisor, 329 
ideal, 145 
subgroup, 63 
submodule, 428 
subset, 26 
subspace, 160 
Priifer, H„ 659 
Priifer group, 659 
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pure extension, 206 
pure subgroup, 257 
pure submodule, 663 
purely inseparable, 371, 776 
purely transcendental, 362 
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Pythagorean triple, 13 
primitive, 13 

quadratic field, 938 
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quaternions, 79, 81, 82 
division ring, 522 
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Quillen, D„ 477, 498 
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division algorithm 
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group, 84 
Lie algebra, 776 
module, 429 
ring, 182 
space, 170 
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^-isomorphism, 425 
R-linear combination, 428 
R-map, 424 
R-module, 423 
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maximal, 997 
Rabinowitch trick, 386 
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radical ideal, 383 
radical of ideal, 383 
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free group, 305 
free module, 472 
linear transformation, 181 
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rational functions, 129 
Razmyslov, Yu. P, 726 
realizes the operators, 790 
reduced abelian group, 658 
reduced basis, 418 
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Rees, D„ 980, 989, 999 
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Reiten, I., 572, 863 
relative Brauer group, 739 
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UFD, 331 
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mod G, 409 
repeated roots, 142 
representable functor, 518 
representation 
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group, 550 
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linear, 607 
regular, 607 
ring, 527 
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bar, 877 
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flat, 975 
free, 813 
injective, 814 
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representation, 628 
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division ring, 522 
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endomorphism ring, 521 
hereditary, 955 
Jacobson, 935 
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local, 326 
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semisimple, 552, 563 
simple, 559 
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Roiter, A. V., 572 
root 

multiplicity, 140 
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Seidenberg, A., 927 
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semigroup, 300 
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basis, 373 
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Shapiro, A., 884 
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signum, 48 
similar matrices, 177 
Simmons, G. J., 194 
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Smith normal form, 689 
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commutator, 284 
conjugate, 101 
cyclic, 64 
Frattini, 288 


fully invariant, 277 
Hall, 803 
normal, 76 
generated by X , 306 
normalizer, 101 
proper, 63 
pure, 257 
subnormal, 212 
Sylow, 269 
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